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Preface to the Sixth English Edition 


This sixth English edition is based on the fifth English edition (2007) and corresponds to the improved 
seventh (2008), eighth (2012) and ninth (2013) German edition. It contains all the chapters of the 
mentioned editions, but in a renewed, revised and extended form (see also the preface to the fifth English 
edition). 


Special new parts to be mentioned here are such supplementary sections as Geometric and Coordinate 
Transformations and Plain Projections in Chapter on Geometry, as Quaternions and Applications in 
Chapter on Linear Algebra, as Lie Groups and Lie Algebras in Chapter on Algebra and Discrete Math- 
ematics and as Matlab in Chapter on Numerical Analysis. The Chapter on Computer Algebra Systems 
is restricted to Mathematica only as a representative example for such systems. 


Extended and revised paragraphs are given in Chapter on Geometry about Cardan and Euler angles 
and about Coordinate transformations, in Chapter on Integral Calculus about Applications of Definite 
Integrals, in Chapter on Optimization about Evaluation Strategies, in Chapter on Tables about Natural 
Constants and Physical Units (System SI). The Index has been completed to an extent as in the previous 
German editions. 


We would like to cordially thank all readers and professional colleagues who helped us with their valu- 
able statements, remarks and suggestions on the German editions of the book during the revision pro- 
cess. Special thanks go to Mrs. Professor Dr. Gabriella Szép (Budapest), who made this English 
edition possible by valuable contributions and the basic translation into the English. Furthermore our 
thanks go to all co-authors for the critical treatment of their chapters. 


Dresden, December 2014 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 


Preface to the Fifth English Edition 


This fifth edition is based on the fourth English edition (2003) and corresponds to the improved sixth 
German edition (2005). It contains all the chapters of the both mentioned editions, but in a renewed 
revised and extended form. 


E 


Soin the work at hand, the classical areas of Engineering Mathematics required for current practice are 
presented, such as “Arithmetic”, “Functions”, “Geometry”, “Linear Algebra", “Algebra and Discrete 
Mathematics”, (including “Logic”, “Set Theory”, “Classical Algebraic Structures”, “Finite Fields”, 
“Elementary Number Theory”, ” Cryptology”, “Universal Algebra", “Boolean Algebra and Switch Al- 
gebra", “Algorithms of Graph Theory”, “Fuzzy Logic”), “Differentiation” , “Integral Calculus", “Dif- 
ferential Equations”, “Calculus of Variations”, “Linear Integral Equations”, “Functional Analysis” , 
“Vector Analysis and Vector Fields”, “Function Theory”, “Integral Transformations”, “Probability 
Theory and Mathematical Statistics". 


Fields of mathematics that have gained importance with regards to the increasing mathematical mod- 
eling and penetration of technical and scientific processes also receive special attention. Included 
amongst these chapters are "Stochastic Processes and Stochastic Chains" as well as "Calculus of Er- 
rors”, “Dynamical Systems and Chaos”, “Optimization”, “Numerical Analysis", “Using the Com- 
puter” and “Computer Algebra Systems". 


The Chapter 21 containing a large number of useful tables for practical work has been completed by 


VI 


| adding tables with the physical units of the International System of Units (SI). 
Dresden, February 2007 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 


From the Preface to the Fourth English Edition 


The “Handbook of Mathematics" by the mathematician, I. N. BRONSHTEIN and the engineer, K. A. 
SEMENDYAYEV was designed for engineers and students of technical universities. It appeared for the 
first time in Russian and was widely distributed both as a reference book and as a text book for colleges 
and universities. It was later translated into German and the many editions have made it a permanent 
fixture in German-speaking countries, where generations of engineers, natural scientists and others in 
technical training or already working with applications of mathematics have used it. 


On behalf of the publishing house Harri Deutsch, a revision and a substantially enlarged edition was 
prepared in 1992 by Gerhard Musiol and Heiner Mühlig, with the goal of giving " Bronshtein” the mod- 
ern practical coverage requested by numerous students, university teachers and practitioners. The 
original style successfully used by the authors has been maintained. It can be characterized as “short, 
easily understandable, comfortable to use, but featuring mathematical accuracy (at a level of detail 
consistent with the needs of engineers)”*. Since 2000, the revised and extended fifth German edition of 
the revision has been on the market. Acknowledging the success that “BRONSTEIN” has experienced 


in the German-speaking countries, Springer Verlag Heidelberg/Germany is publishing a fourth English 
edition, which corresponds to the improved and extended fifth German edition. 


The book is enhanced with over a thousand complementary illustrations and many tables. Special 
functions, series expansions, indefinite, definite and elliptic integrals as well as integral transformations 
and statistical distributions are supplied in an extensive appendix of tables. 


In order to make the reference book more effective, clarity and fast access through a clear structure 
were the goals, especially through visual clues as well as by a detailed technical index and colored tabs. 


An extended bibliography also directs users to further resources. 


Special thanks go to Mrs. Professor Dr. Gabriella Szép (Budapest), who made this English debut ver- 
sion possible. 
Dresden, June 2003 


Prof. Dr. GERHARD MUSIOL Prof. Dr. HEINER MUHLIG 


Co-Authors 


Some chapters or sections are the result of cooperation with co-authors. 


Chapter resp. section Co-author 

Spherical Trigonometry (3.4.1 — 3.4.3.3) Dr. H. NICKEL f, Dresden 

Spherical Curves (3.4.3.4) Prof. L. MARSOLEK, Berlin 

Geometric Transformations, Coordinate Trans- 
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*See Preface to the First Russian Edition 
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Using the Computer: (19.8.1, 19.8.2), Interactive 

System: Mathematica (19.8.4.2), Maple (19.8. 
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1 Arithmetics 


1.1 Elementary Rules for Calculations 


1.1.1 Numbers 
1.1.1.1 Natural, Integer, and Rational Numbers 


1. Definitions and Notation 
The positive and negative integers, fractions, and zero together are called the rational numbers. In 
relation to these the following notations are used (see 5.2.1, 1., p. 327): 
e Set of natural numbers: N={0, 1, 2,3,...}, 
e Set of integers: Z= {...,-2,-1,0,1,2,...}, 
e Set of rational numbers: Q= (x|v = P with ρε, qEZ and q #0}. 
q 


The notion of natural numbers arose from enumeration and ordering. The natural numbers are also 
called the non-negative integers. 

2. Properties ofthe Set of Rational Numbers 

e The set of rational numbers is infinite. 

e The set is ordered, i.e., for any two different given numbers a and b one can tell which is the smaller 
one. 

e The set is dense everywhere, i.e., between any two different rational numbers a and b (a < b) there 
is at least one rational number c (a < c < b). Consequently, there is an infinite number of other rational 
numbers between any two different rational numbers. 

3. Arithmetical Operations 

The arithmetical operations (addition, subtraction, multiplication and division) can be performed with 
any two rational numbers, and the result is a rational number. The only exception is division by zero, 
which is not possible: The operation written in the form a : 0 is meaningless because it does not have 
any result: If a Z 0, then there is no rational number b such that b- 0 = a could be fulfilled, and if a = 0 
then b can be any of the rational numbers. The frequently occurring formula a : 0 — oo (infinity) does 
not mean that the division is possible; it is only the notation for the statement: If the denominator 
approaches zero and, e.g., the numerator does not, then the absolute value (magnitude) of the quotient 
exceeds any finite limit. 

4. Decimal Fractions, Continued Fractions 

Every rational number a can be represented as a terminating or periodically infinite decimal fraction 
or as a finite continued fraction (see 1.1.1.4, p. 3). 

5. Geometric Representation 

Fixing an origin the zero point 0, a positive direction the orientation, and the unit of length | the 
measuring rule, (sce also 2.17.1, p. 115 and (Fig. 1.1)), then every rational number a corresponds to 
a certain point on this line. This point has the coordinate a, and it is a so-called rational point. The 
line is called the numerical axis. Because the set of rational numbers is dense everywhere, between two 
rational points there are infinitely many further rational points. 


B 
1 ο 8 2 8 FAL 
3 4 2 1 0 1 2 2 3 3 Α AK. o> 
φο hd 9 * T LJ 9 o > 0 1 2 3 
1-1 * 
Figure 1.1 Figure 1.2 
© Springer-Verlag Berlin Heidelberg 2015 1 


LN. Bronshtein et al., Handbook of Mathematics, 
DOI 10.1007/978-3-662-46221-8 1 


2 1. Arithmetics 


1.1.1.2 Irrational and Transcendental Numbers 


The set of rational numbers is not satisfactory for calculus. Even though it is dense everywhere, it does 
not cover the whole numerical axis. If for example the diagonal AB of the unit square rotates around 
A so that B goes into the point Κ΄, then K does not have any rational coordinate (Fig. 1.2). 
The introduction of irrational numbers allows to assign a number to every point of the numerical axis. 
In textbooks there are given exact definitions for irrational numbers, e.g., by nests of intervals. For this 
survey it is enough to note that the irrational numbers take all the non-rational points of the numerical 
axis and every irrational number corresponds to a point of the axis, and that every irrational number 
can be represented as a non-periodic infinite decimal fraction. 
First of all, the non-integer real roots of the algebraic equation 

x" pa, ar" b-4eRajgdoag-—0 (n 1, integer; integer coefficients), (1.1a) 


belong to the irrational numbers. These roots are called algebraic irrationals. 
W A: The simplest examples of algebraic irrationals are the real roots of z^ — a = 0 (a > 0), as 
numbers of the form 4/a , if they are not rational. 
BB: /2—1414... A ΥΠΟ = 2.154... are algebraic irrationals. 
The irrational numbers which are not algebraic irrationals are called transcendental. 
W A: 7 = 3.141592... e = 2.718281... are transcendental numbers. 
E B: The decimal logarithm of the integers, except the numbers of the form 10”, are transcendental. 
The non-integer roots of the quadratic equation 

x^-Fajz--ag — 0 (αι,ας integers) (1.1b) 
are called quadratic irrationals. They have the form (a + bv D)/c (a,b,c integers, c # 0; D > 0, 
square-free number). 
Β The division of a line segment a in the ratio of the golden section x/a = (a — x)/a (see 3.5.2.3, 3., 
p. 194) leads to the quadratic equation x? + x — 1 = 0, ifa = 1. The solution x = (v5 — 1)/2 is a 


quadratic irrational. It contains the irrational number v5. 


1.1.1.3 Real Numbers 


Rational and irrational numbers together form the set of real numbers, which is denoted by R. 


1. Most Important Properties 

The set of real numbers has the following important properties (see also 1.1.1.1, 2., p. 1). It is: 

e Infinite. 

e Ordered. 

e Dense everywhere. 

e Closed, i.e., every point of the numerical axis corresponds to a real number. This statement does 
not hold for the rational numbers. 


2. Arithmetical Operations 

Arithmetical operations can be performed with any two real numbers and the result is a real number, 
too. The only exception is division by zero (see 1.1.1.1, 3., p. 1). Raising to a power and also its 
inverse operation can be performed among real numbers; so it is possible to take an arbitrary root of 
any positive number; every positive real number has a logarithm for an arbitrary positive basis, except 
that 1 cannot be a basis. 

A further generalization of the notion of numbers leads us to the concept of complex numbers (see 1.5, 
p. 34). 

3. Interval of Numbers 

A connected set of real numbers with endpoints a and b is called an interval of numbers with endpoints 
a and b, where a < b and a is allowed to be —oo and b is allowed to be 4-oo. If the endpoint itself does 
not belong to the interval, then this end of the interval is open, in the opposite case it is closed. 
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An interval is given by its endpoints a and b, putting them in braces: A bracket for a closed end of the 
interval and a parenthesis for an open one. It is to be distinguished between open intervals (a,b), half- 
open (half-closed) intervals [a, b) or (a, b] and closed intervals [a,b], according to whether none of the 
endpoints, one of the endpoints or both endpoints belong to it, respectively. Frequently the notation 
Ja, b| instead of (a, b) for open intervals, and analogously [a, b[ instead of [a,b) is used. In the case of 
graphical representations, in this book the open end of the interval is denoted by a round arrow head, 
the closed one by a filled point. 


1.1.1.4 Continued Fractions 


Continued fractions are nested fractions, by which rational and irrational numbers can be represented 
and approximated even better than by decimal representation (see 19.8.1.1, p. 1002 and W A and MI 
B on p. 4). 


1. Rational Numbers p 1 
The continued fraction of a rational number is fi- q = ay + 1 - (3) 
nite. Positive rational numbers which are greater es 1 
than 1 have the form (1.2). For abbreviation a t ————— 1 - 
` p i m x "Sus 
the symbol Jm lao; a1, 02, ..., an] is used with ES 


a, 21 (k =1,2,...,n). 
The numbers a; are calculated with the help of the Euclidean algorithm: 


Msg d (o«2 κ). (1.3a 
q q q 
σος (0 ee 1), (1.3b 
Ti Ti Ti 
= ay d È (0 22 i, (1.86 
T2 T2 Το 
Tn-2 à, Tn (o< Ta < η). (1.3d 
Tn—1 Tn—1 Tn—1 
ma = απ (ΠΠ = 0). (1.3e 
1 1 1 
E e eu 24 ΕΞ 21 Τ [2;3, 1,6]. 
loeum 
T 6 


2. Irrational Numbers 


Continued fractions of irrational numbers do not break off. They are called infinite continued fractions 
with [ao; a1, aa, . . .]. 

If some numbers ax are repeated in an infinite continued fraction, then this fraction is called a periodic 
continued fraction or recurring chain fraction. Every periodic continued fraction represents a quadratic 
irrationality, and conversely, every quadratic irrationality has a representation in the form of a periodic 
continued fraction. 

E The number V2 = 1.4142135.. . is a quadratic irrationality and it has the periodic continued fraction 


representation 2 = [1;2,2,2,...]. 
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3. Aproximation of Real Numbers 
If a = [ao; a1, a2, . . .| is an arbitrary real number, then every finite continued fraction 


ay = [09;41,02,..., αρ] = Ë (1.4 
q 
represents an approximation of a. The continued fraction a, is called the k-th approximant of a. I 
can be calculated by the recursive formula 
Pk ακρκ--ι T Pk—2 
Qk (k > 1; pı = 1l, po = @; 9-1 = 0, Qo = 1). (1.5 
dk Qkdk—-1 T Qk—2 


According to the Liouville approximation theorem, the following estimat holds: 


a 
Seca (1.6 
dk 


ja — αἱ = |a 


Furthermore, it can be shown that the approximants approach the real number a with increasing ac- 
curacy alternatively from above and from below. The approximants converge to α especially fast if the 
numbers a; (i = 1,2,..., k) in (1.4) have large values. Consequently, the convergence is worst for the 
numbers [1;1,1,...]. 
W A: From the decimal presentation of 7 the continued fraction representation 7 = [3; 7, 15, 1, 292, . . .] 
follows with the help of (1.3a)-(1.3e). The corresponding approximants (1.5) with the estimate accord- 


1 á * 
z z 9. 1073, as τ with |r — a| < =% 9. 10™, 


~ 8- 107°. The actual errors are much smaller. They are less than 


22 
ing to (1.6) are: αι 7 with |r — o4| < 


355 ith | E 1 
= with |z — a: 

113 $3 713 
1.3: 107? for o4, 8.4- 107? for a; and 2.7- 1077 for a3. The approximants a1, o? and a3 represent better 
approximations for π than the decimal representation with the corresponding number of digits. 

ΒΒ: The formula of the golden section z/a = (a — x)/x (see 1.1.1.2, p. 2, 3.5.2.3, 3., p. 194 and 
17.3.2.4, 4., p. 908) can be represented by the following two continued fractions: x = afl; 1, 1, ...] and 


Q3 


rae (1+ V5) 50 (2:4,4,4,...]). The approximant ay delivers in the first case an accuracy of 
0.018 a, in the second case of 0.000 001 a. 


1.1.1.5 Commensurability 


Two numbers a and b are called commensurable, i.e., measurable by the same number, if both are an 
integer multiple of a third number c. From a = mc, b = nc (m,n € 7) it follows that 


a i 

572 (x rational). (1.7) 
b 
Otherwise a and b are incommensurable. 

W A: The length of a side and a diagonal of a square are incommensurable because their ratio is the 
irrational number V2. 

W B: The lengths of the golden section are incommensurable, because their ratio contains the irra- 
tional number V5 (see 1.1.1.2, p. 2 and 3.5.2.3, 3.,p. 194). Therefore the sides and diagonals in a 
regular pentagon are incommensurable (see ΒΒ in 3.1.5.3, p. 139). Today Hippasos from Metapontum 
(450 BC) is considered to have discovered the irrational numbers via this example. 


1.1.2 Methods for Proof 
Mostly three types of proofs are used: 
e direct proof, 

e indirect proof, 
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e proof by (mathematical or arithmetical) induction. 
Furthermore there are constructive proofs. 


1.1.2.1 Direct Proof 


The starting point is a theorem which has already been proven (premise p) and the truth of the state- 
ment of the new theorem is derived from it (conclusion q). The logical steps mostly used for the con- 
clusions are implication and equivalence (see 5.1, p. 323). 
1. Direct Proof by Implication 

The implication p => q means that the truth of the conclusion follows from the truth of the premise 
(see “ Implication " in the truth table, 5.1.1, p. 323). 


a+b 


W Prove the inequality > Vab fora > 0, b > 0. The premise is the well-known binomial formula 


(a+b)? = a? }- 2ab +b. By subtracting 4ab follows (a +b)? — 4ab = (a — b)? > 0. From this inequality 
the statement is obtained certainly if the investigations are restricted only to the positive square roots 
because of a > 0 and b > 0. 

2. Direct Proof by Equivalence 
The proof will be delivered by verifying an equivalent statement. In practice it means that all the 
arithmetical operations which have to be used for changing p into q must be uniquely invertible. 


, 1 
WB Prove the inequality 1+ a +a? +--+- +a” < I fr0<a<l. 
a 


Multiplying by 1 — a yields 1 — a + a — à? +a? — à? +- +a” — ant! = 1 — ant! «1. 
This last inequality is true because of the assumption 0 < a"*! < 1. The starting inequality also holds 
because all the arithmetical operations to be used are uniquely invertible. 


1.1.2.2 Indirect Proof or Proof by Contradiction 


To prove the statement q: Starting from its negation q, and from q arriving at a false statement r, i.e., 
q = r (see also 5.1.1, 7., p. 325). In this case q must be false, because using the implication a false 
assumption can result only in a false conclusion (see truth table 5.1.1, p. 323). If ὦ is false q must be 
true. 


α 
E Prove that the number v2 is irrational. Suppose, V2 is rational. So the equality J2= 7 holds for 


some integers a, b and b # 0. Assuming that the numbers a, b are coprime numbers, i.e., they do not 
2 


- , a : ; x: 
have any common divisor, then follows (/2)? = 2 = g! a? = 2b, therefore, a? is an even number, 


and this is possible only if a = 2n is an even number. Deducing a? = 4n? = 2b? holds, hence b must be 
an even number, too. It is obviously a contradiction to the assumption that a and b are coprimes. 


1.1.2.3 Mathematical Induction 


Theorems and dependent on natural numbers n are proven with this method. The principle of math- 
ematical induction is the following: If the statement is valid for a natural number no, and if from the 
validity of the statement for a natural number n > no the validity of the statement follows for n + 1, 
then the statement is valid for every natural number n > no. According to these, the steps of the proof 
are: 

1. Basis of the Induction: The truth of the statement is to be shown for n = no. Mostly no = 1 can 
be choosen. 

2. Induction Hypothesis: The statement is valid for an integer n (premise p). 

3. Induction Conclusion: Formulation the proposition for n + 1 (conclusion q). 

4. Proof of the Implication: p > η. 

Steps 3. and 4. together are called the induction step or logical deduction from n to n + 1. 
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1 1 1 n 
W Prove the formula sn Το σα 4^7 An £1) icum 
The steps of the proof by induction are: 
1 1 
1.n21: δι DS TET is obviously true. 
1 1 1 1 n 
2. Suppose sy, ia | 2.3 t 3.4 pe nth) = τι holds for ann > 1. 
. mE n+1 
3. Supposing 2. it is to show: $441 = . 
n+2 
1 1 1 1 1 1 
4. The proof: s, t t be | n d = 
i obl ο TUM σποτ n(n-1) (n+l(nt+2) "'(n4lI(n-2) 
n 1 n? -2n 1 (n 4- 1)? n+1 


n+1 = (n+1)(n+2) (n+ 1)(n+2)” (n+ (ας) n42° 
1.1.2.4 Constructive Proof 


In approximation theory, for instance, the proof of an existence theorem usually follows a constructive 
process, i.e., the steps of the proof give a method of calculation for a result which satisfies the proposi- 
tions of the existence theorem. 

W The existence of a third-degree interpolation-spline function (see 19.7.1.1, 1., p. 996) can be proved 
in the following way: It is to be shown that the calculation of the coefficients of a spline satisfying 
the requirements of the existence theorem results in a tridiagonal linear equation system, which has a 
unique solution (see 19.7.1.1, 2., p. 997). 


1.1.3 Sums and Products 


1.1.3.1 Sums 
1. Definition 


To briefly denote a sum the summation sign Y is used: 


n 
αι αρ 9... à, = >> ag. (1.8) 
k=1 
With this notation the sum of n summands ap (k = 1,2,...,n) is denoted, k is called the running 


index or summation variable. 
2. Rules of Calculation 
1. Sum of Summands Equal to Each Other , i.e., a, = a for k = 1,2,...,m: 


Sas = πα. (1.9a 
k=1 

2. Multiplication by a Constant Factor 
5 Cak = C 5 ak. (1.9b 
k=1 k=1 


3. Separating aSum 


n m n 
» αι δα X a (1 «πιςπ). (1.96 
k=1 k=1 k=m+1 


4. Addition of Sums with the Same Length 


n 


Σία. + be +e +...) = Mast Dobe + Voce +... (1.9d 
k=l k=l k=l 
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5. Renumbering 


n m+n—1L n—m-H 
5 ak = P» Ok—m41: 5 Gg — » Gk--m-1- (1.96) 
k=1 k=m k=m 


6. Exchange the Order of Summation in Double Sums 


Σ (X a) => (Soa). (1.99 |. 


k=1 


1.1.3.2 Products 


1. Definition 
The abbreviated notation for a product is the product sign |]: 


n 
αιαο... An = Il αν. (1.10) 
k=1 
With this notation a product of n factors a, (k = 1,2,...,n) is denoted, where k is called the running 
index. 


2. Rules of Calculation 


1. Product of Coincident Factors , 1.ο., ap = a for k = 1,2,...,m: 
Τα. =". (1.11a 
he 


2. Factoring out a Constant Factor 


II cak) = =e Ton (1.11b 
k=1 


3. Separating into Partial Products 


I ak = (Π «) l ῄ s) (1 «€ m « n). (1.11c 


k=m+1 


4. Product of Products 
Il dpe os. = (i D (i 7 (i α) Sess (1.11d 
k=1 k=1 k=1 k=1 


5. Renumbering 


n m-rn—1l n n—m-H 
Il ak = Il ο Il ak = Il Ok--m-1- (1.110) 
k=1 k=m k=m k=l 


6. Exchange the Order of Multiplication in Double Products 


I (ή 2 -TI (i 2 (1.111) 


k=1 


1.1.4 Powers, Roots, and Logarithms 
1.1.4.1 Powers 


The notation a” is used for the algebraic operation of raising to a power. The number a is called the 
base, x is called the exponent or power, and α΄ is called the power. Powers are defined as in Table 1.1. 


8 1. Arithmetics 


For the allowed values of bases and exponents there are the following 
Rules of Calculation: 


x 


αὖ αν = a9, at: a= Z za δ; (1.12 
αἱ 

a’ b = (αὐ), a:b = τ. = (2) ; (1.13 

(a^)! = (αν) = a", (1.14 

a* =e"™* (a> 0). (1.15 


Here Ina is the natural logarithm of a where e = 2.718281828459 ... is the base. Special powers are 


(-1)"= { μον (1.164) a? = 1 for any a #0. 1.16b 


—1, if n odd, ' 


Table 1.1 Definition of powers 


basea exponent x power a* 
0 1 
arbitrary real, Z0 | n = 1,2,3,... a" =a-a-a:...:q (a tothe power n) 
D Re a o 
n factors 
1 
n = —1,-2,-3,... | a” = — 
Qa" 
rational: ? at = fap 
ᾳ 
positive real (p, q integer, q > 0) | (q-th root of a to the power p) 
irrational: 
Dk Pk 
lim ΤΕ lim a% 
k>œ qk k—oo 
0 positive 0 


1.1.4.2 Roots 
According to Table 1.1 the n-th root of a positive number a is the positive number denoted by 

Va (a 0, real; n > 0, integer). (1.17a) 
This operation is called taking of the root or extraction of the root, a is the radicand, n is the radical or 
index. 
The solution of the equation 


x” =a (a real or complex; n > 0, integer) (1.17b) 
is often denoted by x = i/a. But there is no reason to be confused: In this relation the notation de- 
notes all the solutions of the equation, i.e., it represents n different values x, (k = 1,2,...,n) to be 


calculated. In the cace of negative or complex values they are to be determined by (1.140b) (see 1.5.3.6, 
p. 38). 


Β A: The equation x? = 4 has two real solutions, namely στο = +2. 


E B: The equation α) = —8 has three roots among the complex numbers: αι = 1 + iv3, £% = 
—2 and z3 = 1 — i3, but only one among the reals. 
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1.1.4.3 Logarithms 


1. Definition 
The logarithm u of a positive number x > 0 to the base b > 0, b Z 1, is the exponent of the power 
which has the value x with b in the base. It is denoted by u = log, x. Consequently the equation 


bY =x (1.184) yields log, x = u (1.18b) 
and conversely the second one yields the first one. In particular holds 


—oo for b> 1, 


+oo for b < 1. (1.180) 


log, 1 =0, log,b=1, log,0= { 
The logarithm of negative numbers can be defined only among the complex numbers. The logarithmic 
functions see 2.6.2, p. 73. 
To take the logarithm of a given number means to find its logarithm. To take the logarithm of an 
expression means it is transformed like (1.19a, 1.19b). The determination of a number or an expression 
from its logarithm is called raising to a power. 
2. Some Properties of the Logarithm 
a) Every positive number has a logarithm to any positive base, except the base b — 1. 
b) For z > 0 and y > 0 the following Rules of Calculation are valid for any b (which is allowed to be 
a base): 


log (xy) —logz--logy. log B = log z — logy, (1.19a) 
y 
, ; 1 
Ίορα” --π]ορα, in particular log Wz = —logz. (1.19b) 
n 


With (1.19a, 1.19b) the logarithm of products and fractions can be calculated as sums or differences of 
logarithms . 


3a? 9} 37? 3/ . i 
W Take the logarithm of the expression : vy : log : Vy — log (32? yy) — log (2zu?) 
2zu3 2zu? 


1 
= log3 + 2log x 4 3 8Y log 2 — log z — 3 log u. 
Often the reverse transformation is required, i.e., an expression containing logarithms of different amou- 
nts is to be rewritten into one, which is the logarithm of one expression. 


1 3a? 3 
E log3+ 2loga 4 3 logy log 2 — log z — 3 log u = log e 
zu 


c) Logarithms to different bases are proportional, i.e., the logarithm to a base a can be change into a 
logarithm to the base b by multiplication: 


log, x = M log,x where M = log, b = --- (1.20) 
log, a 

M is called the modulus of the transformation. 
1.1.4.4 Special Logarithms 
1. The logarithm to the base 10 is called the decimal or Briggsian logarithm, in formulas: 

ορια =lgx and log(xl0°) = a + log z. (1.21) 
2. The logarithm to the base e is called the natural or Neperian logarithm, in formulas: 

log, x = In x. (1.22) 
The modulus of transformation to change from the natural logarithm into the decimal one is 

M = loge = T 0.4342944819 , (1.23) 


In 10 
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and to change from the decimal into the natural one it is 


1 
Μι = a In 10 = 2.3025850930 . (1.24 


3. The logarithm to base 2 is called the binary logarithm, in formulas: 

logg;r —ldzr or logo; rz = lbz. (1.25 
4. The values of the decimal and natural logarithm can be found in logarithm tables. Some time ago 
thelogarithm was used for numerical calculation of powers, and it often made numerical multiplication 
and division easier. Mostly the decimal logarithm was used. Today pocket calculators and persona 
computers make these calculations. 
Every number given in decimal form (so every real number), which is called in this relation the antilog, 
can be written in the form 

x= $105 with 1 < $ «10 (1.26a 
by factoring out an appropriate power of ten: 10* with integer k. This form is called the half-logarithmic 
representation. Here 3 is given by the sequence of figures of x, and 10^ is the order of magnitude of 2. 
Then for the logarithm holds 

logz =k+log@ with O<logt<1, ie, logt=0,.... (1.26b) 
Here k is the so-called characteristic and the sequence of figures behind the decimal point of log ĉ is 
called the mantissa. The mantissa can be found in logarithm tables. 


W | 324 = 2.5105, the characteristic is 2, the mantissa is 5105. Multiplying or dividing this number 
by 10", for example 324000; 3240; 3.24; 0.0324, their logarithms have the same mantissa, here 5105, 
but different characteristics. That is why the mantissas are given in logarithm tables. In order to get 
the mantissa of a number zx first the decimal point has to be moved to the right or to the left to get a 
number between 1 and 10, and the characteristic of the antilog x is determined by how many digits k 
the decimal point was moved. 
5. Slide rule Beside the logarithm, the slide rule was of important practical help in numerical calcu- 
lations. The slide rule works by the principle of the form (1.19a), so multiplying and dividing is done 
by adding and subtracting numbers. On the slide rule the scale-segments are denoted according to the 
logarithm values, so multiplication and division can be performed as addition or subtraction (see Scale 
and Graph Papers 2.17.1, p. 115). 


1.1.5 Algebraic Expressions 
1.1.5.1 Definitions 
1. Algebraic Expression 


One or more algebraic quantities, such as numbers or symbols, are called an algebraic expression or 
term if they are connected by the symbols, +, —,-,:, ./, etc., as well as by different types of braces 
for fixing the order of operations. 
2. Identity 
is an equality relation between two algebraic expressions if for arbitrary values of the symbols in them 
the equality holds. 
3. Equation 
is an equality relation between two algebraic expressions if the equality holds only for a few values of 
the symbols. For instance an equality relation 

F(x) = f(x) (1.27) 
between two functions with the same independent variable is considered as an equation with one variable 
if it holds only for certain values of the variable. If the equality is valid for every value of x, it is called 
an identity, or one says the equality holds identically, written as formula F(x) = f(x). 
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4. Identical Transformations 

are performed in order to change an algebraic expression into another one if the two expression are 
identically equal. The goal is to have another form, e.g., to get a shorter form or a more convenient 
form for further calculations. Often it is of interest to have the expression in a form which is especially 
good for solving an equation, or taking the logarithm, or calculating the derivative or integral of it, etc. 


1.1.5.2 Algebraic Expressions in Detail 


1. Principal Quantities 

Principal quantities are those general numbers (literal symbols) occurring in algebraic expressions, ac- 
cording to which the expressions are classified. They must be fixed in any single case. In the case of 
functions, the independent variables are the principal quantities. The other quantities not given by 
numbers are the parameters of the expression. In some expressions the parameters are called coeffi- 
cients. 

W So-called coefficients occur e.g. in the cases of polynomials, Fourier series, and linear differential 
equations, etc. 

An expression belongs to a certain class depending on which kind of operations are performed on the 
principal quantities. Usually, the last letters of the alphabet x,y,z, u, v,... are used to denote the 
principal quantities and the first letters a,b, c,... are used for parameters. The letters m, n, p, ... are 
usually used for positive integer parameter values, e.g. for indices in summations or in iterations. 

2. Integral Rational Expressions 

are expressions which contain only addition, subtraction, and multiplication of the principal quantities, 
including powers of them with non-negative integer exponents. 

3. Rational Expressions 

contain also division by principal quantities, i.e., division by integral rational expressions, so principal 
quantities can have negative integers in the exponent. 

4. Irrational Expressions 

contain roots, i.e., non-integer rational powers of integral rational or rational expressions with respect 
to their principal quantities, of course. 

5. Transcendental Expressions 

contain exponential, logarithmic or trigonometric expressions of the principal quantities, i.e., there 
can be irrational numbers in the exponent of an expression of principal quantities, or an expression 
of principal quantities can be in the exponent, or in the argument of a trigonometric or logarithmic 
expression. 


1.1.6 Integral Rational Expressions 
1.1.6.1 Representation in Polynomial Form 


Every integral rational expression can be changed into polynomial form by elementary transformations, 
as in addition, subtraction, and multiplication of monomials and polynomials. 
E (—a? + 2a?x — 2?) (4a? + 8ax) + (αὖτ) + 2α2α7 + 4a?a? — daa*) 
= —4a? + 8a*x — 4a?x? — Batx + 16a?2? — 8ax* H 2a?3? — 4ax* — a? — 4a?a? + A4ax* 
5a? + 13a?z? — 2a?z? — Baxt. 


is 


1.1.6.2 Factoring Polynomials 

Polynomials often can be decomposed into a product of monomials and polynomials. To do so, factoring 
out, grouping, special formulas and special properties of equations can be used. 

Bl A: Factoring out: 8ax?y — 6bx?y? + Aca? = 2a?(4ay — 3bry? + 2613). 

E B: Grouping: 62? + xy — y? — 10xz — 5yz = 62? + 3xy — 2zy — y? — 1012 — 5yz = 3x(2x + y) — 
y(2x + y) — 5z(2x + y) = (2x + υ)(8α — y — 52). 
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E C: Using the properties of equations (see also 1.6.3.1, p. 43): P(x) = αὖ — 2x? + 41} + 22? — δα”. 


a) Factoring out x°. b) Realizing that a; = 1 and az = —1 are the roots of the equation P(x) = 0 and 
dividing P(x) by z?(—1)(z--1) = z!— x? gives the quotient 2? — 2: 4-5. This expression can no longer 
be decomposed into real factors because p = —2, q = 5, p?/4 — q < 0, so finally the decomposition is 


x9 — Qa? + dat + 213 — 5x? = z?(x — 1)(α + 1)(z? — 2x + 5). 
1.1.6.3 Special Formulas 


(zx y) =a? X2zxy y^, (1.28 
(z +y +z} = r? +y? + 2? +2ry + 2rz + 2yz, (1.29 
ο ο ο = yH HH Ha 
T2xy + 252 1-::' + 2ru + 2yz +: +2yu +: + 2tu, (1.30 
(z +y)? = a? + 327y + 3ry? + y’. (1.31 


The calculation of the expression (x + y)” is done by the help of the binomial formula (see (1.36a)— 
(1.37a)). 


(e+ y)(e@—y) = a? - 9, (1.32 
p^ — y" n—1 n—2 n—2 n—1 
ce πα γι τα Hy” (for integer n, and n > 1), (1.33 
r—y 
r+ y” . : 
: Z = gt gh Py tees ay”? ty (for odd n, and n > 1), (1.34 
uty 
z” — y” n-1 n—2 n-2 n-1l 
x a" Ty fee ary y (for even n, and n > 1). (1.35 
uty 
1.1.6.4 Binomial Theorem 
1. Power of an Algebraic Sum of Two Summands (First Binomial Formula) 
The formula 
(a | Du =aq"4 na?-!b L n(n — 1) n-2pp f n(n — Μία - 2) n3 
0 7 2 i 3! 
poss σος a Smt D) grm He nab" | +b” (1.36a) 
m! 
is called the binomial theorem, where a and b are real or complex values and n = 1,2,.... Using the 


binomial coefficients delivers a shorter and more convenient notation: 


(a+b)? = (o) a” + J a" b+ (2 a"? + (5 a" ess (, " jJ ab"! + (") b" (1.36b) 


(a - b)" =Y (7) a" *bF, (1.36c) 


k=0 : 


Or 


2. Power of an Algebraic Difference (Second Binomial Formula) 


(n—1 21: — 1)n— 2 
(a b)" - a? na^ ib | mn ) n-2 n(n ην ) 


Ε ές ( jani 1) ed (n -m+ 1) pp ped ( 1)"b" (1.37a) 


απ 3p3 


m! 
Or 
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(a — b^ = 5 (5 (-1y5an-k)*. (1.37b) 


k=0 


3. Binomial Coefficients 
The definition is for non-negative and integer n and k: 


H Lo o Πεμ (1.382) 

k (n — k)!k! x A 

where n! is the product of the positive integers from 1 to n, and it is called n factorial: 
n!=1-2-3-...-n, and by definition 0! = 1. (1.38b) 


The binomial coefficients can easily be seen from the Pascal triangle in Table 1.2. The first and the 
last number is equal to one in every row; every other coefficient is the sum of the numbers standing on 
left and on right in the row above it. 


Simple calculations verify the following formulas: 


n n n! n n n . 
πω ue asm (us (len (len am 
n+1i\_ (n n—1 n—2 

μη ο pe 

n+1 n+1 n . n n—kín 

( k )- cra) (1:394) e -u) si 
( 1 


Table 1.2 Pascal’s triangle 


n Coefficients 

0 1 

1 1 1 

2 1 2 1 

3 1 3 3 1 

4 1 4 6 4 1 

5 1 5 10 10 5 1 

6 1 6 15 20 15 6 1 

T t t T T T T 

6 6 6 6 6 6 6 6 

0 0 0 0 @ & @ 


For an arbitrary real value a (a € IR) and a non-negative integer k one can define the binomial coeffi- 


jon Q 
ceni $ 
k 


() . a(a — 1)(a — 2): (a — k 1) 
k! 


for integer k and k > 1, (2) =, (1.40) 
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3! 16° 
4. Properties of the Binomial Coefficients 
e The binomial coefficients increase until the middle of the binomial formula (1.36b), then decrease. 
e The binomial coefficients are equal for the terms standing in symmetric positions with respect to 
the start and the end of the expression. 
e The sum of the binomial coefficients in the binomial formula of degree n is equal to 2” . 
e The sum of the coefficients at the odd positions is equal to the sum of the coefficients at the even 
positions. 


A0 
ο» 

Nie 

μιλώ 
| 

bho | 

— 

wie 

= 

= 

pz 

I 

two 

> 
| 

σι 


5. Binomial Series 


The formula (1.36a) of the binomial theorem can also be extended for negative and fraction exponents. 
If |b| < a, then (a + b)" has a convergent infinite series (sce also 21.5, p. 1057): 


(n—1 2,2 , n(n — 1)(n — 2 at 
(a j b)" = a? na^ b a ) n3 n(n am ) -3g befits (1.41) 


1.1.6.5 Determination of the Greatest Common Divisor of Two 
Polynomials 


It is possible that two polynomials P(x) of degree n and Q(x) of degree m with n > m have a common 

polynomial factor, which contains 2. The least common multiple of these factors is the greatest common 
divisor of the polynomials. 

B P(x) = (x -- 1)}(α — 2)(x — 4), Q(x) = (x — 1)(x — 2)(x — 3); the greates common devisor is 
(x — 1)(x — 2). 

If P(x) and Q(x) do not have any common polynomial factor, they are called relatively prime or coprime. 
In this case, their greatest common divisor is a constant. 


The greatest common divisor of two polynomials P(x) and Q(x) can be determined by the Euclidean 
algorithm without decomposing them into factors: 
1. Division of P(x) by Q(x) = Ro(x) results in the quotient Tı (x) and the remainder Rı (x): 


P(x) = Q(x)Ti(x) + Ry (x). (1.42a) 
2. Division of Q(x) by R(x) results in the quotient 15 (11) and the remainder f(x): 
Q(x) = Ri(x)To(x) + Ra(x). (1.42b) 


3. Division of R(x) by Rə(x) results in T3(r) and R(x), etc. The greatest common divisor of the 
two polynomials is the last non-zero remainder Γκ). This method is known from the arithmetic of 
natural numbers (see 1.1.1.4, p. 3). 


The determination of the greatest common divisor can be used, e. g., when equations musz be solved 
to separate the roots with higher multiplicity or to apply the Sturm method (see 1.6.3.2, 2., p. 44). 


1.1.7 Rational Expressions 


1.1.7.1 Reducing to the Simplest Form 
Every rational expression can be written in the form of a quotient of two coprime polynomials. To do 
this, only elementary transformations are necessary such as addition, subtraction, multiplication and 
division of polynomials and fractions and simplification of fractions. 

22 +y 
We lom μαμὰ 


; 1, 9^7 
x G + =) ‘ 
22 


W Find the most simple form of 
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1.1.7.2 Determination of the Integral Rational Part 


A quotient of two polynomials with the same variable x is a proper fractionif the degree of the numerator 
is less than the degree of the denominator. In the opposite case, it is called an improper fraction. Every 
improper fraction can be decomposed into a sum of a proper fraction and a polynomial by dividing the 
numerator by the denominator, i.e., separating the integral rational part. 
3x4 — 10aa? + 22a?a? — 24a?x + 10a4 

x? — 2ax + 3a? : 


W Determine the integral rational part of R(x) 


. m i ; P . . =) 3r = 4 
(31—10a2?--22a?3? —24a?x --10a*) : (x? — 2ax + 3a?) = 3x? — 4ax + 5a? + x mm 
3r1— 6ax?4- 9a2x2 α΄ — 2α2 — 3a 
— 4aa?--13a?a? —24a? x 
— 4ax?-- 8a?3?—12a?a 
5a?1?—12a?x +10a* 


—2a?x — δα. 
a? — 2ax + 3a?’ 
The integral rational part of a rational function R(x) is considered to be as an asymptotic approximation 
for R(x) because for large values of |x|, the value of the proper fraction part tends to zero, and R(x) 
behaves as its polynomial part. 


— 9ghg— ὅρα. R(x) = 3x? — 4ax + 5a? 4 


1.1.7.3 Partial Fraction Decomposition 
Every proper rational fraction 

P(x) απο" + Gnu" + +++ + αια + αρ 
Q(x) ba" + by, x71 $+ + bya + bo 


with coprime polynomials in the numerator and denominator can be uniquely decomposed into a sum of 


R(x) = (n <m) (1.43) 


partial fractions. The coefficients αρ, a1,..., an, bo, 01, . .. , bn are real or complex numbers. The partial 
fractions have the form 
A Di+E 2 
a (L44a) and ——“t*" _ where (5) get (1.44b) 
(x — a)* (x? + px + q)" 2 


In the followings real coefficients are assumed in R(x) in (1.43). 

First the leading coefficient b,,, of the denominator Q(x) is transformed into 1 by dividing the numerator 
and the denominator of (1.43) by the original value of bm. In the case of real coefficients the following 
three cases are to be distinguished. 

In the case of complex coefficients in R(x) only the first two cases can occur, since complex polynomials 
can be factorized into a product of first degree polynomials. Every proper rational fraction R(x) can 
be expanded into a sum of fractions of the form (1.44a), where A and α are complex numbers. 


1. Partial Fraction Decomposition, Case 1 
The denominator Q(x) has m different simple roots αι... ,Qm. Then the expansion has the form 
P(x αμα deca A A A, 
( ) m n * 0 1 2 ο ate m (1.452) 


Q(z) (x—o)(r-—os...(r—-o&) z-a T-a f— Am 
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with coefficients 


P(ay) P(ag) P(am) 
A , Ag = ———— ; 1.451 
= Oa)’ ^7 σα) Qa, n 
. _,. dQ 
where in the numerators of (1.45b) the values of the derivative are taken for £ = o4, © = a»,.... 
1 
6z7-x+1 A B σ 
ποπ πμ ee ee iE 
; , P(0) P(1) P(-1) 
P(x) = 6x? — x +1, Q(x) 2-332 —1, A LB 3and C — — 4, 
md à 10) aw) Q1) 
P(x) 1, 3 , 4 
Q(x) T r—1 «+1 
An other possibility to determine the coefficients Αι, A5,... , Am is the method of comparing coeffi- 
cients (see 4., p. 17). 
2. Partial Fraction Decomposition, Case 2 
The denominator Q(x) has l multiple real roots a), a2,...,a, with multiplicities k;, ko, ..., kj respec- 
tively. Then the decomposition has the form 
P(x) αμα" + a, 42771 +--+ +a A, Ap TN Ak, 
Q(r) (x-a) (z-a)... (x-a) r-a (r-a) | (y — a)" 
Bı By By, Ly, 
—— des Eee : 1.46 
TS — 05 a (x — a3)? is (x — ay)" T (x — oj) ( ) 
21-51 Ay B Bo Bs 


σ(α — 1)? z md | (s — y i m- The coefficients A1, B1, B», B3 can be determined 


by the method of comparing coefficients. 
3. Partial Fraction Decomposition, Case 3 
If the denominator Q(x) has also complex roots, then its factorization is 


Q(x) = (r— o4)^ (x — αὐ) ΠΕ. oj)" 
(2? + pix + n)" (2? + 2px + a3)? -+ (2? + pe + qr)™ (1.47) 


according to (1.168), p. 44. Here 04,05, ..., αι are the | real roots of polynomial Q(x). Beside these 
roots Q(x) has r complex conjugate pairs of roots, which are the roots of the quadratic factors z? — 


AA 
pit + qi (1 —1,2,....r). The numbers p;, q; are real, and (5) — q; « 0 holds. In this case the partial 


fraction decomposition has the form 


P(x) | αμα αμ 127} +--+ ax ++ αρ 
Q(x) (a — a)! (a — az)? +++ (x? + pix + qi)" (a? + pox + qo) ++ 
= Αι NS A» | TX ἥ Ax, +4 Bi “5 By SE ἥ Biz | oe 
z-a (x—o (z-a) z-a (α-αχ)} (x — ag) 
Cız + D , Cox + Do = Cui £d Da 
urprc-qoG-prc-qy- (a? + pir qu) | 
! E+ F, TM Fox + Fy US Emt Fm; TN (1.48) 
T? + paz τα (a? + pox + qa? (a? + pav + qo)" 
5a? — 4r + 16 A Ciz + D Cox + D 
E = = = + EET t — z- The coefficients A, C1, D1, C2, D» are 


(r—3)(z3?— 1? r—-3 2-241 (α-αΕ1) 
to be determined by the method of comparing coefficients. 
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4. Method of Comparing Coefficients 

In order to determine the coefficients ΑἹ, A», . . . , Κι, F1... in (1.48) the expression (1.48) has to be mul- 
tiplied by Q(x), then the result Ζ(α) is compared with P(x), since Z(r) = P(x). After ordering Z(x) 
by the powers of x, one gets a system of equations by comparing the coefficients of the corresponding 
x—powers in Z(r) and P(x). This method is called the method of comparing coefficients or method of 


undetermined coefficients. 
ας etl A B σ A(x? -Ὁ + Βα(α +1) τοπία 1) = 


a3—3 r α-1 «+1 x(x? -- 1) 
Comparing the coefficients of the same powers of x, one gets the system of equations 6 = A+B+C, 
—1 = B — C , 1 = —A, and its solutions are A = —1, B = 3,C = 4. 


1.1.7.4 Transformations of Proportions 
The equality 
c= 5 (1.49a) yields ad = be, 4 E 4 5 LM (1.49b) 
G 


b d j c d b a a 


and furthermore 


actb ctd atb ctd ate b+d a+b c+d 


= = , = ; = : 1.49c 
b d. σα ο Ἰ c d α-ὐὂ c-d το 
From the equalities of the proportions 
Q1 Q2 An, f αι 1-49 1-’'' n 
Sata. a 1.50a vs the DU 
nc n (1.50a) it follows that cheek, b (1.50b) 


1.1.8 Irrational Expressions 

Every irrational expression can be written in a simpler form by 1. simplifying the exponent, 2. taking 
out terms from the radical sign and 3. moving the irrationality into the numerator. 

1. Simplifying the Exponent The exponent can be simplified if the radicand can be factorized 
and the index of the radical and the exponents in the radicand have a common factor; the index of the 
radical and the exponents must be divided by their greatest common divisor. 


L| 16x? -- 2xH + z") = ye -αδ(ᾳ — 1)? = 408 (a edi 


2. Moving the Irrationality There are different ways to move the irrationality into the numerator. 


HA: ui - Su HB. gr 
|z “VG 4yz? 


C: A EE ui Vu 
L- 3 Ν 
ng Ec uy 
D: 1 = i? — zy Vy LU -cyy VP 
CoU (2+ v9) (2 - eg + Vv) w+ y 
3. Simplest Forms of Powers and Radicals Also powers and radicals can be transformed into 
the simplest form. 


ΒΛ: 81α6 9x3 3r /v 3x /z(y/2-- az) — 3x 2x + 32? 
ένο yap (V2— yr} νὸ- να 2-r BEI 
B: (Vz + YT + a3 + Va) (ντ t+ Ve Vas) (x 1/24 72/34 3/44 ο (a1/2— 1/34 


ples = qa ο 4 18/12 75/6 — g 18/12 411/12 | 43/4 | χ11/13 gp 4,5/6. 411/12. 18/12 
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7/6 -r= q5/4 - q19/12 — χ1/12 4 73/4 = Vx - ZSE = Wall dje Yr? = a (1 11/6 11/9 | gi?) — 
Va3 (1— Ya — YT + y). 

1.2 Finite Series 

1.2.1 Definition of a Finite Series 


'The sum 


n 
Sn = ao + a1 +g +++ + an = Mai (1.51) 
i=0 
is called a finite series. The summands a; (i = 0,1,2,...,n) are given by certain formulas, they are 


numbers, and they are the terms of the series. 


1.2.2 Arithmetic Series 
1. Arithmetic Series of First Order 


is a finite series where the terms form an arithmetic sequence, i.e., the difference of two terms standing 
after each other is a constant: 


Aa; = aj4; — a; = d = const holds, so a; = ao id. (1.52a) 
Thus holds: 

Sn = ao + (ao + d) + (ao + 2d) +--+ + (ao + nd) (1.52b) 

Ἶ » 1 
Sn = ara (n+1)= Lis (2ag + nd). (1.52c) 
2 2 

2. Arithmetic Series of k-th Order 
is a finite series, where the k-th differences A^a; of the sequence αρ, a1, a2, ...,a,, are constants. The 
differences of higher order are calculated by the formula 

A"aj = A" lai, — Δ la; (v = 2,3,...,k). (1.53a) 


It is convenient to calculate them from the difference schema (also difference table or triangle schema): 


ao 

Aao 
a A?ag 

Aa, Ae ao 
az A?a, EN Άλαρ 

Aa» Aa, 
a3 A?a5 “Aa, 07. (1.53b) 

A" ag 
k 
A Gn—k 
3 : 
A 5-3 
Aa, 5 

Aa, a 

απ 


The following formulas hold for the terms and the sum: 


ai = ao + B Aag + (2) Aao + + (;) Af ao (521,2, 4m) (1.53c) 


dl ( t ') ao + (" T ') Aag + (" i Ἴ Mages (; Mi ) Akag. (1.53d) 


: 2 
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1.2.3 Geometric Series 


The sum (1.51) is called a geometric series, if the terms form a geometric sequence, i.e., the ratio of two 


successive terms is a constant: 
Qi+1 


=q = const holds, so a; = αρα. (1.54a 
αι 
Thus holds: 
F n+l . 1 
Sn = ao + agq + ao^ + -< + aod" = αρ af for q#1l, (1.54b 
Sn = (n+ 1)aọ for q=1. (1.54c 
For n — oo (see 7.2.1.1, 2., p. 459), there is an infinite geometric series, which has a limit if |g| « 1, 
and this limit is called sum s: 
ΜΉΝ (1.514 
=q 
1.2.4 Special Finite Series 
1 
14+24+3+---+(n—-1)+n mn ) (1.55 
n+1)(2p+ 
(epi ο nee = n n) (1.56 
14+345+---+(2n—3)+(2n-1) 2 m? , (1.57 
2-4 6... - (2n - 2) - 2n 2 n(n - 1), (1.58 
P : , +1)(2n+1 
Prag F(n—1? +n ak , E Ἢ (1.59 
M πμ n? (n +1)? 
13.4.23 4.33 + F(n— 1? +n? B me ) ; (1.60 
, F 4n? —1 
1? 32 5? 4... (2n - 1? B E ) (1.61 
1? +33 +5 +---+(2n 2 1? = n?(2n? — 1), (1.62 
44244344 ee n(n + 1)(2n + 1)(3n? + 3n 1). (1.63 
30 
1- (η Du ua 
14 2x + 32° 4 Hnr” (n ü M = (x z 1). (1.64 
1.2.5 Mean Values 
(See also 16.3.4.1, 1., p. 839 and 16.4, p. 848) 
1.2.5.1 Arithmetic Mean or Arithmetic Average 
The arithmetic mean of the n quantities a1, a2, ..., a, is the expression 
αι Ἔαο cra 12 
ΦΑ - 1 3 ^ E 5 Ak . (1.65a) 
k=1 
For two values a and b holds: 
b 
m M (1.65b) 


The values a, x4 and b form an arithmetic sequence. 
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1.2.5.2 Geometric Mean or Geometric Average 
The geometric mean of n positive quantities a1, d2,..., a, is the expression 


E] 
tq = a102... ân = (ü «) , (1.664) 
k=1 


=] For two positive values a and b holds 
za = Vab. (1.66b) 


The values a, zc; and b form a geometric sequence. 
If a and b are given line segments, then a segment 
with length zc = Vab can be given by the help of 
one of the constructions shown in Fig. 1.3a or in 
Fig. 1.3b. 

A special case of the geometric mean is given by di- 
viding a line segment according to the golden sec- 
tion (see 3.5.2.3, 3., p. 194). 


Figure 1.3 


1.2.5.3 Harmonic Mean 


The harmonic mean of n quantities a1, 02,..., an (a; Æ 0;i = 1,2,...,n) is the expression 
1,1 1 jd [a4 21> 
n= [i ΠΣ | $^ | (1.67a) 
na, αν αμ n E Ük 
For two values a and b holds 
σα. Xx 2ab 
ty = |= (-+— ο ty= 5 1.671 
oe E (; Ἰ dix a+b ( ) 
1.2.5.4 Quadratic Mean 
The quadratic mean of n quantities a1, a5,. .. , an is the expression 
1 7 5 . 126 3 
xro =  —(ai? + ag? ra) = .|— Σ) αἴ. (1.68a) 
n T 
For two values a and b holds 
a? +b? 
To = 2 (1.68b) 


The quadratic mean is important in the theory of observational error (see 16.4, p. 848). 


1.2.5.5 Relations Between the Means of Two Positive Values 


a+b 2ab a? +b? 
For #4 = z’ πα Vab, TH appo TQ 2 we have 


1. ifa < b, then 

a< ty «vg «mcT4«mc:Q«b, (1.094) 
2. ifa = b, then 

a= gra = tg = ty =t =b. (1.69b) 
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1.3 Business Mathematics 

Business calculations are based on the use of arithmetic and geometric series, on formulas (1.52a)— 
(1.52c) and (1.54a)-(1.54d). However these applications in banking are so varied and special that a 
special discipline has developed using specific terminology. So business arithmetic is not confined only 
to the calculation of the principal by compound interest or the calculation of annuities. It also includes 
the calculation of interest, repayments, amortization, calculation of instalment payments, annuities, 
depreciation, effective interest yield and the yield on investment. Basic concepts and formulas for cal- 
culations are discussed below. For studying financial mathematics in detail, you will have to consult 
the relevant literature on the subject (see [1.2], [1.8]). 


Insurance mathematics and risk theory use the methods of probability theory and mathematical statis- 
tics, and they represent a separate discipline, so they don't be discussed here (see [1.4], [1.5]). 


1.3.1 Calculation of Interest or Percentage 
1.3.1.1 Percentage or Interest 


The expression p percent of K means πώ where K denotes the principal in business mathematics. 
The symbol for percent is 96, i.e., the following equalities hold: 


p% = T or 1% = 0.01. (1.70) 


1.3.1.2 Increment 
If K is raised by p%, the increased value is 


t k(14 4. 
K=K (1+ E) (1.71) 


Relating the increment Kx to the new value K, the proportion is Kx :K= p : 100, so K contains 


p- 100 
100 + p 
percent of increment. 


W Ifan article has a value of € 200 and a 15% extra charge is added, the final value is € 230. This price 
15 - 100 


115 
1.3.1.3 Discount or Reduction 


Reducing the value K by p% rebate yields the reduced value 


p= (1.72) 


contains p = = 13.04 percent increment for the user. 


K=K (- i) (1.73) 


Comparing the reduction KW to the new value K gives 


pa 5. (1.74) 


percent of rebate. 
W If an article has a value € 300, and they give a 10% discount, it will be sold for € 270. This price 


10 100 
contains p — ερ 11.11 percent rebate for the buyer. 
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1.3.2 Calculation of Compound Interest 
1.3.2.1 Interest 


Interest is either payment for the use of a loan or it is a revenue realized from a receivable. For a principal 
K, placed for a whole period of interest (usually one year) 


-> P 
K 1.75 
100 (1.75) 
interest is paid at the end of the period of interest. Here pis the rate of interest for the period of interest, 
and one says that p% interest is paid for the principal K. 


1.3.2.2 Compound Interest 

Compound interest is computed on the principal and on any interest earned that has not been paid or 
withdrawn. It is the return on the principal for two or more time periods. The interest of the principal 
increased by interest is called compound interest. 

In the following different cases are discussed which depend on how the principal is changing. 

1. Single Deposit 

Compounded annually the principal K increases after n years up to the final value K,. At the end of 
the n-th year this value is: 


p n 
K, κο) 1.76 
n 100 ( ) 
For a briefer notation the substitution 1 + πη = q is used and q is called the accumulation factor or 


growth factor. 
Interest may be compounded for any period of time: annually, half-annually, monthly, daily, and so on. 
Dividing the year into m equal interest periods the interest will be added to the principal K at the end 


of every period. Then the interest is K 100m for one interest period, and the principal increases after 
m 


n years with m interest periods up to the value 


` - p mn 
Kea ak 1 : 1. 
5 ( τ τση) (1-77) 


The quantity (1 + is) is known as the nominal rate, and (1 + us) as the effective rate. 
100 100m. 

ΒΛ principal of € 5000, with a nominal interest 7.2% annually, increases within 6 years a) com- 

pounded annually to Kg = 5000(1-4-0.072)9 = € 7588.20, b) compounded monthly to Κυ = 5000(1 + 

0.072/12)" = € 7691.74. 


2. Regular Deposits 
Suppose depositing the same amount F in equal intervals. Such an interval must be equal to an interest 


period. The depositions can be made at the beginning of the interval, or at the end of the interval. At 
the end of the n-th interest period the balance K, is 


a) Depositing at the Beginning: b) Depositing at the End: 
n..] n] 
Ky, = Egl—. (1.78a) ilg (1.78b) 
q-1 g-1 


3. Depositing in the Course of the Year 

A year or an interest period is divided into m equal parts. At the beginning or at the end of each of 
these time periods the same amount E is deposited and bears interest until the end of the year. In this 
way, after one year the balance K; is 
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a) Depositing at the Beginning: b) Depositing at the End: 
m+1) m — 1) 
ΚΒ |n + d l (1.79a) K =F In 4 Dd . (1.79b) 


In the second year the total K, bears interest, and further deposits and interests are added like in the 
first year, so after n years the balance K, for midterm deposits and yearly interest payment is: 


a) Depositing at the Beginning: b) Depositing at the End: 
: (m+1)p| q^—1 ; (m—1)p| q^ —1 
Κα = E|md4 . (1.802 K,=E + : 1.80] 
x E 200 Jci (1.80a) K m 200 "me (1.80b) 


Β At a yearly rate of interest p = 5.2% a depositor deposits € 1000 at the end of every month. After 
how many years will it reach the balance € 500 000? 


From (1.80b), for instance, from 500 000 — 1000 [5 + 
22.42 years. 


11: =] 1.052" — 1 
200 0.052 


, follows the answer, n = 


1.3.3 Amortization Calculus 


1.3.3.1 Amortization 


Amortization is the repayment of credits. The assumptions: 

1. For a debt S the debtor is charged at p% interest at the end of an interest period. 

2. After N interest period the debt is completely repaid. 

The charge of the debtor consists of interest and principal repayment for every interest period. If the 
interest period is one year, the amount to be paid during the whole year is called an annuity. 

There are different possibilities for a debtor. For instance, the repayments can be made at the interest 
date, or meanwhile; the amount of repayment can be different time by time, or it can be constant during 
the whole term. 


1.3.3.2 Equal Principal Repayments 


The amortization instalments are paid during the year, but no midterm compound interest is calcu- 
lated. The following notation should be used: 
e S debt (interest payment at the end of a period with p%), 


5 -- 

e T = —— principal repayment (T = const), 
ml 

e m number of repayments during one interest period, 

e N number of interest periods until the debt is fully repaid. 

Besides the principal repayments the debtor also has to pay the interest charges: 

a) Interest Z,, for the n-th Interest Period: b) Total Interest Z to be Paid for a Debt 

S, mN Times, During N Interest Periods 

with an Interest Rate p% : 

Zn 


ps | 1 ( m+ 3! 

= 1 zn . (1.81a) N pS IN—1 1 

100 N 2m Fy Z a=? | m+ iki 
2 ” 100 2 2m |. (181b) 


n=1 


E A debt of € 60000 has a yearly interest rate of 8%. The prin- l. year: Zı = €4360 
cipal repayment of € 1000 for 60 months should be paid at the 2. year: Zə= € 3400 
end of the months. How much is the actual interest at the end of 3. year: Z3 = € 2440 
each year? The interest for every year is calculated by (1.814) with 4. year: Ζι-- € 1480 
S = 60000, p = 8, N = 5 and m = 12. They are enumerated in 5. year: Zs € 520 
the annexed table. Z= €12200 
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8- 60000 [5-1 13 
100 È | | € 12200. 


The total interest can be calculated also by (1.519) as Z = 24 


1.3.3.3 Equal Annuities 


For equal principal repayments T = 


- the interest payable decreases over the course of time (see 


the previous example). In contrast to this, in the case of equal annuities the same amount is repaid 
for every interest period. A constant annuity A containing the principal repayment and the interest is 
repaid, i.e., the charge of the debtor is constant during the whole period of repayment. 

With the notation 

e S debt (interest payment of p% at the end of a period), 

ο A annuity for every interest period (A const), 

ο a one instalment paid m times per interest period (a const), 


ος - 1 -- in the accumulation factor, 


after n interest periods the remaining outstanding debt S, is: 


(1.82) 


— 1 n. 1 
S,=Sqr-—a E (m * 4 


200 q-1- 
Here the term Sq" denotes the value of the debt S after n interest periods with compound interest 


(see (1.76)). The second term in (1.82) gives the value of the midterm repayments a with compound 
interest (see (1.80b) with E — a). For the annuity holds 


(m — 1)p 
200 i 


A=a |n t (1.83) 


Here paying A once means the same as paying a m times. From (1.83) it follows that A > ma. Because 


after N interest periods the debt must be completely repaid, from (1.82) for Sy = 0 considering (1.83) 
for the annuity holds: 


wank q-1 


A-S ; 
ΕΤ ΤΝ 


(1.84) 


To solve a problem of business mathematics, from (1.84), any of the quantities A,S,q or N can be 
expressed, if the others are known. 

W A: A loan of € 60000 bears 8% interest per year, and is to be repaid over 5 years in equal instalments. 
How much is the yearly annuity A and the monthly instalment a? From (1.84) and (1.83) we get: 


; 15027. 
A= 60 ooo 28 — = € 15027.39, a = RI ki = € 1207.99. 
1 


~ 1.085 200 
WB B: A loan of S = € 100000 is to be repaid during N = 8 years in equal annuities with an interest rate 
of 7.5%. At the end of every year € 5000 extra repayment must be made. How much will the monthly 


instalment be? For the annuity A per year according to (1.84) follows A = 100000 -- 


11.7.5 
200 


at the end of the year, from (1.83) A — a [12 H 
charge is a = € 972.62. 


| + 5000 = 17072.70 holds, so the monthly 
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1.3.4 Annuity Calculations 
1.3.4.1 Annuities 


If a series of payments is made regularly at the same time intervals, in equal or varying amounts, at the 
beginning or at the end of the interval, it is called annuity payments. Το distinguish are: 

a) Payments on an Account The periodic payments, called rents, are paid on an account and bear 
compound interest. Therefore the formulas of 1.3.2 are to be used. 

b) Receipt of Payments The payments of rent are made from capital bearing compound interest. 
Here the formulas of the annuity calculations in 1.3.3 are to be used, where the annuities are called 
rents. If no more than the actual interest is paid as a rent, it is called a perpetual annuity. 

Rent payments (deposits and payoffs) can be made at the interest terms, or at shorter intervals during 
the period of interest, i.e. in the course of the year. 


1.3.4.2 Future Amount of an Ordinary Annuity 
The date of the interest calculations and the payments should coincide. The interest is calculated at 
p% compound interest, and the payments (rents) on the account are always the same, R. The future 
value of the ordinary annuity Rn, i.e., the amount to which the regular deposits increase after n periods 
amounts to: 
qr 1 . p 
RR with q=1+-—_. 1.85 

cH η q tii (1.85) 
The present value of an ordinary annuity Ro is the amount which should be paid at the beginning of the 
first interest period (one time) to reach the final value R,, with compound interest during n periods: 


Rn ; p 
Roa th q=1+—. 
σφ Ws ΚΡ 


W A man claims € 5000 at the end of every year for 10 years from a firm. Before the first payment 
the firm declares bankruptcy. Only the present value of the ordinary annuity Rp can be asked from the 
administration of the bankrupt's estate. With an interest of 4% per year the man gets: 
1,g4^—-1 1-47 1— 1.047" 

L R1 = 8000 — € 40554.48. 
œ q-—1 q-1 0.04 


(1.86) 


Ro= 


1.3.4.3 Balance after n Annuity Payments 
For ordinary annuity payments capital K is at our disposal bearing p% interest. After every interest 
period an amount r is paid. The balance K, after n interest periods, i.e., after n rent payments, is: 


n..] 4 
Ky = K- Ry = Kq^ d = with q=1+ a , (1.87a) 
Conclusions from (1.87a): 
c= Ke (1.87b) Consequently K, = K holds, so the capital does not change. This 
is the case of perpetual annuity. 
r> KG (1.87c) The capital will be completely used up after N rent payments. 
From (1.872) it follows for Ky = 0: 
N 
" rq'—1 
K=— , 1.87d 
κ gi (1.87d) 


If midterm interest is calculated and midterm rents are paid, and the original interest period is divided 
into m equal intervals, then in the formulas (1.85)—(1.87a) n is replaced by mn and accordingly q = 


p p 
14 Γ zd . 
" 100 vs = 100m 
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W What amount must be deposited monthly at the end of the month for 20 years, from which a rent of 
€ 2000 should be paid monthly for 20 years, and the interest period is one month with an interest rate 
of 0.5%. 
From (1.87d) follows for n = 20-12 = 240 the sum K which is necessary for the required payments: 
K LLL UNE iie € 279161.54. TI thly d its R i by (1.85) 
C= E .54. The necessary monthly deposits R are given by (1.85): 
1.0052 0.005 ; y VERB ου S 


1.00540 —1 . 
Rao = 279161.54 = R ooog e R=€ 604.19. 


1.3.5 Depreciation 
1.3.5.1 Methods of Depreciation 


Depreciation is the term most often used to indicate that assets have declined in service potential in 
a given year either due to obsolescence or physical factors. Depreciation is a method whereby the 
original (cost) value at the beginning of the reporting year is reduced to the residual value at year-end. 
The following concepts are used: 

e A depreciation base, 

e N useful life (given in years), 

e It, residual value after n years (n < N), 

ea, (n — 1,2,..., N) depreciation rate in the n-th year. 


The methods of depreciation differ from each other depending on the amortization rate: 

e straight-line method, i.e., equal yearly rates, 

e decreasing-charge method, 1.6., decreasing yearly rates. 

1.3.5.2 Straight-Line Method 

The yearly depreciations are constant, i.e., for amortization rates a, and the remaining value R,, after 
n years follows: 

|. A — Ry 
MI 
Substituting Ry = 0, then the value of the given thing is reduced to zero after N years, i.e., it is totally 
depreciated. 


=a, (1.88) Ren (n =1,2,...,.N). (1.89) 


an 


W The purchase price of a machine is A = € 50000. In 5 years it should be depreciated to a value R5 = 
€ 10000. 


Year] Depreciation] Depreciation]Residual|Cumulated depr. in % ee "mr 
base expense value ofthe depr. base us depr oe 
: z T TON F TE . nexed amortization schedule: 
3 34 000 8000 26 000 23. 5 Itshowsthat the percentage of ac- 
4 26 000 8000 18 000 30.8 cumulated depreciation with re- 
5 18 000 8000 10 000 444 spect to the actual initial value is 


increasing. 
1.3.5.3 Arithmetically Declining Balance Depreciation 


In this case the depreciation is not constant. It is decreasing yearly by the same amount d, by the 
so-called multiple. For depreciation in the n-th year follows: 


à,—aj—(n—1)d (n—2,3,..., N +1; αι and d are given). (1.90) 


N 
Considering the equality A — Ry = > a, from the previous equation it follows that: 


9[Ναι — (A — Ry)] | 
N(N-1) 


d- (1.91) 


1.9 Business Mathematics 27 


For d = 0 follows the special case of straight-line depreciation. If d > 0, it follows from (1.91) that 


A — Ry 
Wee 7% (1.92) 
where a is the depreciation rate for straight-line depreciation. The first depreciation rate αι of the 
arithmetically-declining balance depreciation must satisfy the following inequality: 
A τ «αις 24 τ ; 
W A machine of € 50000 purchase price is to be depreciated to the value of € 10000 within 5 years by 
arithmetically declining deprecia- 


(1.93) 


Year Depretiation| Depreciation Residual | Depreciation in 96 tion. In the first year € 15000 
base expense value of depr. base should be depreciated. 
1 50 000 15 000 35 000 90.0 The annexed depreciation sched- 
2 35 000 11 500 23 500 32.9 ule is calculated by the given for- 
3 23 500 8 000 15 500 34.0 mulas, and it shows that with the 
4 15 500 4 500 11 000 29.0 exception of the last rate the per- 
5 11 000 1 000 10 000 9.1 centage of depreciation is fairly 
equal. 


1.3.5.4 Digital Declining Balance Depreciation 
Digital depreciation is a special case of arithmetically declining depreciation. Here it is required that 
the last depreciation rate ay should be equal to the multiple d. From ay = d it follows that 


= NEM (1.944) a, = Nd, ag = (N — 1)d, ...,ay =d. (1.94b) 
W The purchase price of a machine is € A = 50000. This machine is to be depreciated in 5 years to 
Year | Depreciation | Depreciation Residual | Depreciation in 96 the value R5 — € 10000 by 
base expense value | ofthe depr. base digital depreciation. 

1 50 000 a, = 5d = 13335] 36665 26.7 The annexed depreciation 

2 36 665 dg = 4d = 10668] 25997 29.1 schedule, calculated by the 

3 25 997 az = 3d = 8001, 17996 30.8 given formulas, shows that 

4 17996 a4 = 2d = 5334| 12662 29.6 the percentage of the de- 

5 12662 a; = d= 2667| 9995 21.1 preciation is fairly equal. 


1.3.5.5 Geometrically Declining Balance Depreciation 
Consider geometrically declining depreciation where p% of the actual value is depreciated every year. 
For the residual value R, after n years holds: 
p n 

Rn A(t a) (i= 18032 (1.95) 
Usually A (the acquisition cost) is given. The useful life of the asset is N years long. If from the quan- 
tities Ry, p and N, two is given, the third one can be calculated by the formula (1.95). 
ΒΑ: A machine with a purchase value € 50000 is to be geometrically depreciated yearly by 10%. 
After how many years will its value drop below € 10000 for the first time? Based on (1.95), yields 
In(10 000/50 000) 

In(1 — 0.1) 

W B: Fora purchase price of A = € 1000 the residual value R, should be represented for n = 1,2,..., 10 
years by a) straight-line, b) arithmetically declining, c) geometrically declining depreciation. The re- 
sults are shown in Fig. 1.4. 


N= 


= 15.27 years. 
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n 1.3.5.6 Depreciation with Different 
R, Types of Depreciation Account 

Since in the case of geometrically declining deprecia- 

tion the residual value cannot become equal to zero 

for a finite n, it is reasonable after a certain time, e.g., 

after m years, to switch over to straight-line depreci- 


arithmetically declining 
geometrically declining 


600 . : h 
ation. m is to be determined to an amount that from 
400 this time on the geometrically declining depreciation 
rate is smaller than the straight-line depreciation rate. 
200 From this requirement it follows that: 
7 m>N-— E (1.96) 
0 2 4 6 8 llOn P 


Here m is the last year of geometrically declining de- 
preciation and N is the last year of linear depreciation 
Figure 1.4 when the residual value becomes zero. 
W A machine with a purchase value of € 50000 is to be depreciated to zero within 15 years, for m 
years by geometrically declining depreciation with 14% of the residual value, then with the straight- 
100 


line method. From (1.96) follows m > 15— d4 = 7.76, i.e., after m = 8 years it is reasonable to switch 


over to straight-line depreciation. 


1.4 Inequalities 


1.4.1 PureInequalities 
1.4.1.1 Definitions 


1. Inequalities 
Inequalities are comparisons of two real algebraic expressions represented by one of the following signs: 


Typel > (“greater”) Type II x (^smaller") 

Type HI #4 (“not equal") Type Hla <> (“greater or smaller") 
TypeIV > (“greater or equal") TypeIVa # (“not smaller”) 
TypeV < (“smaller or equal") Type Va Ἢ (“not greater") 


The notation III and IIIa, IV and IVa, and V and Va have the same meaning, so they can be replaced 

by each other. The notation III can also be used for those types of quantities for which the notions of 

"greater" or "smaller" cannot be defined, for instance for complex numbers or vectors, but in this case 

it cannot be replaced by IIIa. 

2. Identical Inequalities, Inequalities of the Same and of the Opposite Sense, 
Equivalent Inequalities 

1. Identical Inequalities are valid for arbitrary values of the letters contained in them. 

2. Inequalities of the Same Sense belong to the same type from the first two, i.e., both belong to 

type lor both belong to type II. 

3. Inequalities of the Opposite Sense belong to different types of the first two, i.e., one to type 

1, the other to type II. 

4. Equivalent Inequalities are inequalities if they are valid exactly for the same values of the un- 

knowns contained in them. 

3. Solution of Inequalities 

Similarly to equalities, inequalities can contain unknown quantities which are usually denoted by the 

last letters of the alphabet. The solution of an inequality or a system of inequalities means the determi- 

nation of the limits for the unknowns between which they can change, keeping the inequality or system 


1.4 Inequalities 29 


of inequalities true. 
Solutions can be looked for any kind of inequality; mostly pure inequalities of type I and II are to be 
solved. 


1.4.1.2 Properties of Inequalities of Type I and IT 


1. Change the Sense of the Inequality 
If a>bholds, then 6b<a_ is valid, (1.97a 
if a<bholds, then ba _ is valid. (1.97b 
2. Transitivity 
If a>b and b>chold, then ac is valid; (1.98a 
if a<b and b<chold, then a<c _ is valid. (1.98b 
3. Addition and Subtraction of a Quantity 
If ab holds, then atc>b+c is valid; (1.99a 
if a«b holds, then αποςύθο is valid. (1.99b 


By adding or subtracting the same amount to the both sides of inequality, the sense of the inequality 
does not change. 


4. Addition of Inequalities 


If a>b and c>d hold, then a+c>b+d  isvalid; (11004) 
if a<b and c<dhold, then a+e<b+d  isvalid. (1.100b) 
Two inequalities of the same sense can be added. 
5. Subtraction of Inequalities 
If a>b and c<dhold, then a—c>b-—d _ is valid; (1.101a) 
if a<b and c>dhold, then a—c<b—d is valid. (1.101b) 


Inequalities of the opposite sense can be subtracted; the result keeps the sense of the first inequality. 
Subtracting inequalities of the same sense is not allowed. 


6. Multiplication and Division of an Inequality by a Quantity 


b 
If a>b and c»0 hold, then ac>be anc a > — are valid, (1.102a 
c σα 
; a b : 
if a<b and 6320 hold, then ac<be and -< - are valid, (1.102b 
δ: Ὁ 
; a b : 
if ab and c«0 hold, then ac<be and -< - are valid, (1.102c 
E 
: a b : 
if a<b and c«0 hold, then ac>be and — > -are valid. (1.102d 
G g 


Multiplication or division of both sides of an inequality by a positive value does not change the sense 
of the inequality. Multiplication or division by a negative value changes the sense of the inequality. 
7. Inequalities and Reciprocal Values 


If 0<a<b or a<b<0O hold, then - » ; is valid. (1.103) 
a 
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1.4.2 Special Inequalities 
1.4.2.1 Triangle Inequality for Real Numbers 


For arbitrary real numbers a, b, a4, a2, . .. , an, there are the inequalities 
|a + b| € |α| + |b|; Jay tag +-+ + an| < la| + [ao] +--+ + lanl- (1.104) 
'The absolute value of the sum of two or more real numbers is less than or equal to the sum of their 


absolute values. The equality holds only if the summands have the same sign. 


1.4.2.2 Triangle Inequality for Complex Numbers 


For n complex numbers z1,25,...,2, € € 
n n 
Yo ze) = [σι + 22 +t + za < [eal + lee] +++ jm»; (1.105) 
k=1 k=1 


1.4.2.3 Inequalities for Absolute Values of Differences of Real and Complex 
Numbers 

For arbitrary real numbers a,b € R, there are the inequalities 

lal — [Bl] < la — δι < lal + lèl. (1.106) 
The absolute value of the difference of two real numbers is less than or equal to the sum of their absolute 
values, but greater than or equal to the absolute value of the difference of their absolute values. For 
two arbitrary complex numbers 21, 22 € € 


[za] — |50|| < [a — 22| S |a] + Lol - (1107) 


1.4.2.4 Inequality for Arithmetic and Geometric Means 


αι + 3 tt 
AOI» T DU" vaya, for aj 0. (1.108) 


TL 


The arithmetic mean of n positive numbers is greater than or equal to their geometric mean. Equality 
holds only if all the n numbers are equal. 


1.4.2.5 Inequality for Arithmetic and Quadratic Means 


2 42 m 2 
Qaa te tan? (1.109) 
TL 


The absolute value of the arithmetic mean of numbers is less than or equal to their quadratic mean. 


1.4.2.6 Inequalities for Different Means of Real Numbers 


For the harmonic, geometric, arithmetic, and quadratic means of two positive real numbers a and b 
with a « b the following inequalities hold (sce also 1.2.5.5, p. 20): 


αι αρ... n 
n 


a «X ty «X xg « v4 « xg <b. (1.1102) 
Here 
a+b 2ab a? +b? 
αρ ο ta = Vab, xg = ee ro = z` (1.110b) 


1.4.2.7 Bernoulli’s Inequality 
For every real number a > —1 and integer n > 1 holds 
(l+a)">1l+na. (1.111) 
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The equality holds only for n = 1, or a = 0. 
1.4.2.8 Binomial Inequality 
For arbitrary real numbers a,b € R holds 


lab| < πα v). (1.112) 


1.4.2.9 Cauchy-Schwarz Inequality 


1. Cauchy-Schwarz Inequality for Real Numbers 
The Cauchy-Schwarz inequality holds for arbitrary real numbers a;, bj € R : 


[aibi + a2b2 ++++ + anbn| < γαι» Ha? ++++4 ΠΕΙ Eb? +--+ + bn? (1.113a 
or 

(αιδι + agby + +++ + abn)? < (a4? F ao? «4 as?) (bi? Eb) eee + bn”). (1.113b 
For two finite sequences of n real numbers, the sum of the pairwise products is less than or equal to 
the product of the square roots of the sums of the squares of these numbers. Equality holds only i 
αι : bi = ag : ba = +++ = ay: dn. 
If n = 3 and {a1, a2, a3} and {b1, by, b3} are considered as vectors in a Cartesian coordinate system, 
then the Cauchy-Schwarz inequality means that the absolute value of the scalar product of two vectors 
is less than or equal to the product of absolute values of these vectors. If n > 3, then this statemen! 
can be extended for vectors in n-dimensional Euclidean space. 
2. Cauchy-Schwarz Inequality for Complex Numbers 
Considering that for complex numbers |z|? = z*z (23 is the complex conjugate of 2), the inequality 
(1.113b) is valid also for arbitrary complex numbers z;, wj € C: 
(ziw + 2201 + +++ + ση) (ziwi + zowa ο. + στι) 
< (zi*z1 + 29 20 $+ + zs Zn) (Wi w, + Wo* We + +++ + Wr*Wn). 
3. Cauchy-Schwarz Inequality for Convergent Infinite Series and Integrals 
An analogous statement to (1.113b) is the Cauchy-Schwarz inequality for convergent infinite series and 
for certain integrals: 


n=1 n=1 n=1 


CES ad < (f'ireoras) (f'id). (1.115) 


1.4.2.10 Chebyshev Inequality 


If ay, d2,..., Qn, b1, b2,..., bn are real positive numbers, then the following inequalities hold: 
(A) (^ Tto) < abı + abo + +++ + nbn (1.116a) 
n n n 
fo a; <ag<...<a, and bi < bg <...< dy, 
Or dic ασ... 0, and b > b>... > δη, 
and 
(5 ο ““Ὶ (> Tb Tec 2 > ab, + abo + +++ + anbn (1.116) 
n n n 


for a <@<..0% 0, and by > bo >... > by. 


For two finite sequences with n positive numbers, the product of the arithmetic means of these sequences 
is less than or equal to the aritmetic mean of the pairwise products if both sequences are increasing or 
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both are decreasing; but the inequality is valid in the opposite sense if one of the sequences is increasing 
and the other one is decreasing. 


1.4.2.11 Generalized Chebyshev Inequality 


If a1, 2,..., Qn, b1, b2,..., bn are real positive numbers, then the following inequalities hold: 
ye Tas +--+ + an" i Tb xb p {55} + (agb3)* + +++ + (Gnbn)* (1.1172) 
n n = n fem 
fo a <a <... <an and bı <b <... < bn 
or >a... > an and bı > b>... > bn 
and 
Z Tas +: + ant eft Lb. orb > qu + (asb3)" + +++ + (dnbn)* (1.117b) 
n n = n ` 
for αι <a <... <an and bı > b>... > bn. 
1.4.2.12 Hölder Inequality 
1. Hölder Inequality for Series 
1 1 
If p and q are two real numbers such that — + = = 1 is fulfilled, and if x1, £2, . . . , x, and y1, Y2,- - - , Yn 


D q 
are arbitrary 2n complex numbers, then the following inequality holds: 


1 
q 


T πι 
μμ ο (1.1180) 
k=1 k=1 k=1 


This inequality is also valid for countable infinite pairs of numbers: 


ΡΜ (1.1186) 
k=1 


k=1 k=1 


where from the convergence of the series on the right-hand side the convergence of the left-hand side 
follows. 


2. Holder Inequality for Integrals 


If f(x) and g(a) are two measurable functions on the measure space (X, A, μ) (see 12.9.2, p. 695), then 
the following inequality holds: 


ΠΟΠ {λων a i [ve η z (1180) 


1.4.2.13 Minkowski Inequality 


1. Minkowski Inequality for Series 
If p > 1 holds, and (x; ) £zf* and (yi), with £k, Yk € C are two sequences of numbers, then holds: 


X i 
p Ῥ 


ΣΕ » e| d» (1.1194) 
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2. Minkowski Inequality for Integrals 
If f(a) and g(x) are two measurable functions on the measure space (X, A, μ) (see 12.9.2, p. 695), then 
1 


holds: 
ΠΠ < og 


1.4.3 Solution of Linear and Quadratic Inequalities 
1.4.3.1 General Remarks 


During the solution of an inequality it is transformed into equivalent inequalities step by step. Similarly 
to the solution of an equation the same expression can be added to both sides; formally, it may seem 
that a summand is brought from one side to the other, changing its sign. Furthermore one can multiply 
or divide both sides of an inequality by a non-zero expression, where the inequality keeps its sense i 
this expression has a positive value, and changes its sense if this expression has a negative value. An 
inequality of first degree can always be transformed into the form 

ax > b. (1.120 


The simplest form of an inequality of second degree is 


F ΠΠ : (1.119b) 
X 


αἲ » πι (1.121a) or z? <m (1.121b 


and in the general case it has the form 


ax? +br+c>0 (1.122a) or ar? +br+ce< o0. (1.122b 
1.4.3.2 Linear Inequalities 
The linear inequality of first degree (1.120) has the solution 


b b 
z>- fora>0 (1.123a) and zc « — for a « 0. (1.123b) 
a a 
: 2 
E 5r+3<8r+1, 517—812 «1—3, 3r < —2, gos. 
1.4.3.3 Quadratic Inequalities 
Inequalities of second degree in the form 
27 >m (1.124a) and α «πὶ (1.124b 
have solutions 
a) zx? » m: Form 70 thesolutionis z » Vm and x<-ym (|v| » Vm), (1.125a 
for m < 0 the inequality obviously holds for any x. .125b 
b) αὖ «πι: Form 0 thesolutionis — ym « x « vm (|z| < vm), (1.126a 
for m € 0 there is no solution. .126b 


1.4.3.4 General Case for Inequalities of Second Degree 


ax? bx 4- c» 0 (1.1272) or az?-Fbx c « 0. 127b 


First dividing the inequality by a. If a < 0 then the sense of the inequality changes, but in any case i 
will have the form 


r? +pr+q<0 (1.127c) or a? +px+q>0. 127d 
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By completing the square it follows that 
2 2 mr? 2 
με PY Y 3 "ER: P\ -- 
(s + 3 < @ q (1.127e) or (s + z) > (5) q. (1.1271) 


2 
Denoting x + 5 by z and (5) — q by m, the inequalities 


z «πι (1.1282) or >m (1.128b) 


can be obtained. Solving these inequalities yields the values for x. 


À 7\? 9. 3 7T 3 
. 942 -- pl 0 Ty FA μα eo. Lue 
B A: —22? + 142 — 20 > 0, 2? — Tz 4-10 < 0, (s 2) <2, -$<09-3$<5, 
oe. NS 
ΠΡΙ 
The solution is 2 < x < 5. 
E B: 2?+6r+15>0, (r4-3) --θ. The inequality holds identically. 
T? 9 7 39 7 3 


MC: —22? + 14x — 20 < 0, (e-5) >F $—5g?5 and T-3577 


The solution intervals are x > 5 and x < 2. 


1.5 Complex Numbers 


1.5.1 Imaginary and Complex Numbers 
1.5.1.1 Imaginary Unit 


The imaginary unit is denoted by i, which represents a number different from any real number, and 
whose square is equal to —1. In electronics, instead of i the letter j is usually used to avoid accidently 
confusing it with the intensity of current, also denoted by i. The introduction of the imaginary unit leads 
to the generalization of the notion of numbers to the complex numbers, which play a very important role 
in algebra and analysis. The complex numbers have several interpretations in geometry and physics. 


1.5.1.2 Complex Numbers 
The algebraic form of a complex number is 

z=a+ib. (1.129a) 
When a and b take all possible real values, then one gets all possible complex numbers z. The number 
a is the real part, the number b is the imaginary part of the number z: 

a=Re(z), b= Im(2). (1.129b) 
For b = 0 it is z = a, so the real numbers form a subset of the complex numbers. For a = 0 it is z = ib, 
which is a “pure imaginary number". 
The total set of complex numbers is denoted by C. 
Remark: Functions w = f(z) with complex variable z = x + iy will be discussed in function theory 


(see 14.1, p. 731 ff). 


1.5.2 Geometric Representation 
1.5.2.1 Vector Representation 


Similarly to the representation of the real numbers on the numerical axis, the complex numbers can be 
represented as points in the so-called Gaussian number plane: A number z = a + ib is represented by 
the point whose abscissa is a and ordinate is b (Fig. 1.5). The real numbers are on the axis of abscissae 
which is also called the real axis, the pure imaginary numbers are on the axis of ordinates which is 
also called the imaginary axis. On this plane every point is given uniquely by its position vector or 
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radius vector (see 3.5.1.1, 6., p. 181), so every complex number corresponds to a vector which starts at 
the origin and is directed to the point defined by the complex number. So, complex numbers can be 
represented as points or as vectors (Fig. 1.6). 


1.5.2.2 Equality of Complex Numbers 


Two complex numbers are equal by definition if their real parts and imaginary parts are equal to each 
other. From a geometric viewpoint, two complex numbers are equal if the position vectors correspond- 
ing to them are equal. In the opposite case the complex numbers are not equal. The notions “greater” 


and “smaller” are meaningless for complex numbers. 
AIm(z) Im(z) 
pl----.-Zgatib Ζ leds 47 
T ! 
0 a Re(z) 0 Re(z) Re(z) 
Figure 1.5 Figure 1.6 Figure 1.7 


1.5.2.3 Trigonometric Form of Complex Numbers 
The form 
z-—a-ib (1.130a) 


is called the algebraic form of the complex number. Using polar coordinates yields the trigonometric 
form of the complex numbers (Fig. 1.7): 

z = p(cos y +i sin 9). (1.130b) 
The length of the position vector of a point p = |z| is called the absolute value or the magnitude of the 


complex number, the angle v, given in radian measure, is called the argument of the complex number 
and is denoted by arg z: 


p-—|z| p =argz =w+2kr with 0< p «oo, =r <w € +r, k= 0,+1,2,.... (1130c) 
One calls y the principal value of the argument of the complex number. 
The relations between p, y and a, b for a point are the same as between the Cartesian and polar 
coordinates of a point (see 3.5.2.2, p. 192): 


a = pcos q, (1.131a) b=psing, (1131b) ρ- Va? +B, (1.131c) 
b 
arctan — for a > 0, 
a 
arccos Í for b > 0, p > 0, +5 fora — 0, b> 0, 
"m ? π 
P = 1 — arccos 2 forb < 0, p » 0, p= z5 fora = 0, b « 0, 


p 
undefined for p = 0 — +n fora<0, b 70, 
a 


(1.131d) b 
arctan——7 fora «O0, b « 0. 
a 


(1.131e) 
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'The complex number z — 0 has absolute value equal to zero; its argument arg 0 is undefined. 


1.5.2.4 Exponential Form of a Complex Number 
The representation 

z= pe (1.132a) 
is called the exponential form of the complex number, where p is the magnitude and q is the argument. 
The Euler relation is the formula 

οἱ” = coso +i sing. (1.132b) 
W Representation of a complex number in three forms: 


a)z=1+iv3 (algebraic form), b) z = 2 c +i sin 5) (trigonometric form), 


c) z= 2e'3 (exponential form), considering the principal value of it. 
Without restriction to the principal value holds the representation 
π π π 

d) z=1+iV3 = 2οχρ i ( t 3] =2 [eos (5 Σετ) +i sin G 3] (k — 0, 
1.5.2.5 Conjugate Complex Numbers 
Two complex numbers z and z* are called conjugate complex numbers if their real parts are equal and 
their imaginary parts differ only in sign: 

Re(z*) = Re(z), Im(z*)-— —Im(z). (1.133a) 
The geometric interpretation of points corresponding to the conjugate complex numbers are points 
symmetric with respect to the real axis. Conjugate complex numbers have the same absolute value, 
their arguments differ only in sign: 


punt 


2,...). 


z —a-ib- p(cosq +i sing) = pel”, (1.133b) 


2* =a — ib = p(cosp —ising) = pe. (1.133c) 


Instead of z* one often uses the notation z for the conjugate of z. 


1.5.3 Calculation with Complex Numbers 
1.5.3.1 Addition and Subtraction 


Addition and subtraction of two or more complex numbers given in algebraic form is defined by the 
formula 


21 + z2 — 24". = (αι 


101) + (as +ib2) — (ag 4-103) +- 

az — a3 ++++) +i(bi + by — ba +++). (1.134) 
The calculation can be done in the same way as doing with usual binomials. As a geometric interpreta- 
tion of addition and subtraction can be considered the addition and subtraction of the corresponding 
vectors (Fig. 1.8). For these the usual rules for vector calculations are to be used (see 3.5.1.1, p. 181). 
For z and z*, z + z* is always real, and z — z* is pure imaginary. 


0| 1 Re(z) 


Re(z) 


Figure 1.8 Figure 1.9 Figure 1.10 
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1.5.3.2 Multiplication 


The multiplication of two complex numbers σι and 20 given in algebraic form is defined by the following 
formula 


2129 = (αι +ib1)(a2 +ib2) = (αι — ὐιῦο) + i (a1b2 + bia). (1.135a) 
For numbers given in trigonometric form holds 
2122 = [pi(cos yı + i sin q1)][p»(cos p2 + i sin q)] 
= ριρο[εοβ(φι + p2) +i sin(gi + φο)], (1.135b) 
i.e., the absolute value of the product is equal to the product of the absolute values of the factors, and 


the argument of the product is equal to the sum of the arguments of the factors. The exponential form 
of the product is 


229 = pipae ete» (1.135c) 


The geometric interpretation of the product of two complex numbers 21 and 20 is a vector (Fig. 1.9). It 
is generated by rotation of the vector corresponding to z1 by the argument of the vector 2ο (clockwise or 
counterclockwise according to the sign of this argument), and the length of the vector will be stretched 
by |50|. 
The product 2122 can also be represented with similar triangles (Fig. 1.9). The multiplication of a 
complex number z by i means a rotation by 7/2 and the absolute value does not change (Fig. 1.10). 
For z and z*: 

zz = P -- | =a? +8. (1.136) 


1.5.3.3 Division 
Division is defined as the inverse operation of multiplication. For complex numbers given in algebraic 
form holds 


z αι +ib aia + bibo . agby — a4bx 
1 mdi _ aa E pie D (1.1372) 
z2 d3-ib, ay? + b» a3? + be 
For complex numbers given in trigonometric form holds 
zı _ pi(cosyi +ising:) p [ 
29  po(cosyo+isinysr) po 
i.e., the absolute value of the quotient is equal to the ratio of the absolute values of the dividend and 
the divisor; the argument of the quotient is equal to the difference of the arguments. 
For the exponential form follows 
Bb P1 ilp), (1.137c) 
42 p2 
In the geometric representation the vector corresponding to 21/2» can be generated by a rotation of the 
vector representing σι by — arg z2, and then by a contraction by |z9|. 


cos(q1 — v) + i sin(qi — v2)], (1.137b) 


Remark: Division by zero is impossible. 


1.5.3.4 General Rules for the Basic Operations 
Calculations with complex numbers z = a + ib are to be done in the same way as doing with ordinary 
binomials, but considering i? = —1. Dividing a complex number by a complex number first the imag- 
inary part of the denominator has to be removed by multiplying the numerator and the denominator 
of the fraction by the complex conjugate of the divisor. This is possible because 
(a -Fib)(a—ib) =a” +b? (1.138) 

is a real number. 

(3-4i)(-1- 5i» | 10-7 — (3—4i)(1— 10i — 25) 

14 3i © δ 1+ 3i 


(10+ 7i)i — —2(3 — 4i)(12 + 5i) 
5Η 11-38 
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7—-10i -2(56-- 33i)(1— 3i) 7- 100  —2(—43 — 201i) , 7 — 10i 
T A 
9 


= "5 10 


50+191i) = 1 2. 
5 (I+ 31) —30) (50+191i) = 10+38.2i 


1 
5 
1.5.3.5 Taking Powers of Complex Numbers 
The n-th power of a complex number could be calculated using the binomial formula, but it would 
be very inconvenient. For practical reasons the trigonometric form is to be used and the so-called de 
Moivre formula: 

[p(cosy~ +i sin y)]|” = p"(cosny +i sinng), (1.139a) 
i.e., the absolute value is raised to the n-th power, and the argument is multiplied by n. In particular, 
holds: 


i?=-1, i?2-i, it=41 (1.139b) in general jew. (1.139c) 


1.5.3.6 Taking the n-th Root of a Complex Number 


Taking of the n-th root is the inverse operation of taking powers. For z = p(cosy +i sin p) 4 0 the 


notation 
zl" = Vz (n> 0, integer), (1.140a) 
is the shorthand notation for the n different values 


p+ 2k + 2k 
τα. ών 5), 
n 


| n 
(k 20,1,2,...,n— 1). (1.140b) 
While addition, subtraction, multiplication, division, and tak- 
ing a power with integer exponent have unique results, taking 
the n-th root has n different solutions wx. 
The geometric interpretations of the points ωχ are the vertices 
of a regular n-gon whose center is at the origin. In Fig. 1.11 
the six values of 4/z are represented. Figure 1.11 


1.6 Algebraic and Transcendental Equations 


1.6.1 Transforming Algebraic Equations to Normal Form 
1.6.1.1 Definition 


The variable x in the equality 

F(z) = f(z) (1.141) 
is called the unknown if the equality is valid only for certain values x1, 5,..., x, of the variable, and 
these values are called the solutions or the roots of the equation. Two equations are considered equiv- 
alent if they have exactly the same roots. 
An equation is called an algebraic equation if the functions F(x) and f(x) are algebraic, i.e., they are 
rational or irrational expressions; of course one of them can be constant. Every algebraic equation can 
be transformed into the normal form 

P(x) = αμα" αμ 41" | +++» + a,x +a9 = 0 (1.142) 
by algebraic transformations. The roots of the original equation occur among the roots of the normal 
form, but under certain circumstances some are superfluous. The leading coefficient an is frequently 
transformed to the value 1. 
The exponent n is called the degree of the equation. 
x-14+vV2?-6 2 -- 


3 . 
ls . The transformations 


3(x — 2) T 


W Determine the normal form of the equation 
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step by step are: 

a(x — 1 + Vx? -- 6) = δα(α — 2) + 3(α -- 2)(x — 3), x? — x ανα — 6 = 3x? — 6x + 32? — 15x 4 
18, zV/z?—6-5z?—20z--18, z?(a?— ϐ) = 25x4— 2002? + 5802? — 7202 +324, 241 — 2002? + 
58627? — 720x + 324 = 0. The result is an equation of fourth degree in normal form. 


1.6.1.2 System of n Algebraic Equations 


Every system of algebraic equations can be transformed to normal form, i.e., into a system of polynomial 
equations: 


P24;2, s.) = Oy PE αν) 0, ο PADs = 0s (1.143) 
The P; (i = 1,2,...,n) are polynomials in x, y, z,.... 
x 1 r-1 
E Determine the normal form of the equation system: 1. ον. ο΄ —Xz, 3. sy=2. 


Vy oz y—1 
—y=0, 2. 32—2x -1-— y?z 4 2yz 


1.6.1.3 Extraneous Roots 


After transforming an algebraic equation into the normal form (1.142) it can happen that the equation 
P(x) = 0 has some roots which are not solutions of the original equation (1.141). The roots of the 
equation P(r) — 0 must be substituted into the original equation to check whether they are really 
solutions of (1.141). 

Extraneous solutions can emerge if not invertible transformations are performed: 

1. Vanishing denominator If the equation has the form 


2 


The normal form is: 1. z?z 


x 
o 
59 
8 
as 
| 
x 
Il 
c 


-0 (1.1442) 


with polynomials P(x) and Q(x), then the normal form of (1.144a) after multiplying by the denomi- 
nator Q(x) is: 
P(x) — 0. (1.144b) 

The roots of (1.144b) are the same as the roots of (1.144a), except the ones which are roots both of the 
numerator and of the denominator, i.c. which satisfy P(x) = 0 and Q(x) = 0. If x = a is a root of 
the denominator, then in the case x = o the multiplication by Q(x) is a multiplication by zero. Every 
time when a non-identical transformation is performed, the checking of the solutions is necessary (sce 
also 1.6.3.1, p. 43). 
P 

r-1 2-1’ 
the normal form, but it is not a solution of the original equation, since the fractions are not defined for 
p= 


The corresponding normal form is z* — 3? — x +1 = 0. xı = 1 is a solution of 


2. Irrational equations If the original equation contains radicals, the normal form is usually 
achieved by powering. E.g. squaring is not an identical transformation (since it is not invertible). 


E νι | = 2rorvyzr+T = 2r — 1. By squaring both sides of the second form of the equation 
its normal form is 417 — 5x — 6 = 0, and the roots are zı = 2 and αυ = —3/4. The root a; = 2isa 
solution of the original equation, but the root 2» = —3/4 is not. 


1.6.2 Equations of Degree at Most Four 
1.6.2.1 Equations of Degree One (Linear Equations) 


1. Normal Form 
ax 4b — 0 (az 0). (1.145) 
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2. Number of Solutions 
There is a unique solution 
b 
n (1.146) 


a 


1.6.2.2 Equations of Degree Two (Quadratic Equations) 


1. Normal Form 


ax? 4- bx -- c — 0 (a 0) (1.1472) 
or divided by a: 
x? +pe+q=0. (1.147b) 
2. Number of Real Solutions of a Real Equation Depending on the sign of the discriminant 
„2 
D = 4ac— δ) for (1.147a) or D =q — m for (1.147b), (1.148) 
holds: 


e for D < 0, there are two real solutions (two real roots), 

e for D = 0, there is one real solution (two coincident roots), 

e for D > 0, there is no real solution (two complex roots). 

3. Properties of the Roots of a Quadratic Equation If x; and x» are the roots of the quadratic 

equation (1.147a) or (1.147b), then the following equalities hold: 
b c 

πι Ἔπος —— — —p, πι πω --Ξ. (1.149) 

a a 

4. Solution of Quadratic Equations 


Method 1: Factorization of 
aa? 4 bx +c = a(x — o)(r — B) (1.150a) or z?-- pr q- (x —o)(x — 8), (1.150b) 


if it is successful, immediately gives the roots 
πιξα, ποςβ. (1.151) 
WB ert+x—-6=0, r? +2-6=(x+3)(x— 2), a) = —3, t2 =2. 
Method 2: Using the solution formula in the cases D < 0: 
a) For (1.147a) the solutions are 


Bl 
—~+,](=] τας 
—b+ Vi? — 4ac 2 2 
— (1.1522) or t= . Ω1530) 
a a 


31ο = 


If b is an even integer the second formula is to be used. 
b) For (1.147b) the solutions are 


p p 
+42 -q. 11 
s Nag l (1.153) 


tj => 


1.6.2.3 Equations of Degree Three (Cubic Equations) 
1. Normal Form 
ax? 4- ba? +cx+d=0 (a #0) (1.154a) 


b 
or after dividing by a and substituting y = x + 36 there is 
a 


y? +3py +2q¢=0 orin reduced form y?+p*y+q* =0, (1.154b) 


1.6 Algebraic and Transcendental Equations 41 


where 
Fe 2p? bc d " 3ac — b? 
q'-—29q CM 33a and p* — 3p RN (1.154c) 
2. Number of Real Solutions Depending on the sign of the discriminant 
D= + (1.155) 
holds: 


e for D > 0, one real solution (one real and two complex roots), 

e for D < 0, three real solutions (three different real roots), 

e for D = 0, one real solution (one real root with multiplicity three) in the case p = q = 0; or two real 
solutions (a single and a double real root) in the case p? = —q? 4 0. 

3. Properties of the Roots of a Cubic Equation If xı, x», and x3 are the roots of the cubic 
equation (1.154a), then the following equalities hold: 


g d 
21 + 20 + T3 = τα» mm + 4103 + 2021 = go TI =. (1.156) 


4. Solution of a Cubic Equation 
Method 1: If it is possible to decompose the left-hand side into a product of linear terms 


aa? + ba? + cx +d = a(z —a)(x — B)(x — η) (1.157a) 
one immediately gets the roots 
=a, a=b; g=: (1.157b) 


Bo?+2?-6r =0, αὖ +2?-6r =2(x+3)(x—-2); 2 =0, £2 3, 13 = 2. 


Method 2: Using the Formula of Cardano. By substituting y = u + v the equation (1.154b) has the 
form 


u? 4- v? + (u + v)(3uv + 3p) + 2q = 0. (1.158a 
This equation is obviously satisfied if 

u +u — —2q and uv =—p 1.158b 
hold. Writing (1.158b) in the form 

u? 4v) = —2q, uy? = p, (1.158c 


there are two unknowns u? and v’, the sum and product of which are known. Therefore using the Vieta 
root theorem (see 1.6.3.1, 3., p. 44) the solutions of the quadratic equation 


w — (w+ v?)w + αδυ) = w? + 2qw — p? = 0 1.158d 
can be calculated: 
w =u = -q +4 +P, w = v’ = -q -VÈ +, 1.158e 


so for the solution y of (1.154b) the Cardano formula results in 


y=u+v i-e» yetti gaye to. (1.1586 


Since the third root of a complex number means three different numbers (see (1.140b), p. 38) there are 
nine different cases, but because of uv = —p, the solutions are reduced to the following three: 


yi = uj + v, (if possible, consider the real third roots u; and v; such that wiv; = —p), (1.158g 
i 1 
Yo -u( ον») vi ( 5 v3) ; (1.158h 


i 
2 2 
ys = U1 ( 


τοί: Nie 


5v3) τοι ο (1.158i 
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E + 6y +2 = 0 with p= 2, q= 1 and q? + p? = 9 and u = 4/—14 3 = 4/2 = 1.2599, 
v = 4-1 = 4/—4 = —1.5874. The real root is yj = u +v = —0.3275, the complex roots are 


1 3 
Y2,3 σία + v) 1u v) = 0.1638 + 1: 2.4659. 


Method 3: For a real equation, the auxiliary values given in Table 1.3 can be used. With p from 
(1.154c) 
r= xp (1.159) 


is substituted where the sign of r is the same as the sign of g. Next, using Table 1.3, one can determine 
the value of the auxiliary variable y and with it the roots γι, y2 and ya depending on the signs of p and 
D=@t+p. 


Table 1.3 Auxiliary values for the solution of equations of degree three 


p«o 
p»0 
+p <0 g+p?>0 
cosp = f coshyy = 4 sinh y = τα 
yı = —2r cos £ yı = —2r cosh £ yı = —2r sinh É 
y2 = +2r cos (60° £) y2 = r cosh S ΠΕ; sinh 7 yp-r sinh 7 +iv3r cosh £ 
y3 = +2r cos (ου | £) y3 = r cosh Ξ iv3rsinh A y3 = rsinh S i 3r cosh £ 


B ὁ 9y+4=0. p 3, q=2, +p? <0, r = V3, cosp = = 0.3849, y = 67°22’. 


2 
να 
yi = —2V3 cos 22°27’ = —3.201, y2 = 2V3 cos(60° — 22°27') = 2.747, ys = 2V3 cos(60° + 22°27’) = 
0.455. 

Checking: yi + ya + yz = 0.001 which can be considered 0 for the accuracy of our calculations. 
Method 4: Numerical approximate solution, see 19.1.2, p. 952; numerical approximate solution by 
the help of a nomogram, see 2.19, p. 128. 
1.6.2.4 Equations of Degree Four 
1. Normal Form 

azt 4- ba? +cr?+dr+e=0 (a0). (1.160 
If all the coefficients are real, this equation has 0 or 2 or 4 real solutions. 
2. Special Forms If b = d = 0 holds, the roots of the biquadratic equation 

4 2 

ax + cr +e =0 (1.161a 
can be calculated by the formulas 
—c+ Vc? — 4αε 
X1234 — Ey, y= ———— — —. 

2a 

For a — e and b — d, the roots of the equation 

ax* + ba? + cx? +ba +a=0 (1.161c 
can be calculated by the formulas 


qus Vy? —4 —b + Vb? — 4ac + 8a? 


112,34 = 2 y= 2a 


(1.161b 


(1.161d 
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3. Solution of a General Equation of Degree Four 
Method 1: If somehow the left-hand side of the equation can be factorized 


azt + bx? + cx? + dz +e = 0 = a(z —a)(x — β)(α — γ)(α — ὃ) (1.162a) 
then the roots can be immediately determined: 

=a, ποβ, =7, t=. (1.162b) 
E ct οι —2? + 22 =0, x(x? — 1)(x — 2) = α(α-- Ό(α + Ό(α — 9): 
ay 0, Xs 1, σα ME PELA 


Method 2: The roots of the equation (1.1622) for a = 1 coincide with the roots of the equation 


i by —d 
a? +(b4 Α)5 c z Jes (1.1632) 
where A = +\/8y + 0? — 4c and y is one of the real roots of the equation of third degree 
8y? — 4cy? + (2bd — 8e)y + e(4e - b’) - d? --0 (1.163b) 
b bc b 
with B — da > # 0. The case B = 0 gives by the help of the substitution £ = u — 18 biquadratic 


equation of the form (1.161a) for u with a 1. 
Method 3: Approximate solution, see 19.1.2, p. 952. 


1.6.2.5 Equations of Higher Degree 


It is impossible to give a formula or a finite sequence of formulas which produce the roots of an equation 
of degree five or higher (see also 19.1.2.2,2., p. 954). 


1.6.3 Equations of Degree n 
1.6.3.1 General Properties of Algebraic Equations 


1. Roots 
The left-hand side of the equation 
z"-Fa, ια 5... cag 0 (1.164a 


is a polynomial P (x) of degree n, and a solution of (1.164a) is a root of the polynomial P,,(x). If a is 
a root of the polynomial, then P, (x) is divisible by (x — a). Generally 

P, (x) = (x — a) Py-i(x) + Ρι(α). (1.164b 
Here Β, ι(α) is a polynomial of degree n — 1. If P, (2) is divisible by (x — a)", but it is not divisible by 
(x — a)** then a is called a root of order k of the equation P,,(x) = 0. In this case a is a common roo 
of the polynomial P, (x) and its derivatives to order (k — 1). 
2. Fundamental Theorem of Algebra 
Every equation of degree n whose coefficients are real or complex numbers has n real or complex 
roots, where the roots of higher order are counted by their multiplicity. Denoting the roots of P(x 
by o, 6,y,... and they have multiplicity k,l, m, .. ., then the product representation of the polynomial 
is 


P(x) = (x — o)*(x — B) (a2 — yyn.... (1.165a 
The solution of the equation P(x) = 0 can be simplified by reducing the equation to another one, which 
has the same roots, but only with multiplicity one (if possible). In order to get this, the polynomial is 
to be composed into a product of two factors 


P(x) = Q(a)T(2), (111655) 
such that 
T(x) = (x — a (x — 8)71..., Q(x) = (z — α)ία-- β).... (1.165c) 
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Because the roots of the polynomial P(x) with higher multiplicity are the roots of its derivative Ρ'(α). 
too, T(x) is the greatest common devisor of the polynomial P(x) and its derivative P'(x) (see 1.1.6.5, 
p.14). Dividing P(x) by T(x) yields the polynomial Q(x) which has all the roots of P(x), and each 
root occurs with multiplicity one. 


3. Theorem of Vieta About Roots 


The relations between the n roots £1, £2, ..., 1, and the coefficients of the equation (1.164a) are: 
n 
Ti Fiata + Lp » 2; Qs 1, 
i=l 
n 
2121 d 2433 +... + Ὅῃ i124 = 5 TjTj = απο, 

ij-1 

i<j 
τι 

©0903 + T124 +... + Ζῃ 22ῃ.12ῃ = p» XjTjYk = —n-3, (1.166) 
i,j.k=1 
icjek 


αιτο... Ὅῃ = (—1)"ao. 


1.6.3.2 Equations with Real Coefficients 
1. Complex Roots 


Polynomial equations with real coefficients can also have complex roots but only pairwise conjugate 
complex numbers, i.e., if a = a + ib is a root, then 8 = a — ib is also a root, and it has the same 


multiplicity. The expressions p (α 4- B) 2a and q = aß = a? + b? satisfy the unequation 
2 
(5) — q < 0, so that 
(zx -- α)(α -- β) -- α” }- ρα ἠ-ᾳ (1.167) 


holds. Substituting the product corresponding to (1.167) for every pair of factors in (1.165a), one gets 
a decomposition of the polynomial with real coefficients into real factors. 


P(x) = (a — o1) (x — o3)? --- (x — ay) 


(x? δια + qi)" (a? + pow + qo) +++ (a? + px + qp)™. (1.168) 
Here 04,02, ..., a; are the { real roots of the polynomial P(x). It also has r pairs of conjugate complex 
roots, which are the roots of the quadratic factors x? + pjz -- q; (i = 1,2,...,r). The numbers a; (j = 


5:52 
1,2,...,0), pi and q; (i = 1,2,...,r) are real and the inequalities (5) — qi < 0 hold. 


2. Number of Roots of an Equation with Real Coefficients 

According to (1.167) every equation of odd degree has at least one real root. The number of further 
real roots of (1.164a) between two arbitrary real numbers a < b, can be determined in the following 
way: 


a) Separate the Multiple Roots: Separating the multiple roots of P(r) — 0, yields an equation 
which has all the roots ofthe original equation, but only with multiplicity one. Then the form mentioned 
in the case of the fundamental theorem must be produced. 

For practical reasons it is a good idea to start with the determination of the Sturm chain (the Sturm 
functions (1.169)). This is almost the same as the Euclidean algorithm for determining the greatest 
common devisor, but it gives some further information. If P,, is not a constant then P(x) has multiple 
roots, which must be separated. Therefore in the following it can be assumed that P(x) — 0 has no 
multiple roots. 
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b) Creating the Sequence of Sturm Functions: 
P(x), P’(x), P (x), Po(x),..., P4, = const. (1.169) 


Here P(x) is the left-hand side of the equation, P'(x) is the first derivative of P(x), Ρι(α) is the re- 
mainder on division of P(x) by P'(x), but with the opposite sign, P»(x) is the remainder on division of 
P'(x) by Pi(x) similarly with the opposite sign, etc.; P = const is the last non-zero remainder, but 
it must be a constant, otherwise P(x) and P'(x) have common devisors, and P(x) has multiple roots. 
In order to simplify the calculations the remainders can be multiplied by positive numbers, what does 
not change the result. 


c) Theorem of Sturm: If A is the number of changes in sign, i.e. the number of changes from “+” 
to “—” and vice versa, in the sequence (1.169) for x = a, and B is the number of changes in sign in the 
sequence (1.169) for x = b, then the difference A — B is equal to the number of real roots of P(x) = 0 in 
the interval [a, b]. If in the sequence some numbers are equal to zero, then they should not be considered 
in the sign change count. 


Bl Determination of the number of roots of the equation zt — 5x? + 8r — 8 = 0 in the interval [0, 2]. 
The calculations by the Sturm functions are: P(x) = x! — 5a? + 8r — 8; P'(r) = 4α) — 10x + 8; 
Ῥι(α) = 5a? — 12x + 16; P(x) 3x + 284; P, 1. Substituting r — 0 results in the sequence 
—8, +8, +16, +284, —1 with two changes in sign, substituting x = 2 results in +4, +20, +12, +278, 21 
with one change in sign, so A — B = 2 — 1 = 1, i.e., between 0 and 2 there is one root. 


d) Descartes Rule: The number of positive roots of the equation P(x) = 0 is not greater than the 
number of changes of sign in the sequence of coefficients of the polynomial P(x), and these two numbers 
can differ from each other only by an even number. 


Bl What can be told about the roots of the equation z* + 22? — x? -5x — 1 = 0 ? The coefficients in the 
equation have signs +, +, —, +, — , i.e., there are three changes of sign. By the rule of Descartes the 
equation has either three or one roots. Because on replacing x by --ᾱ' the roots of the equation change 
their signs, and on replacing x by x + h the roots are shifted by h, the number of negative roots, or 
the roots greater than h can be estimated by the help of the rule of Descartes. In the given example 
replacing z by —a yields «4 — 22? — x? — 5a — 1 = 0, i.e., the equation has at most one negative root. 
Replacing x by x + 1 yields z + 62? + 11a? + 13x + 6 = 0, i.e., every positive root of the equation (one 
or three) is smaller than 1. 


3. Solution of Equations of Degree n 

Usually equations with n > 4 can be solved only approximately. In practice, approximate methods are 
also used to get solutions of equations of degree three or four (see 19.1.2.3, p. 954). 

In order to determine certain real roots of an algebraic equation the general numerical procedures for 
non-linear equations can be used (see 19.1, p. 949). In order to determine all roots, including the 
complex roots of an algebraic equation of degree n the Brodetsky-Smeal method can be used (see [1.7], 
{19.31]). In order to determine complex roots one can use the Bairstow method (see [19.31]). 


1.6.4 Reducing Transcendental Equationsto Algebraic Equations 


1.6.4.1 Definition 
An equation F(x) = f(x) is transcendental if at least one of the functions F(x) or f(x) is not algebraic. 
WB A: 3° — 47? 2", B B: 2log; (3r — 1) — log; (12x + 1) = 0, C: 3cosh x = sinh x + 9, 
9 : f 1 
E D: 27! = 8%? — 4%- BE: sing —cos?r-—-,W F: xcosx — sing. 


In some cases it is possible to reduce the solution of a transcendental equation to the solution of an 
algebraic equation, for instance by appropriate substitutions. In general, transcendental equations 
can be solved only approximately. In the following sections some special transcendental equations are 
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discussed which can be reduced to algebraic equations. 


1.6.4.2 Exponential Equations 


Exponential equations can be reduced to algebraic equations in the following two cases, if the unknown 
x or a polynomial P(x) is only in the exponent of some quantities a, b, c, .. .: 


Pile) pP2G).... are connected by multiplication or division, then the logarithm can 


a) If the powers a 
be taken on an arbitrary base. 
2log4 


log 4 — log3 + log 2° 


WB 3” = 47? . 27; zlog3 = (x — 2) log 4 + zlog2; x 


b) If a, b, c, ... are integer (or rational) powers of the same number k, i.e., a = k”, b = k™, c = k}, . 
holds, then by substituting y = k” one can get an algebraic equation for d and after solving it follows 


1. y 4 
the solution z = 959. 
log 
Μη E ο. 95 23x 22x : . : 3 " . 
i? - 477; s e 16. Substitution of y = 2* results in y? — 4y^ — 32y = 0 and 


yi 8, yo 4,93 = 0; 2? 8, 272 4, 2*9 0, so x, 3 follows. There are no further real 
roots. 


1.6.4.3 Logarithmic Equations 


Logarithmic equations can be reduced to algebraic equations in the following two cases, if the unknown 

x or a polynomial P(x) is only under the logarithm sign: 

a) If the equation contains only the logarithm of the same expression, then by introducing this as a 

new unknown, one can solve the equation with respect to it. The original unknown can be determined 

by using the logarithm. 

B milog, P(x)? +n = aJ (log, P(x)|? +b. The substitution y = log, P(x) yields the equation my? + 
= ayy? +b. After solving for y one gets the solution for x from the equation P(x) = a. 

b) If the equation is a linear combination of logarithms of polynomials of x, on the same base a, with 

integer coefficients m, n, ..., i.e., it has the form m log, Pi(x) + nlog, P(a JH .. = 0, then the left- 

hand side can be written as the logarithm of a rational expression. (The original equation may contain 

rational coefficients and rational expressions under the logarithm, or logarithms with different bases, 

if the bases are rational powers of each other.) 

3r — 1)? 3r — 1)? 

CER E 

12r -1 12x +1 

Substituting zı = 0 in the original equation gives negative values in the logarithm, i.e., this logarithm 

is a complex value, so 2; = 0 is not a solution. 


E 2log; (3x — 1) — log,(12% + 1) = 0, logs τι = 0,24 = 2. 


1.6.4.4 Trigonometric Equations 


Trigonometric equations can be reduced to algebraic equations if the unknown x or the expression nx +a 
with integer n is only in the argument of the trigonometric functions. After using the trigonometric 
formulas (see 2.7.2, p.81) the equation will contain only one unique function containing z, and after 
replacing it by y an algebraic equation arises. The solution for x is obtained from the solutions for y, 
naturally taking the multi-valuedness of the μα into consideration. 


: 1 ; δ ; 3 
E sina = cos? x — 40 sins = 1 — sin? x — " Substituting y = sin x yields y? + y — 17 0 and 


1 8 ; : 1 
y dien. The result y» gives no real solution, because | sin x| < 1 for all real x; from y; = 5 


meso 
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z 
follows x = : + 2Κπ and x = - + 2km with k = 1,2,3,.... 


1.6.4.5 Equations with Hyperbolic Functions 
Equations with hyperbolic functions can be reduced to algebraic equations if the unknown is only in the 
argument of the hyperbolic functions. Rewriting the hyperbolic functions as exponential expressions, = 


1 i . : 8 , : 
then substituting y = e” and — = e “, and the result is an algebraic equation for y. After solving this 
y 


the solution is x = In y. 


om pe o —g T 2 , 
ESTEE TE 3 Y μα. i +9; e” 42677 59 = 0; y+ —-9 = 0, y?-9y +2 = 0; 
y 
"PES E nu cae uA & 2.1716, αὐ = TeL & —1.4784. 


2 Functions 


2.1 Notion of Functions 
2.1.1 Definition ofa Function 
2.1.1.1 Function 


If x and y are two variable quantities, and if there is a rule which assigns a unique value of y to a given 
value of x, then y is called a function of x, using the notation 

y= f). (2:1) 
The variable 2 is called the independent variable or the argument of the function y. The values of x, 
to which a value of y is assigned, form the domain D of the function f(x). The variable y is called the 
dependent variable; the values of y form the range Wof the function f(x). Functions can be represented 
by the points (x, y) as curves, or graphs of the function. 


2.1.1.2 Real Functions 
If both the domain and the range contain only real numbers the function y = f(x) is called a real 
function of a real variable. 
WA:uy-—2? with D: ~œ <x «oo, W: 0€ y € cc. 
BB: y= yx with D: 0€ x «oo, W:0<y<om. 
2.1.1.3 Functions of Several Variables 
If the variable y depends on several independent variables x1, v2,...,2n, then the notation 

y fm, en (2.2) 
is used for a function of several variables (see 2.18, p. 118). 
2.1.1.4 Complex Functions 


If the dependent and independent variables are complex numbers w and z respectively, then w — f(z) 
means a complex function of a complex variable, (see 14.1, p. 731). Complex-valued functions w(x) are 
called complex functions even if they have real arguments zx . 


2.1.1.5 Further Functions 


In different fields of mathematics, for instance in vector analysis and in vector field theory (see 13.1, 
p. 701), other types of functions are to be considered whose arguments and values are defined as follows: 


1. The arguments are real -- the function values are vectors. 

W A: Vector functions (see 13.1.1, p. 701). 

Β D: Parameter representations of curves (see 3.6.2, p. 256). 

2. The arguments are vectors -- the function values are real numbers. 

W Scalar fields (see 13.1.2, p. 702). 

3. The arguments are vectors -- the function values are vectors. 

E A: Vector fields (see 13.1.3, p. 704). lE B: Parametric representations or vector forms of surfaces 
(see 3.6.3, p. 261). 


2.1.1.6 Functionals 
If a real number is assigned to every function x = x(t) of a given class of functions, then it is called a 
functional. 

b 
W A: Ifx(t)isa given function which is integrable on fa, b], then f(a) = / x(t) dt isa linear functional 
defined on the set of continuous functions x integrable on [a, b] (see 12.5, p. 677). 
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W D: Integral expressions in variational problems (see 10.1, p. 610). 
2.1.1.7 Functions and Mappings 
Suppose there are given two non-empty sets X and Y. A mapping, which is denoted by 

f: XY, (2.3) 
isa rule, by which a uniquely defined element y of Y is assigned to every element x of X. The element 
y is called the image of x, as formula y = f(x). The set Y is called the image space or range of f, the 
set X is called the original space or domain of f. 
W A: If both the original and the image spaces are subsets of real numbers, i.e., X = D C R and 
Y = W c R hold, then (2.3) defines a real function y = f(x) of the real variable 2. 
WB B: If f isa matrix A = (aj) (i = 1,2,...,m;j = 1,2,...,n) of type (m, n) and X = R” and 
Y = R”, then (2.3) defines a mapping from IR" into R". The rule (2.3) is given by the following 
system of m linear equations: 


Yı Gy + 2t + +++ + Ainin 
. Yo = αολιΏι + 22312 +++ + 24, 

y=Ax or 4 
Ym = QmiXi + Am2%2 + +++ + AmnXn 


ie. Ax means the product of the matrix A and the vector x. 
Remarks: 
1. The notion of mapping is a generalization of the notion of function. So, some mappings are some- 
times called functions. 


2. The important properties of mappings can be found in 5.2.3, 5. p. 333. 

3. A mapping, which assigns to every element from an abstract space X a unique element usually from 
a different abstract space Y, is called an operator. Here an abstract space usually means a function 
space, since the most important spaces in applications consist of functions. Abstract spaces are for in- 
stance linear spaces (see vector spaces 5.3.8, p. 365), metric spaces (see 12.2, p. 662) and normed spaces 
(see 12.3, p. 669). 


2.1.2 Methods for Defining a Real Function 
2.1.2.1 Defining a Function 


A function can be defined in several different ways, for instance by a table of values, by graphical rep- 
resentation, i.e., by a curve, by a formula, which is called an analytic expression, or piece by piece with 
different formulas. Only such values of the independent variable can belong to the domain of an an- 
alytic expression for which the function makes sense, i.e., it takes a unique, finite real value. If the 
domain is not otherwise defined, the domain is considered as the maximal set for which the definition 
makes sense. 

2.1.2.2 Analytic Representation of a Function 

Usually the following three forms are in use: 

1. Explicit Form: 

y= f(a). (24 
By=Vl-22, —1<x<1, yÈ 0. Here the graph is the upper half of the unit circle centered a 
the origin. 

2. Implicit Form: 

F (x,y) — 0, 6.5 
in the case when there is a unique y which satisfies this equation, or it can be told which solution is 
considered to be the value of the function. 

Bert+y-1=0, -l<«x< +l, y > 0. Here the graph is again the upper half of the uni 
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circle centered at the origin. It should be emphasized that «? + y? + 1 = 0 itself does not define a real 
function. 
3. Parametric Form: 

v—g(D, y=). (2.6) 
The corresponding values of x and y are given as functions of an auxiliary variable t, which is called a 
parameter. The functions y(t) and (8) must have the same domain. This representation defines a real 
function only if x = y(t) defines a one-to-one correspondence between x and t. 
E -= olt), y-—w(t) with y(t) = cost and y(t) = sint, 0 € t € m. Here the graph is again the 
upper half of the unit circle centered at the origin. 
Remark: Functions given in parametric form sometimes do not have any explicit or implicit parameter- 
free equation. 
Mc =t+42sint = y(t), y =t — cost = y(t). 


Examples for Functions Given Piece by Piece: 


y y y=x-E(x) ΒΛ: y = E(x) = int(z) = [x] = n for 
— 1 n<r<n+l1, ninteger. 

y=E(x) + The function E(x) or int(x) (read “integer 
-- part of x”) means the greatest integer less 

than or equal to x. 
E B: The function y = frac(x) = x — [v] 
(read “fractional part of 2” ) gives the differ- 
ΓΙ 23 4 5 6 A 0| 12345 S ence of x and [x] (Fig. 2.1b). Fig. 2.1a,b 
shows the corresponding graphical repre- 
a) b) sentations, where the arrow-heads mean 
that the endpoints do not belong to the 

Figure 2.1 curves. 


= 


— 


Cl -: Ὁ οὐ Η σι 


g forz <0, 


α.. ν-{2 for x > 0, 


(Fig. 2.2a). 
—1 for x <0, 
MD: y = sign(r) = 0 for x = 0, 
+1 for x > 0, 
(Fig. 2.22). By sign(r) (read "signum 
x”), the sign function is denoted. 


a) 


Figure 2.2 


2.1.3 Certain Types of Functions y y 

2.1.3.1 Monotone Functions m. 

If a function satisfies the relations T | ! 
| 


r 


> > 
f(t2) > f(x1) or f(x) < f(a), (2.7a) 0| x ox 0 XX X 
for arbitrary arguments xı and x» with x2 > 2 in its do- a) b) 
main, then it is called monotonically increasing or mo- 
notonically decreasing (Fig. 2.3a,b). Figure 2.3 


If one of the above relations (2.7a) does not hold for every x in the domain of the function, but it is valid, 
e.g., in an interval or on a half-axis, then the function is called monotonic in this domain. Functions 
satisfying the relations 


f (v2) > f(a) ον f (2) < f(r), (2.2) 
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i.e., when the equality never holds in (2.7a), are called strictly monotonically increasing or strictly mono- 
tonically decreasing. In Fig. 2.3a there is a representation of a strictly monotonically increasing func- 
tion; in Fig. 2.3b there is the graph of a monotonically decreasing function being constant between xı 
and zə. 


WB ; = e” is strictly monotonically decreasing, y = In x is strictly monotonically increasing. 


2.1.3.2 Bounded Functions 


A function is called bounded above if there is a number (called an upper bound) such that the values 
of the function never exceed it. A function is called bounded below if there is a number (called a lower 
bound) such that the values of the function are never less than this number. If a function is bounded 
above and below, it simply is called bounded. 


E A: y=1-—27 is bounded above (y < 1 WB B: y = e” is bounded below (y > 0). 


4 
WB C: y= sinz is bounded (-1 € y € +1). BD: y= i is bounded (0 < y < 4). 


+2? 


2.1.3.3 Extreme Values of Functions 
The function f(x) with domain D has an absolute or global maximum at the point a, if for all x € D 


f(a) 2 f(x (2.82) 
holds. The function f(x) has a relative or local maximum at the point a, if the inequality (2.8a) holds 
only in an environment of the point a, i.e. for all x witha —e < x «a--e,6»0,r€ D. 


In analogy the definition of an absolute or global minimum as well as for a relative or local minimum can 
be given, but the inequality (2.8a) is to be replaced by 


f(a) € f(x). (2.8b) 
Remarks: 
a) The notions maximum and minimum, are called the extreme values, they are not coupled to the 
differentiability of functions, i.e., they hold also for functions which are not differentiable in some points 
of the domain. Examples are discontinuities of curves (see Figs.2.9, p. 58 and 6.10b,c, p. 443). 


b) Criterions for the determination of extreme values of differentiable functions see in 6.1.5.2, p. 443. 


2.1.3.4. Even Functions 
Even functions (Fig. 2.4a) satisfy the relation y y 
f2) = f(a). (2.9a) 


If D is the domain of f, then 


(x € D) > (—x€ D) (2.9b) " = 
should hold. 0 x 0 pe x 
MA: y=cosxz, MB: y= zr- 3r? +1. a) b) 
2.1.3.5 Odd Functions Figure 2.4 
Odd functions (Fig. 2.4b) satisfy the relation 

f(-x) = -f (x). (2.10a) 
If D is the domain of f, then 

(z € D) > (—z € D) (2.10b) 


should hold. 
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MA: y=sinz, MB: / - τὸ - τα. 
2.1.3.6 Representation with Even and Odd Functions 


If for the domain D of a function f the condition “from x € D it follows that —x € D" holds, then f 
can be written as a sum of an even function g and an odd function u: 


: 1 1 
fle) = g(x) tula) with g(x) UG) ο ο” a) 
qd 1 cy "E L 1 oT o t Y qa 11 v. πω, 

Β f(x) -σἵ - 5 (e He ) t3 (c 6 ) cosh x + sinh x (see 2.9.1, p. 89). 
2.1.3.7 Periodic Functions y T 
Periodic functions satisfy the relation 

f(x--T)-—f(x) T const, T Z 0. (2.12) 

> 

Obviously, if the above equality holds for some T, it holds for 0l κ--Τ--» x 
any integer multiple of T. The smallest positive number T sat- 
isfying the relation is called the period (Fig. 2.5). Figure 2.5 


2.1.3.8 Inverse Functions 
A function y = f(x) with domain D and range W assigns a unique y € W to every x € D. If reversed, 
to every y € W there belongs only one x € D, then the inverse function of f can be defined. It is 
denoted by φον by f~t. Here [1 is a symbol for a function, not a power of f. 
To find the inverse function of f, the variables x and y are interchanged in the formula of f, then y is 
expressed from x = f(a) in order to get y = y(x). The representations y = f(x) and x = φ(υ) are 
equivalent. The following important formulas come from this relation 

f(e(y)) 2 y and e(f(z)) ==. (2.13) 
The graph of an inverse function y = φ(α) is obtained by reflection of the graph of y = f(x) with 
respect to the line y = x (Fig. 2.6). 
W The function y = f(x) =e" (D: —oo < x < oo,W : y > 0) is equivalent Obviously, every strictly 
monotonic function has an inverse function. 


Ay 


y=arésin x 


-© Ti LZ—v-s 
> 
ria. = 


BA: y=f(x)=2 with D: «> 0, W: y>0; 

y= p(x) = Vx with D: «> 0, W: y>0. 
ΒΒ: y=f(x) =e with D: —oo «zr < oo, W: y>0; 

y --ϕφ(α) --η« with D: r0, W: =œ < y< œ. 
BC: y=f(x)=sinz with D: -n/2<a<7n/2, W: —1<y<l; 

y= (x) =aresing  wihD: -l<a<1, W: -n/2<y<7/2. 
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Remarks: 

1. Ifa function f is strictly monotone in an interval J C D, then there is an inverse f^! for this interval. 
2. If a non-monotone function can be partitioned in strictly monotone parts, then the corresponding 
inverse exists for each part. 


2.1.4 Limits of Functions 


2.1.4.1 Definition of the Limit of a Function 
The function y = f(a) has the limit A at x =a 
lim f(x) 2A or f(x) — A fo za, (2.14) 


if as x approaches the value a infinitely closely, the value of f(x) approaches the value A infinitely 
closely. The function f(x) does not have to be defined at a, and even if defined, it does not matter 
whether f(a) is equal to A. 
Precise Definition: The limit (2.14) exists, if for any given positive number e€ there is a positive 
number η such that for every x Z a belonging to the domain and satisfying the inequality 

[x — a| «η, (2.15a) 


the inequality 

|f(z) - Al <€ (2.15b) 
holds eventually with the exception of the point a (Fig. 2.7). 
If a is an endpoint of a connected region, then the inequality 
|x — αἱ «η is reduced either to a — η < x or to x < a +n (see 
also 2.1.4.5). 


2.1.4.2 Definition by Limit of Sequences (see dion LEE: 


7.1.2, p. 458) Figure 2.7 

A function f(x) has the limit A at x = a if for every sequence 

U1,€9,...,%p,... of the values of x from the domain and converging to a (but being not equal to a), the 
sequence of the corresponding values of the function f(x), f(x2),..., f(@n),... converges to A. 


2.1.4.3 Cauchy Condition for Convergence 

A necessary and sufficient condition for a function f(x) to have a limit at x = a is that for any two 
values xı # a and πο ¥ a belonging to the domain and being close enough to a, the values f(x) and 
f (x2) are also close enough to each other. 

Precise Definition: A necessary and sufficient condition for a function f(x) to have a limit at x = a 
is that for any given positive number e there is a positive number η such that for arbitrary values xı 
and 2» belonging to the domain and satisfying the inequalities 


0«|zx;—a| «9 and O0«]|rs—a]| «η, (2.164) 
the inequality 
(αι) — f(z2)| < € (2.16b) 


holds. 


2.1.4.4 Infinity as a Limit of a Function 
The symbol 
lim |f) = 00 (2.17) 


means that as x approaches a, the absolute value |f (z)| does not have an upper bound. 
Precise Definition: The equality (2.17) holds if for any given positive number K there is a positive 
number η such that for any x 4 a from the interval 


a—n«z«actn (2.18a) 
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the corresponding value of | f (x)| is larger than K: 
|f(z)) » K. (2.18b 
If all the values of f(x) in the interval 


a-n<x<at+yn (2.18c 
are positive, one writes 
lim f(x) = +00; (2.18d 


if they are negative, one writes 


lim f(x) = --οο. (2.18e 


2.1.4.5 Left-Hand and Right-Hand Limit of a Function 


A function f(x) has a left-hand limit A^ at x = a, if as x tends to a from the left, the value f(x) tends 
to A`: 


A = lim f(x) = f(a — 0). (2.194) 


ᾱ-λα-Ώ 


Similarly, a function has a right-hand limit Α if as x tends to a from the right, the value f(a) tends to 
At: 


At = lim f(r) = f(a+0). (2.19b) 


a—at+0 


The equality lim f(x) = A is valid only if the left-hand and 


right-hand limits exist, and they are equal: 


At =A =A. (2.19c) 
1 
Β The function f(x) = —— —,- tends to different values from 
l4 ezs-t 
the left and from the right for x > 1: f(1-0) = 1, f(1+0) = 0 
(Fig. 2.8). Figure 2.8 


2.1.4.6 Limit ofa Function as x Tends to Infinity 


Case a) A number A is called the limit of a function f(x) as x — +00, and one writes 
A= lim f(x) (2.20a) 


z—-oo 
if for any given positive number £ there is a number N > 0 such that for every x > N, the corresponding 
value f(x) is in the interval A — € < f(x) < A+e. Analogously 


A= lim f(x) (2.20b) 


@—+—00 


is the limit of a function f(x) as x — —oo if for any given positive number € there is a positive number 
N > Osuch that for any x < —N the corresponding value of f(a) isin the interval A—e < f(x) < Ate. 


BA: lim =1, ΝΠ: lim ale =1, MC: lim ec*=0. 


x=>+%0 mp r-oo 2 =Z—+—00 


Case b) Assume that for any positive number K, there is a positive number N such that if x > N or 
x < —N then the absolute value of the function is larger then K. In this case one writes 


Jim |f(@| = oo or lim |f(z)| = oo. (2.200) 
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3 3 
. T -—1 . a’ —1 
BA: lim = +œ, M B: lm — = —00, 
rz—-Foo f z—-—00 x 
: 1 i D 
BC: lim — ——oo, M D: lim 5 +oo. 
x>too 2 α-}-οο og? 


2.1.4.7 Theorems About Limits of Functions 

1. Limit of a Constant Function The limit of a constant function is the constant itself: 
lim A = A. (2.21) zm 
ra 


2. Limit ofa Sum or a Difference If among a finite number of functions each has a limit, then the 
limit of their sum or difference is equal to the sum or difference of their limits (if this last expression 
does not contain oo — oo): 


lim [/(2) + φ(α) -- (2)] = lim f(z) + lim ία) -- lim v(z). (2.22) 


3. Limit of Products If among a finite number of functions each has a limit, then the limit of their 
product is equal to the product of their limits (if this last expression does not contain a 0 - oo type): 


tim [/(α) e) 9(2)] = [tim f] [lim ρα) [im v] - (2.23) 


r—a 


4. Limit of a Quotient The limit of the quotient of two functions is equal to the quotient of their 
limits, in the case when both limits exist and the limit of the denominator is not equal to zero (and this 
last expression is not an oo/oo type): 
ο TV 
= vx) ~ lim gl£) ee) 
ra 


Also if the denominator is equal to zero, usually one can tell if the limit exists or not, checking the sign 
of the denominator (the indeterminate form is 0/0). Similarly, one can calculate the limit of a power 
by taking a suitable power of the limit (if it is not a 0°, 1%, or oc? type). 


5. Pinching If the values of a function f(x) lie between the values of the functions y(x) and w(x), 
Le. p(x) < f(x) < y(x), and if lim p(x) = A and lim y(x) = A hold, then f(x) has a limit, too, and 
lim f(x) = A. (2.25) 


2.1.4.8 Calculation of Limits 


The calculation of the value of a limit can be made by using the 5 described theorems as well as some 
transformations (sce 2.1.4.7). 
1. Suitable Transformations 


For the calculation of limits the expression is to be transformed into a suitable form. There are several 
types of recommended transformations in different cases; here are three of them as examples. 


23-1 ; 
ΒΑ: lim = lim (a? +z +1) = 3. 
asl z — 1 al 
.ovVic-z-l , (vi+r-1)(vi+r+1) . 1 1 
E B: lim — lim lim =. 
x0 r z—0 a(/l+a2+4+1) z0/1+a+1 2 
TTAR TR (sin 2x iin Dap 
"TEE 2r — lim (sin 2x) sin 2x 


— 2 lim — 2. Here one can refer to the well-known theorem 
r0 τα r—0 x 2230 22; 
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sina 
im 
a30 (y 


2. Bernoulli-UHospital Rule 


=] (see (MA, 2.1.4.9, p. 57)). 


0 
In the case of indeterminate forms like j^ 2 , 0-06, oo — œ, 0°, œ, 1%, one often applies the 
oo 
Bernoulli-U Hospital rule (usually called l'Hospital rule for short): 
ple) 
vu) 


0 oo 
Case a) Indeterminate Forms ποτ. First, use the theorem only after checking if for f(x) = 
oo 
the following conditions are fulfilled. 
Suppose lim p(x) = 0 and lim w(x) = 0 or lim p(x) = oo and lim w(x) = oo, and suppose that there 
is an interval containing a such that the functions y(x) and y(x) are defined and differentiable in this 


interval except perhaps at a, and w(x) # 0 in this interval, and lim ) exists. Then 
aa p(x) 


£ / p 
P na 2O, (2.26) 
Ha) ero) 
Remark: If the limit of the ratio of the derivatives does not exist, it does not mean that the original 
limit does not exist. Maybe it does, but one cannot tell this using l'Hospital's rule. 


mw 


Ifa 
. g(r)... : : . | . 
If lim D is still an indeterminate form, and the numerator and denominator satisfy the assumptions 
asa afl (a 
of the above theorem, l'Hospital's rule can be used again. 
2cos2x 2 
. lnsin2r . ins . 2tanz . m .. cos?2a 
Β lim - im sin 2g im im C98. — jim - 
r0 lnsinzrz r0 ^ 230 tan 2x r0 2 r0 cos?r 
sin x cos?2: 


Case b) Indeterminate Form 0 - oo: Having f(x) = φ(α) w(x) and lim p(x) = 0 and 


lim w(x) = oo, then in order to use l'Hospital's rule for lim f (x) it is to be transformed into one of the 


. x£ . Wa T . : 0 foo ο. 
forms lim olz) or lim p(z) , 80 it is reduced to an indeterminate form — or — like in case a). 
roa 1 aoa 0 oo 
ylz) p(x) 
T= 2x -2 
WB lim (7 —2x)tanz = lim = lim =2. 
an /2 z—7*/2 cota zr—m-/2 — 1 


Case c) Indeterminate Form oo — oo: If f(x) = p(x) — (v) and lim p(x) = oo and lim y(x) = oo 


; : F 0 œ : : 
hold, then this expression can be transformed into the form Ü or — usually in several different ways; 
oo 


1 1 il 
for instance as y — v = ( ) / . Then it is to proceed as in case a). 

Y Φ)) ow 

gi 1 vinx-—a“+1 0 A . ο 
[ lim ( g ) üm| —* s = —. Applying l'Hospital rule twice yields 
z>1\x—1 lng 451 rlnir—lnz 0 2 ἡ 
1 
i zlniz—z-l . lng , m 1 
ΝΤΕ ΤΝ. ε.α. 
x rlnz-—lnz E Inzil--— ER -- 
T T Ὁ 


Case d) Indeterminate Forms 0° , oo? , 1°: If f(x) = v(x)” and lim φ(α) = +0 and lim w(x) = 
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0 holds, then first the limit A of In f(x) = y(x) In y(x), is to be calculated, which has the form 0 - oo 
(case b)), then lim f(x) = οὐ holds. 


The procedures in the cases oo? and 1° are similar. 


. Ina 
WB lim 2” =X, lng” = rlz, Jim zing = lim — = lim (-ᾱ) =0,ie., ΑΞ ΙΧ =0, 


14-0 B +0 αλβ 2-1 2>+0 
so X = 1, and finally lim 2” = 1. 
x—++0 


3. Taylor Expansion = 


Besides l'Hospital's rule the expansion of functions of indeterminate form into Taylor series can be 
applied (see 6.1.4.5, p. 442). 


( x? ob ) 
g= σπα 
Im ᾗ ! 

W lir € : lim » 5 


. 2—sin: i i 1 z j 
n — lir - = lim ee =. 
10 g 120 "E z0\ 3! 5! 6 


2.1.4.9 Order of Magnitude of Functions and Landau Order Symbols 


Comparing two functions, often their mutual behavior with respect to a certain argument x = a is to 
be considered. It is also convenient to compare the order of magnitude of the functions. 
1. A function f(x) tends to infinity at a at a higher order (or faster rate) than a function g(x) at a, if 


f(x) 


the quotient 


and the absolute values of f(x) exceed any limit as x tends to a. 


2. A function f(x) tends to zero at a at a higher order than a function g(x) at a, if the absolute values 


tends to zero as x tends to a. 


of f(x), g(x) and the quotient f(x) 


g(x) 
3. Two functions f(x) and g(x) tend to zero or to infinity at s at the same order of magnitude, if 
f(z) 
g(x) 


are constants. 


O0<m< < M holds for the absolute value of their quotient as 2; tends to a, where M and m 


4. Landau Order Symbols The mutual behavior of two functions at a point x = a can be described 
by the Landau order symbols O (“big O”), or o (“small o") as follows: If x > a then 


f(x) = O(g(r)  meansthat lim πω =A#0, Ac- const, (2.27a) 
ra οί x) 
and 
f(x) = 0(g(@)) means that lim oe =0, (2.27b) 
where a = +00 is also possible. The Landau order symbols have meaning only by assuming x tends to 
a given a. 


ΒΑ: sina = O(x) for x — 0, because with f(x) = sin x and g(x) = x holds: lim 


=10,ie., 
sin x behaves like x in the neighborhood of x = 0. 
Β B: For f(x) 2 1— cosx and g(x) = sin x the function f(x) vanishes with a higher order than g(x): 


T l—cosr 
lim Λα) lim - 0, i.e., 1 — cosa = o(sin x) for x > 0. 
250 |g(x) x0] sing 


Bl C: f(x) and g(x) vanish by the same order for f(x) = 1 — cosa, g(a) = x°: 
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x ie. 1 — cosa = O(x?) for x — 0. 


5. Polynomial The order of magnitude of polynomials at +00 can be expressed by their degree. So 
the function f(x) = x has order 1, a polynomial of degree n + 1 has an order higher by one than a 
polynomial of degree n. 


6. Exponential Function The exponential function e” tends faster to infinity for 2 — oo more 
quickly to infinity than any high power z" (n is a fixed positive number): 
-- 


lim 
Ώ-λοο 


— oo. (2.282) 


g^ 
The proof follows by applying I'Hospital's rule for a natural number n: 
E εὖ e 

lim — = lim =.= lim Č S90: (2.28b) 


α-λοο pn roo η roo n! 


7. Logarithmic Function The logarithm tends to infinity more slowly than any small positive power 
x® (a is a fixed positive number): 


log x 


=0. (2.29) 


γα 


The proof is with the help of l'Hospital's rule. 
2.1.5 Continuity of a Function 
2.1.5.1 Notion of Continuity and Discontinuity 


Most functions occurring in practice are continuous, i.e., for small 
changes of the argument x a continuous function y(x) changes 
also only a little. The graphical representation of such a func- 
tion results in a continuous curve. If the curve is broken at some D 
points, the corresponding function is discontinuous, and the val- 9 
ues of the arguments where the breaks are, are the points of dis- | 
continuity. Fig. 2.9 shows the curve of a function, which is piece- h 
wise continuous. The points of discontinuity are A, B, C, D, E, F 0| A 
and G. The arrow-heads show that the endpoints do not belong 
to the curve. 


2.1.5.2 Definition of Continuity 
A function y = f(x) is called continuous at the point x = a if Figure 2.9 


1. f(x) is defined at a; 
2. the limit lim f(a) exists and is equal to f(a). 


B 


2 


This is exactly the case if for an arbitrary € > 0 there is a δ(ε) > 0 such that 
f(x)— f(a) <£ forevery x with |r —a| «ὁ (2.30) 


holds. 

Here also it is to talk about one-sided (left- or right-hand sided) continuity, if instead of lim f(x) = f(a) 
only the one-sided limit lim f(x) (or lim 1 f (x)) is to be considered and this is equal to the value f(a). 

z—a— za 

Ifa function is continuous for every x in a given interval from a to b, then the function is called continuous 
in this interval, which can be open, half-open, or closed (see 1.1.1.3, 3., p. 2). If a function is defined 
and continuous at every point of the numerical axis, it is called continuous everywhere. 

A function has a point of discontinuity at x — a, which is an interior point or an endpoint of its domain, 
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if the function is not defined here, or f(a) is not equal to the limit lim f(x), or the limit does not exist. 


If the function is defined only on one side of x = a, e.g., Ένα for x = 0 and arccos x for x = 1, then it 
is not a point of discontinuity but it is a termination. 

A function f(x) is called piecewise continuous, if it is continuous at every point of an interval except at 
a finite number of points, and at these points it has finite jumps. 


2.1.5.3 Most Frequent Types of Discontinuities 


1. Values of the Function Tend to Infinity 


The most frequent discontinuity occurs if the function tends to +00 (points B, C, and E in Fig. 2.9). 
WB A: f(x) — tanz, f E 0) too, f G + 0) = —oo. The type of discontinuity (see Fig. 2.34, 
p. 78) is the same as at E in Fig. 2.9. For the meaning of the symbols f(a — 0), f(a + 0) see 2.1.4.5, 
p. 54. 


1 
W B: f(z) GIP’ f(1— 0) = +00, f(1 +0) = +00. The type of discontinuity is the same as at 


the point B in Fig. 2.9. 


1 
WB C: f(r) =e, [1 -- 0) — 0, f(1-- 0) — oo. The type of discontinuity is the same as at C in 
Fig. 2.9, with the difference that this function f(x) is not defined at x = 1. 


2. Finite Jump 


Passing through x = a the function f(a) jumps from a finite value to another finite value (like at the 
points A, F, G in Fig. 2.9, p. 58): The value of the function f(x) for x = a may not be defined here, 
as at point G; or it can coincide with f(a — 0) or with f(a + 0) (point F); or it can be different from 
f(a — 0) and f (a + 0) (point A). 
1 

BA: f(x) - po f(1—0) 2 1, f(1 +0) =0 (Fig. 2.8, p. 54). 

l4 es-t 
W B: f(x) = E(x) (Fig. 2.16, p. 50) f(n —0) 2n — 1, f(n-- 0) =n (n integer). 


B C: f(x) = lim : f(1-0)21, f(140)20, f(1) = a 


π-λοο ] + 2η 3 


3. Removable Discontinuity 

Assuming that lim f (x) exists, i.e., f(a — 0) = f(a + 0) holds, but either the function is not defined 
for x = a or there is f(a) Æ lim f(x) (point D in Fig. 2.9, p. 58). This type of discontinuity is called 
removable, because defining f(a) — lim f(x) the function becomes continuous here. The procedure 


consists of adding only one point to the curve, or changing the place only of one point at D. The 
different indeterminate expressions for x = a, which have a finite limit examined by l'Hospital's rule 
or with other methods, are examples of removable discontinuities. 


Vlrz-1 


0 1 
Β f(z) = is an undetermined 0 expression for r — 0, but lim f(x) = z the function 
Ha xr 
1l+a-1 
ABE for x #0 
πως 5 
1 
5 for « = 0 


is continuous. 
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2.1.5.4 Continuity and Discontinuity of Elementary Functions 


The elementary functions are continuous on their domains; the points of discontinuity do not belong 
to their domain. The following theorems hold: 


1. Polynomials are continuous everywhere. 


2. Rational Functions with polynomials P(x) and Q(x) are continuous everywhere except 


x 

the points x, where Q(x) = 0. If at x = a, Q(a) = 0 and P(a) ¥ 0, the function tends to +00 on 
both sides of a; this point is called a pole. The function also has a pole if P(a) = 0, but a is a root of 
the denominator with higher multiplicity than for the numerator (see 1.6.3.1, 2., p. 43). Otherwise the 
discontinuity is removable. 


3. Irrational Functions Roots of polynomials are continuous for every x in their domain. At the end 
of the domain they can terminate by a finite value if the radicand changes its sign. Roots of rational 
functions are discontinuous for such values of x where the radicand is discontinuous. 


4. Trigonometric Functions The functions sin x and cos x are continuous everywhere; tan x and 
— y (2n + 1}π : ΗΝ 
sec x have infinite jumps at the points 2 = -—————; the functions cot x and cosec x have infinite 


2 


jumps at the points x = nr (n integer). 


5. Inverse Trigonometric Functions The functions arctan x and arccot x are continuous every- 
where, arcsin v and arccos x terminate at the end of their domain because of —1 € x < +1, and they 
are continuous here from one side. 


6. Exponential Functions e” or a” with a > 0 They are continuous everywhere. 
7. Logarithmic Function log x with Arbitrary Positive Base The function is continuous for all 


positive x and terminates at x = 0 because of lim logz = —oo by a right-sided limit. 
x40 


8. Composite Elementary Functions The continuity is to be checked for every point x of every 
elementary function containing in the composition (see also continuity of composite functions in 2.1.5.5, 


2., p. 61). 


1 
The exponent ——~ has an 


-3 
xsin Y1 -— T G2 


1 1 1 
infinite jump at x = 2; for x = 2 also ez-? has an infinite jump: (=) =, Ga Ξ oo. 
α--2-0 2=2+40 


W Find the points of discontinuity of the function y = 


The function y has a finite denominator at x = 2. Consequently, at x = 2 there is an infinite jump of 
the same type as at point C' in Fig. 2.9, p. 58. 

For x = 0 the denominator is also zero, just like for the values of x, for which sin Ψ'1 — x is equal to 
zero. These last ones correspond to the roots of the equation V1 — x = nz or x = 1 — n?z?, where n 
is an arbitrary integer. The numerator is not equal to zero for these numbers, so at the points z — 0, 
y—1lxz—lcsr-lcsSmr-1-c27,...the function has the same type of discontinuity as 
the point E in Fig. 2.9, p. 58. 


2.1.5.5 Properties of Continuous Functions 


1. Continuity of Sum, Difference, Product and Quotient of Continuous Functions 


If f(x) and g(x) are continuous on the interval [a,b], then f(x) + g(x), f(a) g(x) are also continuous, 
f(x) 


g(x) 


and if g(x) # 0 on this interval, then is also continuous. 
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2. Continuity of Composite Functions y = f(u(x)) 
If u(x) is continuous at x = a and f(u) is continuous at u = u(a) then the composite function y = 
f (u(x)) is continuous at x = a, and 


lim fu) = f (Vim ο = ftua) (2.31) 


is valid. This means that a continuous function of a continuous function is also continuous. 
Remark: The converse sentence is not valid. It is possible that the composite function of discontinuous 
functions is continuous. 
3. Bolzano Theorem 
If a function f(x) is continuous on a finite closed interval [a,b], and f (a) and f(b) have different signs, 
then f(x) has at least one root in this interval, i.e., there exists at least one interior point of this interval 
c such that: 

f(c) 20 with a<c<b. (2.32) 
The geometric interpretation of this statement is that the graph of a continuous function can go from 
one side of the x-axis to the other side only if the curve has an intersection point with the x-axis. 
4. Intermediate Value Theorem 


If a function f(x) is continuous on an interval, and it has different values A and B, at the points a and 
b of this interval, where a « b, i.e., 


f(a)—A, f(b)=B, AFB, (2.33a) 
then for any value C between A and B there is at least one point c between a and b such that 
fl) 2C, (a«c«b, A«C«Bor A» C » D). (2.33b) 


In other words: The function f(x) takes every value between A and B on the interval (a, b) at least 
once. Or: The continuous image of an interval is an interval. 


5. Existence of an Inverse Function 

If a one-to-one function is continuous on an inter- 
val, it is strictly monotone on this interval. 

If a function f (x) is continuous on a connected do- | 
main I, and it is strictly monotone increasing or de- P4 ! 
creasing, then for this f(x) there also exists a con- fi I | 


tinuous, strictly monotone increasing or decreas- 


ing inverse function y(x) (see also 2.1.3.8, p. 52), b) 
which is defined on domain II given by the values 
of f(x) (Fig. 2.10). Figure 2.10 


Remark: In order to make sure that the inverse function of f (x) is continuous, f (x) must be continuous 
on an interval. Supposing only that the function is strictly monotonic on an interval, and continuous 
at an interior point c, and f(c) — C, then the inverse function exists, but may be not continuous at C. 
6. Theorem About the Boundedness of a Function 
If a function f(x) is continuous on a finite, closed interval [a, b] then it is bounded on this interval, i.e., 
there exist two numbers m and M such that 

m< flx) <M fo a<a<b. (2.34) 


7. Weierstrass Theorem 

If the function f(x) is continuous on the finite, closed interval fa, b] then f(x) has an absolute maximum 
M and an absolute minimum m, i.e., there exists in this interval at least one point ὁ and at least one 
point d such that for all x with a < x < b: 


m = f(d) € f(x) € f(c) = M. (2.35) 
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The difference between the greatest and smallest value of a continuous function is called its variation 
in the given interval. The notion of variation can be extended to the case when the function does not 
have any greatest or smallest value. 


2.2 Elementary Functions 

Elementary functions are defined by formulas containing a finite number of operations on the indepen- 
dent variable and constants. The operations are the four basic arithmetical operations, taking powers 
and roots, the use of an exponential or a logarithm function, or the use of trigonometric functions or 
inverse trigonometric functions. To distinguish are algebraic and transcendental elementary functions. 
As another type of function, can be defined the non-elementary functions (see for instance 8.2.5, p. 513). 


2.2.1 Algebraic Functions 


In an algebraic function the argument x and the function y are connected by an algebraic equation. It 
has the form 


po(z) + pila)y + po(x)y? +... + Pa(a)y" = 0 (2.36) 
where po, pi... , Pn are polynomials in 2. 
B 3r? — 4ry + x? — 1 = 0, i.e., po(£) = αὖ-- 1, Ρι(α) 4x , po(x) = 0, ρα(α) = δα. 
Tf it is possible to solve an algebraic equation (2.36) for y, then there is one of the following types of the 
simplest algebraic functions. 


2.2.1.1 Polynomials 
Performing only addition, subtraction and multiplication on the argument x then: 

y = Ana” Και ια 1+... + αρ. (2.37) 
holds. In particular one can distinguish y = a as a constant, y = ax + b as a linear function, and 
y = ax? + bx + c as a quadratic function. 
2.2.1.2 Rational Functions 
A rational function can always be written in the form of the ratio of two polynomials: 


à nal 
αμα" Fay 4x3" 7 +... + ao 


y= ; 2.384. 
d bar + bs ιαπ ἳ +... + bo (2-38a) 
The special case 
απ +b 
y= 2.38} 
? cr--d (2:38b) 


is called a homographic or linear fractional function. 


2.2.1.3 Irrational Functions 
Besides the operations enumerated for rational functions, the argument x also occurs under the radical 
sign. 


WA:y—/2r-3, MB: y= /(72— Ίνα. 
2.2.2 Transcendental Functions 


Transcendental functions cannot be given by an algebraic equation like (2.36). In the following para- 
graphs the simplest elementary transcendental functions are introduced. 


2.2.2.1 Exponential Functions 


The variable x or an algebraic function of x is in the exponent of a constant base (see 2.6.1, p. 72). 
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MA: y=e", BB: y=’, mC: y= 30? 5x 
2.2.2.2 Logarithmic Functions 


The function is the logarithm with a constant base of the variable x or an algebraic function of x (see 
2.6.2, p. 73). 
MA: y=Inz, HB: y=lez, B C: y —log,(52? — 3x). 


2.2.2.3 Trigonometric Functions 

The variable x or an algebraic function of x occurs under the symbols sin, cos, tan, cot, sec, cosec (see 

2.7, p. 76). 

MA: y=sinz, WB B: y= cos(2x +3), B C: y= tan yz. 

In general, the argument of a trigonometric function is not only an angle or a circular arc as in the 

geometric definition, but an arbitrary quantity. The trigonometric functions can be defined in a purely 

analytic way without any geometry. For instance one can represent them by an expansion in a series, 
d? 


Ί 
j + y = 0 with the initial values 


or, e.g., the sin function as the solution of the differential equation 


d . | : oe 
y = 0 and Τ = lat x = 0. The numerical value of the argument of the trigonometric function is equal 
dx 
to the arc in units of radians. When dealing with trigonometric functions, the argument is considered 
to be given in radian measure (see 3.1.1.5, p. 131). 
2.2.2.4 Inverse Trigonometric Functions 


The variable x or an algebraic function of x is in the argument of the inverse trigonometric functions 
(see 2.8, p. 85) arcsin, arccos, etc. 


MA: y= arcsinx, W B: y= arccos /1— x. 
2.2.2.5 Hyperbolic Functions 
(see 2.9, p. 89). 


2.2.2.6 Inverse Hyperbolic Functions 
(see 2.10, p. 93). 


2.2.3 Composite Functions 


Composite functions are all possible compositions of the above algebraic and transcendental functions, 
i.e., if a function has another function as an argument. 


lng rona 
BA: y=Insinz, HB: y= lng + Varesin 
x? + 5e” 


Such composition of a finite number of elementary functions again yields an elementary function. The 
examples C in the previous types of functions are also composite functions. 


2.3 Polynomials 
2.3.1 Linear Function 


The graph of the linear function 

y=ar+b (2.39) 
(polynomial of degree 1) is a line (Fig. 2.11a). The proportional factor is denoted by a, the crossing 
point of the line and the axis of the ordinate by b. 
For a > 0 the function is monotone increasing, for a < 0 it is monotone decreasing; for a = 0 it is a 
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: κι. η i b 
polynomial of degree zero, i.e., it is a constant function. The intercepts are at A (- — o) and B(0, b) 
a 


(for details see 3.5.2.6, 1., p. 195). With b = 0 direct proportionality 
y = az; (2.40) 
holds, graphically it is a line running through the origin (Fig. 2.11b). 


E jc 
a) 0! A 


x 
y^ 
0 
b) 0 x a) iC 
Figure 2.11 Figure 2.12 


2.3.2 Quadratic Polynomial 
The polynomial of second degree 
y =ar +br +c (2.41) 


b 
(quadratic polynomial) defines a parabola with a vertical axis of symmetry at £ = E (Fig. 2.12). 
a 
For a > 0 the function is first monotone decreasing, it has a minimum, then it is monotone increasing. 
For a < 0 first it is monotone increasing, it has a maximum, then it is monotone decreasing. In the case 
E yb? — dac 


‘ . J . : . —b4 
b? — 4ac > 0: The intersection points Αι, A» with the x-axis are ( 2 
a 


point B with the y-axis, is at (0,c). In the case b? — 4ac = 0 there is one intersection point (contact 
point) with the z-axis. In the case b? — 4ac < 0 there is no intersection point. The extremum point of 
b 4ac— b? 


2a' 4a 


: o) , the intersection 


the curve is at C (- ) (for more details about the parabola see 3.5.2.10, p. 204). 


Figure 2.13 


2.3.3 Cubic Polynomials 
The polynomial of third degree 
y — ax? + bx? ἠ- cx d (2.42) 
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defines a cubic parabola (Fig. 2.13a,b,c). Both the shape of the curve and the behavior of the function 
depend on a and the discriminant A = 3ac — b?. If A > 0 holds (Fig. 2.13a,b), then for a > 0 the 
function is monotonically increasing, and for a < 0 it is decreasing. If A < 0 the function has exactly 
one local minimum and one local maximum (Fig. 2.13c). For a > 0 the value of the function rises from 
—oo until the maximum, then falls until the minimum, then it rises again to +09; for a < 0 the value 
of the function falls from +oo until the minimum, then rises until the maximum, then it falls again to 
—oo. The intersection points with the x-axis are at the values of the real roots of (2.42) for y = 0. The 
function can have one, two (then there is a point where the x-axis is the tangent line of the curve) or 
three real roots: A;, Αρ and A3. The intersection point with the y-axis is at B(0, d), the extreme points 


b+ V-A_ d+ 205 — 9abe + (6ac — 2b) /—A 
3a? 27a? : 


of the curve C and D, if any, are at ( 


b οὐ) — 9abc , 
3a’ 27a? 


The inflection point which is also the center of symmetry of the curve is at E ( -- 


d. A 
At this point the tangent line has the slope tan y = ( w) 
T) 


=a," 


Y^ — nodd 
- neven 


Figure 2.14 Figure 2.15 


2.3.4 Polynomials of n-th Degree 
The integral rational function of n-th degree 

y = Ont” + a, 42" 3 +... + at + a (2.43) 
defines a curve of n-th degree or n-th order (see 3.5.2.5, p. 195) of parabolic type (Fig. 2.14). 
Case 1, n odd: Fora, > 0 the value of y changes continuously from —oo to +00, and for a, < 0 
from +00 to —oo. The curve can intersect or contact the x-axis up to n times, and there is at least one 
intersection point (for the solution of an equation of n-th degree see 1.6.3.1, p. 43 and 19.1.2, p. 952). 
The function (2.43) has none or an even number up to n — 1 of extreme values, where minima and 
maxima occur alternately; the number of inflection points is odd and is between 1 and n — 2. There 
are no asymptotes or singularities. 
Case 2, n even: Fora, > 0 the value of y changes continuously from +oo through its minimum 
until +00 and for a, < 0 from —oo through its maximum until —oo. The curve can intersect or contact 
the z-axis up to n times, but it is also possible that it never does that. The number of extrema is odd, 
and maxima and minima alternate; the number of inflection points is even, and it can also be zero. 
There are no asymptotes or singularities. 
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Before sketching the graph of a function, it is recommended first to determine the extreme points, the 
inflection points, the values of the first derivative at these points, then to sketch the tangent lines at 
these points, and finally to connect these points continuously. 


2.3.5 Parabola of n-th Degree 
The graph of the function 

y =ar” (2.44) 
where n > 0, integer, is a parabola of n-th degree, or of n-th order (Fig. 2.15). 
1. Special Case a= 1: The curve y = x” goes through the point (0,0) and (1,1) and contacts or 
intersects the x-axis at the origin. For even n the curve is symmetric with respect to the y-axis, and 
with a minimum at the origin. For odd n the curve is symmetric with respect to the origin, and it has 
an inflection point there. There is no asymptote. 


2. General Case a Z0: The curve of y = az" can be got from the curve of y = x" by stretching the 
ordinates by the factor |a|. For a < 0 the curve y = |a|x” is to be reflected with respect to the x-axis. 


2.4 Rational Functions 


2.4.1 Special Fractional Linear Function (Inverse Proportionality) 


The graph of the function 

y=— (2.45) 

τ 

is an equilateral hyperbola, whose asymptotes are the coordinate axes (Fig. 2.16). The point of dis- 
continuity is at x = 0 with y = +oo. If a > 0 holds, then the function is strictly monotone decreasing 
in the interval (--οο, 0) with values from 0 to —oo and also strictly monotone decreasing in the interval 
(0, +20) with values from +oo to 0 (curve in the first and third quadrants). If a < 0, then the func- 
tion is increasing in the interval (—oo , 0) with values from 0 to +00 and also increasing in the interval 
(0. +00) with values from —oo to 0 (dotted curve in the second and fourth quadrants). The vertices A 


+y la|, +y/la|) and (+y lal, —4/ lal) with the same sign for a > 0 and with different sign 


for a < 0. There are no extreme values (for more details about hyperbolas see 3.5.2.9, p. 201). 


ΥΑ 


and B are at 


— 


y 


Figure 2.16 Figure 2.17 


2.4.2 Linear Fractional Function 
The graph of the function 
az + by 


y= a£ + by (o qs 


αι bi 
az bə 


a1b» biaz 0) (2.46) 
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is an equilateral hyperbola, whose asymptotes are parallel to the coordinate axes (Fig. 2.17). 


bo a A 
The center is at C (-2 , 2) . The parameter a in the equality (2.45) corresponds here to — —; with 
a2 ag a2 
a,b bo x AJ |A| αι ΓΔ 
A =|" t|. The vertices of the hyperbola A # 0 and B are at ( | | ΑΙ and 
2 02 az ag 


b, + γΙΔΙ αι - vial 
az ᾿ az 


| , where for A < 0 the same signs are taken, for A > 0 the different ones. 
P ; "— by H : ay 
The point of discontinuity is at x = ——. For A < 0 the values of the function are decreasing from — 
ag ag 
a 1 : : a 
to —oo and from +00 to —. For A > 0 the values of the function are increasing from — to +0 and 
ag ag 
αι 


from —oo to — . There is no extremum. 


a2 
2.4.3 Curves of Third Degree, TypeI 
The graph of the function 
bog ( ax? + br +c 
a+ [E 5 
ge 


E ) (0-0, cz 0) (2.47) 
(Fig. 2.18) is a curve of third degree (type I). It has two asymptotes x = 0 and y = a and it has two 
branches. One of them corresponds to the monotone changing of y while it takes its values between 
a and +00 or —oo; the other branch goes through three characteristic points: the intersection point 
c 2c p? 
with the asymptote y — a at A (; D an extreme point at B (ος -α-- τ) and an inflection 
c 
3c 2b? M . 
b -α-- Fe)” The positions of the branches depend on the signs of b and c, and there 
c 
are four cases (Fig. 2.18). The intersection points D, E with the x-axis, if any, are for a # 0 at 
—b + Vb? — 4ac 
2a 
or none, depending on whether b? — 4ac > 0, = 0 or < 0 holds. 


point at C (- 


c ; = ο 
] o) , fora = Oat (-5 a) ; their number can be two, one (the x-axis is a tangent line) 


2 
For b = 0 the function (2.47) becomes the function y = a + — (see (Fig. 2.21) the reciprocal power), 
F 


, : ] ax +b ] 
and for c = 0 it becomes the homographic function y = ——— , as a special case of (2.46). 
T 


2.4.4 Curves of Third Degree, Type II 
The graph of the function 


1 
=— 0 2.48 
ax? 4- bx +e 0) (2.48) 
b 
is a curve of third degree (type II) which is symmetric about the vertical line x = — A and the x-axis is 
a 


its asymptote (Fig. 2.19), because lim y = 0. Its shape depends on the signs of a and A = 4ac — ἐδ. 
woo 
From the two cases a > 0 and a < 0 only the first one is considered, because reflecting the curve of 
1 


y= ae ο with respect to the x-axis one gets the second one. 
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B 
c>0,b>0 


b) 


c<0,b>0 


Figure 2.18 
Case a) A > 0: The function is positive and continuous for arbitrary values of x and it is increasing 
on the interval (—oo , a2), Here it takes its maximum, a then it is decreasing again in the interval 
Cg «οο). The extreme point A of the curve is at (-z i x) , the inflection points B and C are at 


2a’ A 
b. o VA 3a 
2a ^ Qav/3 A 


α 3 VP 
get tanq = Fa? (5) (Fig. 2.194). 


) ; and for the corresponding slopes of the tangent lines (angular coefficients ) we 


Case b) A = 0: The function is positive for arbitrary values of v, its value rises from 0 to --oo, at 
b 

g= -x zo it has a point of discontinuity (a pole), where lim y = +0. Then its value falls from 
a α- Σαρ 

here back to 0 (Fig. 2.190). 


Case c) A < 0: The value of y rises from 0 to +00, at the point of discontinuity it jumps to —oo, 
and rises to the maximum, then falls back to —oo; at the other point of discontinuity it jumps to +00, 


b 4 
then it falls to 0. The extreme point A of the curve is at (- σα” x) . The points of discontinuity are 
α 
—-b+V-A 
at r= — (Fig. 2.19c). 
a 


2.4.5 Curves of Third Degree, Type III 


The graph of the function 
x 


fo oe (a 40,b40,c40) (2.49) 


y= 
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Figure 2.19 


is a curve of third degree (type III) which goes through the origin, and has the z-axis (Fig. 2.20) as an 
asymptote. The behavior of the function depends on the signs of a and of A = 4ac — b°, and for A < 0 
also on the signs of the roots a and @ of the equation ax? + bx + c = 0, and for A = 0 also on the sign 
of b. From the two cases, a > 0 and a < 0, only the first one is considered because reflecting the curve 


of y = with respect to the x-axis yields the second one. 
r 


(—a)z? — bx — 
Case a) A > 0: The function is continuous everywhere, its value falls from 0 to the minimum, then 
rises to the maximum, then falls again to 0. 
. τ. —b+2./ac ; 1 
The extreme points of the curve, A and B, are at {+,/-, —R ji there are three inflection 
α 


points (Fig. 2.20a). 


Case b) A = 0: The behavior of the function depends on the sign of b, so there are two cases. In both 
cases there is a point of discontinuity at x = "ut both curves have one inflection point. 
a 


e b > 0: The value of the function falls from 0 to —oo, the function has a point of discontinuity, then the 
value of the function rises from —oo to the maximum, then decreases to 0 (Fig. 2.20b,). The extreme 


[3 1 
oint A of the curve is at A | +,/— , ———— |. 
E ( a 2,fac+ ;) 
οὐ «0: The value of the function falls from 0 to the minimum, then rises to +00, running through the 
origin, then the function has a point of discontinuity, then the value of the function falls from 4-oo to 0 


C 1 
Fig. 2.20b;). The extreme point A of the curve is at A | —,/— , -——=—__ |. 
(Fig 2) p ( a` 2γας- ;) 
Case c) A «0: The function has two points of discontinuity, at v = œ and x = f; its behavior 
depends on the signs of a and f. 
e The signs of a and f are different: The value of the function falls from 0 to —oo, jumps up to +00, 
then falls again from --οο to —oo, running through the origin, then jumps again up to 4-oo , then it falls 
tending to 0 (Fig. 2.20c,). The function has no extremum. 
e The signs of a and 8 are both negative: The value of the function falls from 0 to —oo, jumps up 
to +00, from here it goes through a minimum up to 4-oo again, jumps down to —oo , then rises to a 
maximum, then falls tending to 0 (Fig. 2.20c;). 
The extremum points A and B can be calculated with the same formula as in case a) of 2.4.5. 
e The signs of a and B are both positive: The value of the function falls from 0 until the minimum, then 
rises to +00, jumps down to —oo, then it rises to the maximum, then it falls again to —oo, then jumps 
up to 4-oo and then it tends to 0 (Fig. 2.20c;). 
The extremum points A and B can be calculated by the same formula as in case a) of 2.4.5. 
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Figure 2.20 
In all three cases the curve has one inflection point. 
yA 
y 
ael A 
|| Fee : 
s y= 1, SU] — 80 
01. - 
Figure 2.21 Figure 2.22 
2.4.6 Reciprocal Powers 
The graph of the function 
y= D lag (n » 0, integer; a 7 0) (2.50) 


g” 
is a curve of hyperbolic type with the coordinate axes as asymptotes. The point of discontinuity is at 
x = 0 (Fig. 2.21). 
Case a) For a > 0 and for even n the value of the function rises from 0 to +00, then it falls tending 
to 0, and it is always positive. For odd n it falls from 0 to —oo, it jumps up to +00, then it falls tending 
to 0. 
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Case b) For a < 0 and for even n the value of the function falls from 0 to —oo, then it tends to 0, and 
it is always negative. For odd n it rises from 0 up to +00, jumps down to —oo, then it tends to 0. 

The function does not have any extremum. The larger n is, the faster the curve approaches the x-axis, 
and the slower it approaches the y-axis. For even n the curve is symmetric with respect to the y-axis, 
for odd n it is center-symmetric and its center of symmetry is the origin. The Fig. 2.21 shows the cases 
n = 2 and n = 3 for a = 1. 


2.5 Irrational Functions 
2.5.1 Square Root ofa Linear Binomial 


The union of the curve of the two functions 


y=+var+b (α-:0) (2.51) 


: : : . : b . 
is a parabola with the x-axis as the symmetry axis. The vertex A is at (-2. o); the semifocal chord 
a 


a 
(see 3.5.2.10, p. 204) is p = z The domain of the function and the shape of the curve depend on the 


sign of a (Fig. 2.22) (for more details about the parabola see 3.5.2.10, p. 204). 


2.5.2 Square Root ofa Quadratic Polynomial 


The union of the graphs of the two functions 


y = Var? +br+c (a#0, A= 4ας-- b? 40) (2.52) 


is for a < 0 an ellipse, for a > 0 a hyperbola (Fig. 2.23). One of the two symmetry axes is the x-axis, 


the other one is the line z = ——. 
2a 
ba A b ΙΔ ; 
The vertices A, C and B, D are at | - ———— cR }], where A = 4ac — b?. 
2a 4α 
y B 
ς > 
x 
D 
a) a<0,A<0 b) “!a>0,A>0 c) a>0,A<0 


Figure 2.23 


The domain of the function and the shape of the curve depend on the signs of a and A (Fig. 2.23). 
For a < 0 and A > 0 the function has only imaginary values, so no curve exists (for more details about 
the ellipse and hyperbola see 3.5.2.8, p. 199 and 3.5.2.9, p. 201). 


2.5.3 Power Function 


The power function 


y = aa^ = ag tmn (m, n integer, positive, coprime) (2.53) 
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is to be discussed for k > 0 and for Κ < 0 (Fig. 2.24, Fig. 2.25). The investigation here can be 


restricted to the case a = 1, because for a Z 1 the curve differs from the curve of y = z^ only by a 
stretching in the direction of the y-axis by a factor |a|, and for a negative a by a reflection to the x-axis. 

A 2 3 

f 1/3 d ΥΞΧ᾽ Ῥ y=x? 

=x 
> + > + > 
01 x 0| 1 x OJN x 
: 
-—X 
1/2 y 
b) c) d) 


Figure 2.24 


Case a) k > 0, y = ο/η. The shape of the curve is represented in four characteristic cases 
depending on the numbers m and n in Fig. 2.24. The curve goes through the points (0,0) and (1, 1). 
For k > 1 the z-axis is a tangent line of the curve at the origin (Fig. 2.24d), for k < 1 the y-axis is a 
tangent line also at the origin(Fig. 2.24a,b,c). For even n the union of the graph of functions y = 
may be considered: it has two branches symmetric to the x-axis (Fig. 2.24a,d), for even m the curve 
is symmetric to the y-axis (Fig. 2.24c). If m and n are both odd, the curve is symmetric with respect 
to the origin (Fig. 2.24b). So the curves can have a vertex, a cusp or an inflection point at the origin 
(Fig. 2.24). None of them has any — 


-3/2 
-1 Vedi AX NE 


Figure 2.25 


Case b) k < 0, y = α-η/η; The shape of the curve is represented in three characteristic cases 
depending on m and n in Fig. 2.25. The curve is a hyperbolic type curve, where the asymptotes 
coincide with the coordinate axes (Fig. 2.25). The point of discontinuity is at v = 0. The greater |k| 
is the faster the curve approaches the x-axis, and the slower it approaches the y-axis. The symmetry 
properties of the curves are the same as above for k > 0; they depend on whether m and n are even or 
odd. There is no extreme value. 


2.6 Exponential Functions and Logarithmic Functions 


2.6.1 Exponential Functions 
The function 

y=a" =e" (a>0, b=Ina), (2.54) 
is called the exponential function and its graphical representation the exponential curve (Fig. 2.26). 
From (2.54) for a = e follows the function of the natural exponential curve 

y-e. (2.55) 
The function has only positive values. Its domain is the interval (—oo, +00). For a > 1, i.e., for b > 0, 
the function is strictly monotone increasing and takes its values from 0 until oo. For a « 1, i.e., for 
b < 0, it is strictly monotone decreasing, its value falls from oo until 0. The larger |b| is, the greater is 
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the speed of growth and decay. The curve goes through the point (0, 1) and approaches asymptotically 
the z-axis, for b > 0 on the right and for b < 0 on the left, and faster for greater values of |b|. The 


T 
function y = a * = (9) increases for a < 1 and decreases for a > 1. 
a 
x ποπ. YA. x x 
eme [em " 
DIES [517] y-log;x-lb x 


y=log.x=In x [& 
e P EEA ο y-log,x-lg is 


© 
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Figure 2.26 Figure 2.27 


2.6.2 Logarithmic Functions 


The function 

y =logax (a>0, a#1) (2.56) 
gives the logarithmic curve (Fig. 2.27); the curve is the reflection of the exponential curve with respect 
to the line y = x. From (2.56) for a = e follows the curve of the natural logarithm 

y-—lhnz. (2.57) 
The real logarithmic function is defined only for x > 0. For a > 1 it is strictly monotone increasing 
and takes its values from —oo to +00, for a < 1 it is strictly monotone decreasing, and takes its values 
from +00 to —oo, and the greater | In αἰ is, the faster the growth and decay. The curve goes through the 
point (1, 0) and approaches asymptotically the y-axis, for a > 1 down, for a < 1 up, and again faster 
for larger values of | In a]. 


2.6.3 Error Curve 


The function 

y = ear)” (2.58) 
gives the error curve (Gauss error distribution curve) (Fig. 2.28). Since the function is even, the y- 
axis is the symmetry axis of the curve and the larger |a] is, the faster it approaches asymptotically the 
a-axis. It takes its maximum at zero, and it is equal to one, so the extreme point A of the curve is at 


1 1 
0, 1), the inflection points of the curve B, C are at | +—— , — |. 
(0,1) à ( av2 ' Ve ) 


The slopes of the tangent lines are here tan y = xay 2/e. 


A very important application of the error curve (2.58) is 
the description ofthe normal distribution properties of the 
observational error (see 16.2.4.1, p. 818.): 


y = p(z) = — exp ( Z) ; (2.59) 


Figure 2.28 
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2.6.4 Exponential Sum 


The function 


y = ae" + ce” (2.60) 


. Ol x 
sign a=sign c signa = sign c signa + sign c 
iei πες d sign b + sign d sign b = sign d 


a) b) c) 
Figure 2.29 


is represented in Fig. 2.29 for the characteristic sign relations. The sum of the functions is got by 
adding the ordinates of the curves, i.e., the summands are yi = ac" and y» = ce*. The function is 
continuous. If none of the numbers a, b, c, d is equal to 0, the curve has one of the four forms represented 
in Fig. 2.29. Depending on the signs of the parameters it is possible, that the graphs are reflected over 
a coordinate axis. 
The intersection points A and B of the curve with the y-axis and with the x-axis are at (0, a + c), and 
In (—a/c) 
at 


1 ab 
a 9) respectively, the extremum C is at « = d-b In (- “) , and the inflection point D 
= - c 


1 ab? . l 
is at x = —— In| ——,, |, in the case when they exist. 

d—b cd 
Case a) The parameters a and c, and b and d have the same signs: The function does not change its 
sign, it is strictly monotone; its value is changing from 0 to +00 or to —oo or it is changing from +00 
or from —oo to 0. There is no inflection point. The asymptote is the z-axis (Fig. 2.29a). 
Case b) The parameters a and c have the same sign, b and d have different signs: The function does 
not change its sign and either comes from +00 and arrives at +00 and has a minimum or comes from 
—oo, goes to —oo and has a maximum. There is no inflection point (Fig. 2.29b). 


Case c) The parameters a and c have different signs, b and d have the same signs: The function has 
one extremum and it is strictly monotone before and after. It changes its sign once. Its value changes 
whether from 0 until the extremum, then goes to +00 or —oo or it comes first from +00 or —oo, takes 
the extremum, then approaches 0. The z-axis is an asymptote, the extreme point of the curve is at C 
and the inflection point at D (Fig. 2.29c). 

Case d) The parameters a and c and also b and d have different signs: The function is strictly monotone, 
its value rises from —oo to +œ or it falls from --oo to —oo. It has an inflection point D (Fig. 2.29d). 


2.6.5 Generalized Error Function 


'The curve of the function 


y = aexp(bz + ca?) = aexp (σα) exp ( (« + x) | (ασ 0) (2.61) 
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Figure 2.30 


can be considered as the generalization of the error function (2.58); it results in a symmetric curve with 

respect to the vertical line x = — πο it has no intersection point with the x-axis, and the intersection 
ο 

point D with the y-axis is at (0, α) (Fig. 2.30a,b). 

The shape of the curve depends on the signs of a and c. Here only the case a > 0 is discussed, because 

the curve for a < 0 is got by reflecting it in the x-axis. 

Case a) c > 0: The value of the function falls from +oo until the minimum, and then rises again to 


b p 
+oo. It is always positive. The extreme point A of the curve is at (-z: ,aexp ( τ) and it corre- 
c ο 


sponds to the minimum of the function; there is no inflection point or asymptote (Fig. 2.304). 


b 
Case b) ς < 0: The z-axis is the asymptote. The extreme point A of the curve is at ο. 
ο 


9 
a exp (- τ) and it corresponds to the maximum of the function. The inflection points B and C 
P 


are at 


(45> ace ο) (Fig 2.509). 


2c 4c 


2.6.6 Product of Power and Exponential Functions 


The function 

y = are (2.62) 
is discussed here only in the case a > 0, because in the case a < 0 the curve is got by reflecting it in the 
x-axis. For a non-integer b the function is defined only for x > 0, and for an integer b the shape of the 
curve for negative x can be deduced also from the following cases (Fig. 2.31). 
Fig. 2.31 shows how the curve behaves for arbitrary parameters. 
For b > 0 the curve passes through the origin. The tangent line at this point for b > 1 is the z-axis, for 
b = 1 the line y = x, for 0 < b < 1 the y-axis. For b < 0 the y-axis is an asymptote. For c > 0 the 
function is increasing and exceeds any value, for c < 0 it tends asymptotically to 0. For different signs 


i b : ; 
of b and c the function has an extremum at 2 = —- (point A on the curve). The curve has either no or 
p 


b 
one or two inflection points at 2; = — (points C and D see Fig. 2.31c,e,f,g). 
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Figure 2.31 


2.7 Trigonometric Functions (Functions of Angles) 


2.7.1 Basic Notions 


2.7.1.1 Definition and Representation 


1. Definition 


The trigonometric functions are introduced by geometric considerations. So in their definition and also 
in their arguments degree or radian measure is used (see 3.1.1.5, p. 131). 


2. Sine 
The standard sine function 
y —sinx (2.63) 
is a continuous curve with period T' — 27 (see Fig. 2.928). 
„a. 
Figure 2.32 
The intersection points Bo, By, B-1, B», Β.»,..., with By = (kr, 0) (k = 0,1, +2, ...) of the stan- 


dard sine curve and the x-axis are the inflection points of the curve. Here the angle of slope of the tan- 


5 à S π $ ? 
gent line with the 2 axis is Tq The extreme points of the curve are at Cy, C1, C 4, Co, C_2,... with 


1, 


Cy = ((k + i)m, (-1)*) (k =0, 
The general sine function 
y = Asin(wz + qo) 


2 J For every value of the function y there is —1 < y < 1. 


(2.64) 
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with an amplitude |A|, frequency w, and phase shift yo is represented in Fig. 2.32b. 
Comparing the standard and the general sine curve (Fig. 2.32b) it can be seen that in the general 
case the curve is stretched in the direction of y by a factor |A|, in the direction of x it is compressed 


1 2π 
by a factor —, and it is shifted to the left by a segment #0 The period is T = —. The intersection 
ω ω ω 


kr — 
points with the x-axis are Bj ( Vo : ) (k = 0,£1,+2,...). The extreme points are Cj, = 
ω 


(στο), 1*A (k — 0, 


[ni 


d 
L 


3. Cosine 
The standard cosine function 


y = cosx = sin(x + 5) (2.65) 


is represented in Fig. 2.33. 


The intersection points with the x-axis Figure 2.33 


1 
Bo, Bi, Bo,..., Be = (( + 5) T, 0) (k = 0, +1, +2,...) are also the inflection points. The angle of 


slope of the tangent line is +—. 
The extreme points are Co, C4, ..., Cy = (kr, (C1)*) (k = 0, +1, +2,...). 


The general cosine function 


y = Acos(wa + qo) (2.66a) 
can be transformed into the form 
y = Asin (ως + Yo + 5) , (2.66b) 


i.e., the general sine function shifted left by y = 2 


4. Tangent 
The tangent function 
sinc 
y= tang = 2.67 
d à cos x Επ 


1 
has period T = π and the asymptotes are x = (t + 5) m (k = 0,+1,+2,...) (Fig. 2.34). The 
gt kr +5 + kr) (k =0, 
values from —oo to +00. The curve has intersection points with the x-axis at Ag, Αι, A 4, Ao, A_2,..., 
A; = (ἐπ.0) (k = 0,1, 2,...), these points are the inflection points and the angle of slope of the 


= 


function is monotone increasing in the intervals 2,...) and takes 


π 
tangent line is T 


5. Cotangent 
The cotangent function 
iu 1 
y = cotz ο. tan (s + 5) (2.68) 


sing tang 
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i 
a 


Figure 2.34 


Figure 2.35 


has a graph which is the tangent curve reflected with respect to the x-axis and shifted to the left by 


s (Fig. 2.35). The asymptotes are x = km (k = 0, +1, +2,. 


monotone decreasing and takes its values from +00 until —oo; 


..). Between 0 and π the function is 


the function has period T = π. The 


1 
intersection points with the x-axis are at Ao, Αι, A-1, Ao, A_2,... with Αμ = ((x + 5) T, 0) (k = 


6. Secant 
The secant function 
1 
y = sec x = (2.69) 
cos x 


1 
has period T = 27, the asymptotes are 7 = (r + 5) π (k= 


0,+1,+2,...); and obviously |y| > 1 holds. The extreme points 


corresponding to the maxima of the function are Αρ, A1, A. 1... 


Figure 2.36 Figure 2.37 


T 
0, +1, £2,...), they are the inflection points of the curve and here the angle of the tangent line is —— . 


A T 
Το] 


vesec 
m SEC 


xy 
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Figure 2.38 


with Αμ = ((2k + 1)r,—1) (k = 0, +1, 2,.. J be: extreme re Wr to the minima of 


the function are Bo, Bi, Β ,,... with B= (ἐπ, +1) (k — 0, 


1. ..) (Fig. 2.36). 


7. Cosecant 
The cosecant function 
1 


y —cosecm = = 
sin £ 


(2.70) 
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π 
has a graph which is the graph of the secant shifted to the right by x = z The asymptotes are 


x = kr (k = 0,+1,+2,...). The extreme points corresponding to the maxima of the function are 


Ak +: 
Ao, Αι, A-1,... with A, = ( - in 1) (k = 0,+1,+2,...) and the the points corresponding 


2 
Ak +1 
2 


to the minima of the function are Bo, Bi, B. 4,... with Bj = 


τα) (k = 0,41,+2,...) 
(Fig. 2.37). 
2.7.1.2 Range and Behavior of the Functions 


1. Angle Domain 0 < x < 360? 

'The six trigonometric functions are represented together in Fig. 2.38 in all the four quadrants for a 
complete domain of angles from 0? to 360° or for a complete domain of radians from 0 to 27. 

In Table 2.1 there is a review of the domain and the range of these functions. The signs of the functions 
depend on the quadrant where the argument is taken from, and these are reviewed in Table 2.2. 


Table 2.1 Domain and range of trigonometric functions 


Domain Range Domain Range 


-l<sina <1 x # (2k 17 —oo « tana « oo 
—1 < cosx < 1 ασε kr —oo < cot z < oo 
(k =0,+1,+2,...) 


-οο «s«o { 


2. Function Values for Some Special Arguments (see Table 2.3) 
3. Arbitrary Angle 


Since the trigonometric functions are periodic (period 360° or 180°), the determination of their values 
for an arbitrary argument x can be reduced by the following rules. 

Argument x > 360° or x > 180°: If the angle is greater than 360° or greater than 180°, then it is 
to be reduced for a value a, for which 0 € a € 360° or 0 € a < 180° holds, in the following way (n 
integer): 


sin(360* - n +a) — sina, (2.71) cos(360* : n +a) = cosa , (2.72) 
tan(180°-n+a)=tana, (2.73) cot(180* - n + a) = cota. (2.74) 
Table 2.2 Signs of trigonometric functions 
| Quadrant Angle sin cos tan cot sec csc 
I from 0° to 90° H I I 
Il from 90° to 180° | 
HI from 180° to 270° I | 
IV from 270° to 360° 
Argument æ < 0: If the argument is negative, then the following formulas reduce the calculations to 


functions for positive argument: 


sin(—a) = —sina, (2.75) cos(—a) = cosa, (2.76) 


tan(—a) = — tana, (2.77) cot(—a) = —cota. (2.78) 
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Table 2.3 Values of trigonometric functions for 0°, 30°, 45°, 60° and 90°. 


Angle | Radian | sin | cos | tan | cot | sec | csc 


0° 0 ο 1 0 σου). 1 | Fæ 
1 1 | V3) νᾶ 2/3 
5 Ξ ο DI aem 2 
D ο la lala | Via 
45° LN v2 {v2 1 1 V2 | V2 
4 | 9 
1 V3} 1 V3 2/3 
à 1 νο! 1 M9 | 3 |273 
τ 3 la ον ση 3 
1 
90° 37 1 0 | too 0 +00 T 


Table 2.4 Reduction formulas and quadrant relations of trigonometric functions 


Function | x = 90° +a | x = 180° +a | x = 270° +a | x = 360° — a 


sin x + cosa Fsina — cosa —sino 
cos 1 Ἔδιπα — cosa +sina + cosa 
tan x + tana F cota —tana 
cot x F tana cota F tana — cota 


Argument g for 90° < x < 360°: If 90° < x < 360° holds, then the arguments are to be reduced 
for an acute angle a by the reduction formulas given in Table 2.4. The relations between the values 
of the functions belonging to the arguments which differ from each other by 90°, 180° or 270° or which 
complete each other to 90°, 180° or 270° are called quadrant relations. 

The first and second columns of Table 2.4 give the complementary angle formulas, and the first and 
third ones give the supplementary angle formulas. Because x = 90° — a is the complementary angle 
(see 3.1.1.2, p. 130) of a, the following relations 


cosa = sin x = sin(90? — a), (2.792) sina = cos x = cos(90° — a) (2.79b) 
are called the complementary angle formulas. 
For a + x = 180° the relations between the trigonometric functions for supplementary angles (see 
3.1.1.2, p. 130) 

sina = sin x = sin(180° — a), (2.80a) cosa = cos x = cos(180* — a) (2.80b) 


are called supplementary angle formulas. 

Argument g for 0? < x < 90°: The values of trigonometric functions for acute angles (0° < x < 
90°) have been taken formerly from tables, today calculators are used. 

E sin(—1000°) = — sin 1000° = — sin(360° - 2 + 280°) = — sin 280° = + cos 10° = +0.9848. 


4. Angles in Radian Measure 
The arguments given in radian measure, i.e., in units of radians, can be easily converted by formula 


(3.2) (see 3.1.1.5, p. 131). 
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2.7.2 Important Formulas for Trigonometric Functions 


Remark: Trigonometric functions with complex argument z are discussed in 14.5.2, p. 759. 


2.7.2.1 Relations Between the Trigonometric Functions 


sin? a -- cos?a — 1, (2.81) sec? a — tan? a = 1, (2.82) 
cosec? a — cot? a = 1, (2.83) sina:coseca = 1, (2.84) 
cosa seca = 1, (2.85) tana cota — 1, (2.86) 
si cosa 
unir NE tana, (2.87) - cm cota. (2.88) 
cosa sina 


Some important relations are summarized in Table 2.5 for 0 < a < 7/2 in order to create an easy 
survey. For other intervals in Table 2.5 the square roots are always considered with the sign which 
corresponds to the quadrant where the argument is. 


2.7.2.2 Trigonometric Functions of the Sum and Difference of Two Angles 


(Addition Theorems) 
sin(a+) = sina cos f 3: cos a sin £,(2.89) cos(a B) = cos a cos 8 Fsin a sin β.(3.90 
tana + tan 8 cotacot ô F1 
tan(a + β) = ———————, 2.91 cot(a + B) = ————————, 2.92 
mh) 17F tanatan 2’ ( ) μὴ cot β + cota ' ( 
sin(a + B + y) = sina cos f cosy + cos asin f cos y 
+ cos a cos B sin y — sin a sin f sin y, (2.93 
cos(a + B + y) = cosa cos B cos y — sin a sin B cos y 
— sin a cos B sin  — cos a sin f) sin y. (2.94 
2.7.2.3 Trigonometric Functions of an Integer Multiple of an Angle 
sin2a — 2sina cosa, (2.95) cos 2a = cos? a — sin? a, (2.97 
οἴη δα = 3sina — 4sin? a, (2.96) cos 3a = 4cos? a — 3cosa, (2.98 
sin 4a = 8cos? asina — 4coso sino, (2.99) cos 4a = 8cos!a — 8cos?a 4-1, (2.100 
2tana cot? a — 1 
tan 2a = ——__, 2.101 y= 2 
er T (2.101) cot 2a = T o (2.104 
3tana — tan? a cot? a — 3cot a 
tan 3a = — —————— 2.102 ot 3a = ------π------- : 
pon 1—-3tan?a ( ) ως 3cot^a — 1 (2105 
4tana — 4tan?a cotta — 6cot?a+1 
tan 4a = - 2.103 = - ; i 
1—6tan? a + tant a, ( ) otia 4cot® a — 4 cot a (2:106 
For larger values of n in order to gain a formula for sin na and cos na the de Moivre formula is to be 


used (see 1.5.3.5, p. 38). 
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Using the binomial theorem ( 


cosna@ +isinna = . 1 cost cos"-* asin! a = 


(see 1.1.6.4, p. 12) gives: 


= cos" a + in cos"! a sin a 


(cosa + sin a)” 


n n—2 n n—3 d n n—4 int 
= 2 COS Q sin? a-i 3 cos” asin’ oa + 4 cos asm a-c.... 


With this it follows: 


(2.107) 


n μμ n "VP n DOE 
cos na = cos" a — ( ) cos"? o sin? a + (;) cos" a sint a — (2) cos” asinfa +... , 


2 


9 


: αι n n—3 |, cid n n—5 n an5 
sinna = n cos”™* asin a — 3 cos" asin” a + [.|cos" "asina —.... 


(2.108) 


(2.109) 


" å r n 5 à π 
Table 2.5 Relations between the trigonometric functions of the same argument in the interval 0 < a < 5 


α sina cos a tana cota 
g 3 tana 
sina — v1 — cos? a j s 
V1+tan?a | V1 + cot? a 
= 1 cota 
cosa vl -sina 7 > 
V1-tan?a | V14- cot?a 
sina y1 — cos? a 
tana == 
ν]-- sin? α cosa cota 
Vi—sin2a cosa 1 
cota - - = 
sina y1 — cosa tana 


2.7.2.4 Trigonometric Functions of Half- Angles 


In the following formulas the sign of the square root must be chosen positive or negative, according to 


the quadrant where the half-angle is. 
. a 1 

sin = σί — cosa), (2.110) 
a 1—cosa 1— cosa sina 

tan = = = - = ; 
2 1 + cosa sina 1+ cosa 
α 1l+cosa 1+cosa sina 

cot = = = - = : 
2 1— cosa sina 1—cosa 


a 


cos — = 


1 
ju + cos o), 


(2.111) 


(2.112) 


(2.113) 
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2.7.2.5 Sum and Difference of Two Trigonometric Functions 


B ν-β (EB -β 
sina + sin B = 2811 a cos 2 5 —,(2.114) sina —sin f — 2cos 2 sin Ž g (2.115) 
v+ B -β zv (— B 
cos a+ cos 8 = 2605 2 cos 2 5 —,(2.116) cosa — cos B 2sin 27^ sin ^ 5 —, (2.117) 
[> i es B si [2 B 
tana + tan 8 = sin(a + £) 2.118) cota + cot 8 = sin(a — ) (2.119) 
cos a cos 3 sina sin 8 
s(a — B :os(a + { 
tana + cot B = cos(a = b) 2.120) cota — tan B = cos(a + £) (2.121) 
cos asin 3 sin a cos 8 
2.7.2.6 Products of Trigonometric Functions 
1 
sin o sin 8 g [costa B) — cos(a + β)]. (2.122) 
ee l 
cos a cos B gleos(a B) + cos(a + B)], (2.123) 
: Ls. ; ; 
sina cos B = zala B) + sin(a + β)], (2.124) 
: : ; Te : 
sin a sin f sin y gee + 6—y)+sin(8+7-a) 
tsin(y +a — 8) — sin(a + B + 5] (2.125 
sina cos B cos y = —[sin(a + B — η) — sin(B + y — a) 
+sin(y +a — 8) + sin(a + B + 4], (2.126 
1 
sin a sin B cosy = jl cos(a + B — y) + cos(B +7 — a) 
+cos(y + a — 8) — cos(a + B + γ)], (2.127 
1 
cos a cos β cosy = qleos(a + B — «) 9 cos(B 4- y — a) 
+ cos(y + a — B) + cos(a + B + γ)]. (2.128 
2.7.2.7 Powers of Trigonometric Functions 
22 1 2 1 
sin? a = σα — cos 2a), (2.129) cos” a = σα + cos 2a), (2.130) 
23 lua ia ; 3 1 
sin” a = 18 sina — sin 3a), (3.131) cos” a = 4 (cos 3a--3cosa), (2.132) 


1 1 
sinf a = g (cos 4a — 4cos 2a + 3), (2.133) cost a = 3 (eos da + 4cos 2a + 3). (2.134) 
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For large values of n sin" a and cos” a , can be expressd by applying the formulas for cos na and sin na 


(see 2.7.2.3, p. 82). 


2.7.3 Description of Oscillations 
2.7.3.1 Formulation of the Problem 
In engineering and physics one often meets quantities depending on time and given in the form 

u(t) = Asin(wt + p). (2.135) 
They are called also sinusoidal quantities. Their dependence on time results in a harmonic oscillation. 
The graphical representation of (2.135) results in a general sine curve, as shown in Fig. 2.39. 


ev 
c 


ΖΩ) 


Figure 2.39 Figure 2.40 
The general sine curve differs from the simple sine curve y = sin x: 
a) by the amplitude A, i.e., the greatest distance between its points and the time axis t, 


2m 
b) by the period T = —, which corresponds to the wavelength (with w as the frequency of the oscilla- 
ω 
tion, which is called the angular or radial frequency in wave theory), 
c) by the initial phase or phase shift by the initial angle yp 4 0. 
The quantity u(t) can also be written in the form 
u(t) = asin wt + bcos wt. (2.136) 
b 
Here for a and b A = Va? + b? and tan y = — holds. The quantities a, b, A and φ can be represented 
a 
as sides and angle of a right triangle (Fig. 2.40). 
2.7.3.2 Superposition of Oscillations 
In the simplest case the superposition of oscillations is 
the addition of two oscillations with the same frequency. 
It results again in a harmonic oscillation with the same 
frequency: 
l Ai sin(wt+1)+A2 sin(wt+y2) = A sin(wt+y)(2.137a) 
ot with 


A= VA? + A? + 24 As cos(ipa — p1), (2.137b) 
— Αι sin q1 + A» sin p2 (2.1370) 
* A1 cos q1 + A» cos Φα 
) where the quantities A and ᾧ can be determined by a 
Figure 2.41 vector diagram (Fig. 2.414). 


A linear combination of several sine functions with the same frequency is also possible and yields a 
general sine function (harmonic oscillation) with the same frequency: 


Y eiAsin(wt + pi) = Asin(wt + p). (2.138) 
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2.7.3.3 Vector Diagram for Oscillations 

The general sine function (2.135, 2.136) can be represented easily by the polar coordinates p = A, p 
and by the Cartesian coordinates x = a, y = b (see 3.5.2.1, p. 190) in a plane. The sum of two such 
quantities then behaves as the sum of two summand vectors (Fig. 2.41a). Similarly the sum of several 
vectors results in a linear combination of several general sine functions. This representation is called a 
vector diagram. 

The quantity u for a given time t can be determined from the vector diagram with the help of Fig. 
2.41b: First the time axis OP(t) has to be put through the origin O, which rotates clockwise around 
O by a constant angular velocity w. At start t = 0 the axes y and 1 coincide. Then at any time t 
the projection ON of the vector d onto the time axis is equal to the absolute value of the general sine 
function u = Asin(wt + p). For time t = 0 the value ug = Asin y is the projection onto the y-axis 
(Fig. 2.41b). 


2.7.3.4 Damping of Oscillations 
The function u(t) = Ae“ sin(wt + yo) (a, t > 0) (2.139) 


yields the curve of a damped oscillation 
(Fig. 2.42). 

The oscillation proceeds along the t-axis, 
while the curve asymptotically approaches 
the t-axis. The sine curve is enclosed by the 
exponential curves u(t) = +Ae~, and it 
contacts them in the points 


1 
DLE 


ω 


uA 


Ao, Αι, Ao, ..., Ak = 


1 
ρω. 
(—1)^A exp | -a2——£————— 
Figure 2.42 
The intersection points with the coordinate axes are B = (0, Asin qo), Co, C1, Ca, ..., Ck = 


km — φο τα 
ω 


kr- y 
(A, o) . The extrema Do, Di , Do,... are at tk = ; and the inflection points Eo, Ει, 
w 


] ω 
with tana = —. 
a 


kn — po + 2a 
Eo,... are at tk = προς 
ω 


] E ; s T : 
The logarithmic decrement of the damping is ὃ = In = a—, where y; and y;,4 are the ordinates 
ω 


of two consecutive extrema. 


2.8 Cyclometric or Inverse Trigonometric Functions 

The cyclometric functions or arcus functions are the inverses of the trigonometric functions. For a 
unequivocal definition the domain of the trigonometric functions is to be decomposed into monotony 
intervals, to get an inverse function for every monotony interval. So, there are infinitely many such 
intervals, and for each its inverse is to be defined. In order to distinguish them an index k is to be 
assigned according to the corresponding interval. Obviously the trigonometric inverse functions are 
monotony in these intervals. 


2.8.1 Definition of the Inverse Trigonometric Functions 


How to define the inverse trigonometric functions will be shown here for the inverse of the sin function 
(Fig. 2.43). The usual notation for it is arcsin z. The domain of y = sin x will be split into monotony 
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Figure 2.43 Figure 2.44 Figure 2.45 Figure 2.46 


T T 
intervals kr — — € x < kr + => with k = 0, +1, £2,.... Reflecting the curve of y = sin z in the line 


y = x yields the curve of the inverse function 
y = arc, sin £ (2.1402) 


with the domains and ranges 


—-]zzt€-4l and kr- 7 <y<kr+ S where k=0,+1,+2,.... (2.140b) 


The form y = arc; sin x has the same meaning as x = sin y. 
Similarly, one can get the other inverse trigonometric functions which are represented in Fig. 2.44— 
2.46. The domains and ranges of the inverse functions can be found in Table 2.6. 


2.8.2 Reduction to the Principal =| SINT RM. 
Value 


In their domain the arcus functions have the so-called 
principal values for k = 0, written usually without an ^ -------------- T ——X— S 
index, g.e., as arcsinx = arcosinz. In Fig. 2.47 the 
principal values of the inverse functions are presented. 


arctan 


m arccot 

The values of the different inverses can be calculated > 
from the principal values by the following formulas: -1 1 x 
arc, sin x = kr + (—1)* arcsin x. (2.141) arctan 

π 
dris Bon = (k +1)r — arecosz (k odd), (2.142) 2 

kr + arccos x (k even). i 
Figure 2.47 

arc, tan x = kr + arctan 2. (2.143) 
arc, cot 2; = kr + arccot 2. (2.144) 


ΒΑ: arcsin0 = 0, arc; sinO = kr. 

α B: arccot 1 = 3i arc, cot 1 = τ + ἔπ. 
1 1 

B C: arccos 3 = s arc, COS 2 = = + (k + 1)r for odd k, 

= a + kr for even k. 


Remark: Calculators give the principal values of the trigonometric inverses. 
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Table 2.6 Domains and ranges of the inverses of trigonometric functions 


Trigonometric 
Inverse function Domain Range function with the 
same meaning 
arc sine T T ΐ 
; -l<a< T—-—<y<kr+— gs 
i CHANCE l<a2<l kr PESE 2 r-—siny 
arc cosine 
-l<a<l kr € y € (k 4-1) x = cosy 
Y = arc, COS T 
are vangent —o00 < $ « oo kr- = < «ἐπε pr tani 
y = arc, tan x 2 y d 9 = tang 
arc cotangent 
—oo«z «oo kr < y € (k 4-1) z= cot y 
y = arck cot 2 
k= NEE For k = 0 one gets the principal value of the inverse functions, 
which is usually written without an index, e.g., arcsin 1; = arco sin 2. 


2.8.3 Relations Between the Principal Values 


. π 2 —arccosV/1l—2a? (-l<a<0), 
arcsin 2; arccos x = arctan ÉL - (2.145 
2 vl- 2 arccos ΝΊ — a? (0 € x « 1). 
T : x m —arcinyl—a? (t#-1l<a<0), 
arccos t = arcsin 2; = arccot == . - (2.146 
2 V1 - x? arcsin v1 — 2? (0 € x x 1). 
T g 
arctan y = arccot r = arcsin ; 2.147 
2 v1 +x? ( 
ΓΝ Gen cos—— (x <0) 
arccot -—* ας — arccos 7, ——3 WSV), 
arctan 2; = 1 = A = (2.148 
arccot — r0 arccos ———— 2; 20). 
x ( ) V1+2? (720) 
ta = 2 — arct 7 (2.149 
arccot 2; = arctan 2; = arccos . 
2 v1-4 a? 
1 -— 
arctan—+7 (x <0) 7 — arcsin 
arccot x = 1 = (2.150 
arctan — r0 arcsin ———— x 0). 
a We τ ιν 
2.8.4 Formulas for Negative Arguments 
arcsin(—x) = — arcsin x. (2.151) arccos(—x) = π — arccos m. (2.153) 


arctan(—x) = — arctan x. (2.152 


© 


arccot(—x) = m — arccot x. (2.154) 
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2.8.5 Sum and Difference of arcsin x and arcsin y 


arcsin 2; + arcsin y = arcsin (νι-ν + yvi-#) (ay € 0 or r? +y « 1), 
= 7 — arcsin (cyi-¥ + yvi— a) (x >0,y>0, x? +y?>1), (2.155b 
= —r — arcsin (si + yvī =a) (z <0,y <0, z? -- y? > 1). (2.155c 

arcsin x — arcsin y = arcsin CE = νντ--5) (ry > 0 ora? +y? € 1), 
= 7 — arcsin (s -y- να) (150, <0, 2? +y?>1), (2.156b 
= —7 — arcsin (cyi-¥ - wi-a) (x « 0, y » 0, z? +y? > 1). (2.156c 


2.8.6 Sum and Difference of arccos z and arccos y 


arccos 2; + arccos y = arccos (ου ν 1 - αγ] y) (x y 20) 


= 2r — arccos (ey V1 — 24/1 y) (x 4- y « 0). 


arccos x — arccos y = — arccos (ey Ἔν] - αγ. -ν ) (a > y) 
= arccos (y + VE ah - y?) (a <y). 


2.8.7 Sum and Difference of arctan z and arctan y 


(xy < 1), 


u+y 
arctan x + arctan y = arctan Τ 


ty 


= 7 + arctan τ (x >0, xy > 1), 


e+ 
7 + arctan 7 — (x <0, xy > 1). 
x 


ry 
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arctan x — arctan y = arctan 


2 -- 
= T + arctan 


1+ ay 


= —7 + arctan did (a <0, ry < —1). 
1+ ry 


2.8.8 Special Relations for arcsin x, arccos x, arctan x 


1 
2arcsin x = arcsin (2ον 1 -- 2?) (is < =) ; 


1 
= 7 — arcsin (2v 1-- =) (5 ems 1) ; 


(2.155a 


(2.156a 


(2.157a 
(2.157b 
(2.158a 


(2.158b 


(2.159a 


(2.159b 
(2.159c 
(2.160a 


(2.160b 


(2.160c 


(2.161a) 


(2.161b) 
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1 
—e ard : τὰ ; 
= --π — arcsin (2x l=g ) l<a< ^5) 5 (2.161c 
2arccos x = arccos(23? — 1) (0 € x <1), (2.162a 
= 2r — arccos(2x? — 1) (-1 € x <0). (2.162b 
2m 
2 arctan 2 = arctan 1-3 ([α| < 1), (2.163a 
2x 
=n + arctan i= («> 1), (2.163b 
=r) 
2x 
= -r + arctan ——z (ας —1). (2.163c 
1 -- z? 
cos(narccosz) = T,(x) (n> 1), (2.164 
where n > 1 can also be a fractional number and T, (x) is given by the equation 
(« + va? -- ý + (υ —Xx?-— T)" 
Τι(α) = . (2.165 


2 


For any integer n, T; (x) is a polynomial of x (a Chebyshev polynomial). To study the properties of the 
Chebyshev polynomials see 19.6.3, p. 988. 


2.9 Hyperbolic Functions 
2.9.1 Definition of Hyperbolic Functions 


Hyperbolic sine, hyperbolic cosine and hyperbolic tangent are defined by the following formulas: 


ot ο 


sinh z = = , (2.166) cosh x = — , (2.167) tanh x = mE : 


The geometric definition (see 3.1.2.2, p. 132), is an analogy to the trigonometric functions. 
Hyperbolic cotangent, hyperbolic secant and hyperbolic cosecant are defined as reciprocal values of the 
above hyperbolic functions: 


(2.168) 


1 ο pent 1 2 
= Ὁ. (2.169) sech ¢ = — — = —— — (2.170) 


coshhz  e*-4e-*' 


cotha = = - 
tanha | e*—e-* 


cosech x = 


ll 


(2.171) 


The shapes of curves of hyperbolic functions are shown in Fig. 2.48—2.52. 
2.9.2 Graphical Representation of the Hyperbolic Functions 
2.9.2.1 Hyperbolic Sine 


y = sinh x (2.166) is an odd strictly monotone increasing function between —oo and +00 (Fig. 2.49). 
The origin is its symmetry center, the inflection point, and here the angle of slope of the tangent line is 


π : 
p= FU There is no asymptote. 


2.9.2.2 Hyperbolic Cosine 


y = cosh x (2.167) is an even function, it is strictly monotone decreasing for x < 0 from +00 to 1, and 
for x > 0 it is strictly monotone increasing from 1 until +00 (Fig. 2.50). The minimum is at x = 0 
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Figure 2.48 Figure 2.49 Figure 2.50 
and it is equal to 1 (point A(0,1)); it has no asymptote. The curve is symmetric with respect to the 
kA 


i i D xU . 
y-axis and it always stays above the curve of the quadratic parabola y = 1+ 3 (the broken-line curve). 


Because the function demonstrates a catenary curve, the curve is called the catenoid (see 2.15.1, p. 107). 


2.9.2.3 Hyperbolic Tangent 


y = tanh x (2.168) is an odd function, for —oo < x < +00 strictly monotone increasing from —1 to +1 
(Fig. 2.51). The origin is the center of symmetry, and the inflection point, and here the angle of slope 


of the tangent line is y = 1 . The asymptotes are the lines y = +1. 
AY 
4 
3 
2 
€———— 1: ο OO E. 
-4 3-2-1041 2 3 4x 
-2 
-3 
—4 
Figure 2.51 Figure 2.52 


2.9.2.4 Hyperbolic Cotangent 


y = coth zx (2.169) is an odd function which is not continuous at x = 0 (Fig. 2.52). It is strictly 
monotone decreasing in the interval —oo < x < 0 and it takes its values from —1 until —oo ; in the 
interval 0 < a < -oo it is also strictly monotone decreasing with values from +00 to +1. It has no 
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inflection point, no extreme value. The asymptotes are the lines x = 0 and y = +1. 


2.9.3 Important Formulas for the Hyperbolic Functions 


There are similar relations between the hyperbolic functions as between trigonometric functions. The 
validity of the following formulas can be shown directly from the definitions of hyperbolic functions, or 
considering the definitions and relations of these functions also for complex arguments, from (2.199)— 
(2.206), they can be calculated from the formulas known for trigonometric functions. 


2.9.3.1 Hyperbolic Functions of One Variable 


cosh? a — sinh? x = 1, (2.172) coth? x — cosech? x = 1, (2.173) 
sech? x + tanh? 2 = 1, (2.174) tanh a -cotha = 1, (2.175) 
sinh x i cosh x 
= tanh z, (2.176) - — coth z. (2.177) 

cosh x sinh x 

2.9.3.2 Expressing a Hyperbolic Function by Another One with the 

Same Argument 

The corresponding formulas are collected in Table 2.7. 

2.9.3.3 Formulas for Negative Arguments 
sinh(—x) = — sinha, (2.178) cosh(—a) = cosh x, (2.180) 
tanh(—x) = — tanh z, (2.179) coth(—x) = — coth x. (2.181) 


Table 2.7 Relations between two hyperbolic functions with the same arguments for x > 0 
sinh x cosh x tanh x coth x 
. ; tanh x 1 
sinh x - V cosh? x — 1 : 
γ 1— tanh? x γ coth? x — 1 
5 1 coth x 
cosh x sinh* x 4- 1 : 
V 1 — tanh? x V coth’ x — 1 
sinh x cosh? z — 1 1 
tanh x — 
Inh? z +1 cosh x coth x 
V sinh? x + 1 cosh x 1 
coth a - = 
sinh x leosh? z — 1 tanh x 


2.9.3.4 Hyperbolic Functions of the Sum and Difference of Two Arguments 


(Addition Theorems) 


sinh(x + y) = sinh z cosh y + cosh z sinh y, 


cosh(x + y) = cosh x cosh y + sinh x sinh y, 


(2.182) 
(2.183) 
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tanh 2; + tanh y 1+ coth z coth y 
tanh(a + y) = ——— — ———-— , 2.184) coth(a + y) = —————_.. 2.185 
-; 1 tanh tanh μ᾿ ( ARA coth x + coth y ( 5) 
2.9.3.5 Hyperbolic Functions of Double Arguments 
2tanh x 
sinh 2r = 2sinh x cosh z, (2.186) Επ l-tanh?z' (2.188) 
; n2 2 2 
:oshi 2g. —8 x sh? x. 18 1 + coth" x 
cosh 2x = sinh” x + cosh’ x, (2.187) coth 2z = coth (2.189) 
2 coth x 
2.9.3.6 De Moivre Formula for Hyperbolic Functions 
(cosh x + sinh z)" = (e**) "= ef"? = cosh nz + sinh nz. (2.190) 


2.9.3.7 Hyperbolic Functions of Half- Argument 


x 1 x i 
sinh 7 = +4/ 3 (cosh x—1), (2.191) cosh = = 3 (cosh gI) (2.192) 


The sign of the square root in (2.191) is positive for x > 0 and negative for x < 0. 


TE c E a .:; 
2.9.3.8 Sum and Difference of Hyperbolic Functions 

sinh x + sinh y = 2sinh z 5 ” cosh Ž = 4 , (2.195 

cosh x + cosh y = 2 cosh ΣΤ 9 cosh = 3 Z ; (2.196 

cosh x — cosh y = 2 sinh ZEY sinh Y ; (2.197 

tanh x + tanh y = pied) (2.198 


cosh x cosh y ` 


2.9.3.9 Relation Between Hyperbolic and Trigonometric Functions with 
Complex Arguments z 


sinz = —isinhiz, (2.199) sinh z = —isiniz, (2.203) 
cos z = coshiz, (2.200) cosh z = cosiz, (2.204) 
tanz = —itanhiz, (2.201) tanh z = —itaniz, (2.205) 
cot z = icothiz, (2.202) coth z = icotiz. (2.206) 


Every relation between hyperbolic functions, which contains x or ax but not az+b, can be derived from 
the corresponding trigonometric relation with the substitution isinh x for sina and cosh x for cos o. 


ΒΑ: cos? o — sin? o = 1, cosh? x + i? sinh? x = 1 or cosh? x — sinh? x = 1. 
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W B: sin 2a = 2sina cosa, isinh 2x = 2isinh x cosh or sinh 2r = 2 sinha cosh 2. 


2.10 Area Functions 
2.10.1 Definitions 


The area functions are the inverse functions of the hyperbolic functions, i.e., the inverse hyperbolic 
functions. The functions sinh z , tanh x, and coth x are strictly monotone, so they have unique inverses 
without any restriction; the function cosh x has two monotonic intervals so there are to consider two 
inverse functions. The name area refers to the fact that the geometric definition of the functions is the 
area of certain hyperbolic sectors (see 3.1.2.2, p. 132). 

2.10.1.1 Area Sine 

The function y = Arsinh x (2.207) 
(Fig. 2.53) is an odd, strictly monotone increasing function, with domain and range given in Ta- 
ble 2.8. (2.53) is equivalent to the expression x = sinh y. The origin is the center of symmetry and 


T 
the inflection point of the curve, where the angle of slope of the tangent line is o — re 


2.10.1.2 Area Cosine 
The functions y= Arcoshr and y= — Arcoshc (2.208) 


(Fig. 2.54) or x = cosh y have the domain and range given in Table 2.8; they are defined only for 
x > 1. The function curve starts at the point A(1,0) with a vertical tangent line and the function 
increases or decreases strictly monotonically respectively. 


y-Arcosh x 


» 
-2 -1 23456x 


y=-Arcosh x 


Figure 2.53 Figure 2.54 


Table 2.8 Domains and ranges of the area functions 


Hyperbolic function 


"Oz al 1 £ ye " H 
Area function Domain Range with same meaning 


area sine 


y — Arsinh x —oo«z«oo|-—oo«y«oo x = sinh y 
area cosine 

y = Arcosh a O<y<a qaos 

y — — Arcosh x i$v«9 wg. y <0 z = cosh y 


area tangent 


y = Artanh x «1 | -co<y<oo z =tanhy 
area cotangent 
y = Arcoth x |z|>1 | -c0« y «0 x = coth y 
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Figure 2.55 Figure 2.56 
2.10.1.3 Area Tangent 
The function y = Artanh x (2.209) 


(Fig. 2.55) or x = tanh y is an odd function, defined only for |x| < 1, with domain and range given in 
Table 2.8. The origin is the center of symmetry and also the inflection point of the curve, and here the 


"e π , : : 
angle of slope of the tangent line is o — 1 . The asymptotes are vertical, their equations are x = +1. 


2.10.1.4 Area Cotangent 
The function y = Arcoth x (2.210) 


(Fig. 2.56) or x = coth y is an odd function, defined only for |x| > 1, with domain and range given in 
Table 2.8. In the interval -oo < x < —1 the function is strictly monotone decreasing from 0 until —oo, 
in the interval 1 < x < +00 it is strictly monotone decreasing from 4-oo to 0. It has three asymptotes: 
their equations are y = 0 and x = +1. 


2.10.2 Determination of Area Functions Using Natural Logarithm 


From the definition of hyperbolic functions ((2.166)-(2.171), see 2.9.1, p. 89) follows that the area 
functions can be expressed with the logarithm function: 


Arsinh x = In (z + Vx? +1), (2.211) 


1 
Arcosh x = In (x + Va? — 1) = m( j ;) (v 2 1), (2.212) 
2 --γαθ- 
Ll. el ee 1, x41 
Artanh x = zm 1 = (|x| « 1), (2.213) Arcoth x = zm Ξ = (z| » 1). (2.214) 
= z— 


2.10.3 Relations Between Different Area Functions 


Arsinh x = (sign x) Arcosh v x? + 1 = Artanh —— = Arcoth ———— 
Ἢ Ve 


(|z| « oo), (3.215) 


vz?—1 x 
Arcosh x = Arsinh vz? = Artanl — Arcotl 215 2.216 
rcosh x rsinh Vx rtanh —— rcoth mg (a > 1) ( ) 
x 1 1 
Artanh x = Arsinh "E — Arcoth "ied (sign x) Arcosh Vice ([α| < τ), (2.217) 
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1 
Arcoth x = Artanh — = (sign x) Arsinh 
T 


|z] 


xr? — 1 


1 


= (sign x) Arcosh (z| > 1). (2.218) 


2.10.4 Sum and Difference of Area Functions 
Arsinh a + Arsinh y = Arsinh (2y 1l+ytyVv1+ zi ; (2.219) 


Arcosh x + Arcosh y = Arcosh (ου + y (x? — 1)(y? — 1) ; (2.220) 

Artanh x + Artanh y = Artanh = i : (2.221) 
2.10.5 Formulas for Negative Arguments 

Arsinh(—r) = — Arsinh z, (2.222) 

Artanh(—a) = — Artanh z, (2.223) Arcoth(—a) = — Arcoth x. (2.224) 


The functions Arsinh, Artanh and Arcoth are odd functions, and Arcosh (2.212) is not defined for 
arguments x < 1. 


2.11 Curves of Order Three (Cubic Curves) 


A curve is called an algebraic curve of order n if it can be written in the form of a polynomical equation 
F(x,y) = 0 of two variables where the left-hand side is a poynomial expression of degree n. 

E The cardioid with equation (x? + y?) (z? + y? — 2ax) — a?y? = 0 (a > 0) (see 2.12.2, p. 98) isa curve 
of order four. 'The well-known conic sections (see 3.5.2.11, p. 206) result in curves of order two. 


2.11.1 Semicubic Parabola 
The equation y — ax?? (a0, «> 0) (2.225a) 
or in parametric form «=¢?, y=at® (a0, —co <t <oo) (2.225b) 
gives the semicubic parabola (Fig. 2.57). It has a cuspidal point at the origin, it has no asymptote. 
6a 
The curvature K = ——-——.—, takes all the values between oo and 0. The arclength of the curve 
ντ(4 + 9a2x)3/2 
between the origin and the point P(x, y) is L = 


2.11.2 Witch of Agnesi 


= + 9a2x)3/? — 8]. 


The equation y= (a>0, =œ < x < œ) (2.226a) 


a? + x? 

determines the curve represented in Fig. 2.58, the witch of Agnesi. It has an asymptote with the 
a 

equation y = 0, it has an extreme point at A(0, a), where the radius of curvature is r = z The inflection 

a 3a 3V3 

y3 4 8 


points B and C are at ( ) , where the angles of slope of the tangent lines are tan o = 3 
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Figure 2.57 Figure 2.58 Figure 2.59 


The area of the region between the curve and its asymptote is equal to S = πα’. The witch of Agnesi 
(2.2262) is a special case of the Lorentz or Breit-Wigner curve 

u a 

(O0 -(α- c)? 
E a Fourier transform of the damped oscillation is the Lorentz or Breit-Wigner curve (see 15.3.1.4, 
p. 791). 


y (a>0,b40). (2.226b) 


2.11.3 Cartesian Folium (Folium of Descartes) 


The equation z?--y?-—3ary (a0) or (2.227a) 
; trie f 3at 3at? i 
in parametric form z = ——.,y—-——4 with 
; 14-8'97 148 
t=tan¢P0r (a»0,—o00«t«-—1 and —1«t <œ) (2.227b) 


gives the Cartesian folium curve represented in Fig. 2.59. The origin is a double point because the 
curve passes through it twice, and here both coordinate axes are tangent lines. At the origin the radius 


A 3a ; i 
of curvature for both branches of the curve is r = m The equation of the asymptote is x +y +a = 0. 


: 3 3 : 3a* 
The vertex A has the coordinates A (Za σα) . The area of the loop is $1 = ue The area S between 


the curve and the asymptote has the same value. 


2.11.4 Cissoid 


23 

The equation 4? = — (a> 0), (2.228a) 

α-- Ἡ 
: tric f at? at? ith 
or in parametric form r = —, y= wit 
k T4 g** 7 qus 
t —tanXPOr (a>0,-co <t < œ) (2.228b) 
asin? ip 
or with polar coordinates p = (a > 0) (2.228c) 
cos p 


(Fig. 2.60) describes the locus of the points P for which 


0P = MQ (2.229) 
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a 
is valid. Here M is the second intersection point of the line 0P with the drawn circle of radius 2) and 
Q is the intersection point of the line 0P with the asymptote x = a. The area between the curve and 


; ον. 
the asymptote is equal to S = ya 


Figure 2.60 Figure 2.61 
2.11.5 Strophoide 


Strophoide is the locus of the points P, and P», which are on an arbitrary half-line starting at A (A is 
on the negative x-axis) and for which the equalities 

MP, = MP = 0M (2.230) 
are valid. Here M is the intersection point with the y-axis (Fig. 2.61). The equation of the strophoide 
in Cartesian, and in polar coordinates, and in parametric form is: 


F ofa+r 082 
foe ¢ + =) (a > 0), (2.231a) pma es), (2.231b) 
a3 cos (p 
a ἅ μες δε Gncliceetecug (2.2310) 
=a $ = q with t = tan £ a ,—O00 ν oo). H ς 
PU 3 στη i 
The origin is a double point with tangent lines y = +x. The asymptote has the equation x = a. The 


; ; δα NEP 
vertex is A(—a, 0). The area of the loop is Sı = 2a? — σπα”, and the area between the curve and the 


"u-— ei 
asymptote is $5 = 2a? + σπα”. 
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2.12.1 Conchoid of Nicomedes 
The Conchoid of Nicomedes (Fig. 2.62) is the locus of the points P, for which 

OP — 0M +1 (2.232) 
holds, where M is the intersection point of the line 0P 0P with the asymptote x = a. The “+” sign 
belongs to the right branch of the curve, the “—” sign belongs to the left one in relation to the asymptote. 
The equations for the conchoid of Nicomedes are the following in Cartesian coordinates, in parametric 
form and in polar coordinates: 


(α —a)?(a? +y?)— Pa? =0 (a 0,0 0), (2.233a) 


r—a-clcosp, y=atany+lsiny 


(a > 0, right branch: — u «φς 2 , left branch: 5 <y< m) (2.233b) 
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p= +1 (“+” sign: right branch, “—” sign: left branch,) (2.233c) 


y^ 


e 


"E 


SEE I ER 


a) Ica b) ba 


Figure 2.62 


1. Right Branch: The asymptote is x = a. The vertex A is at (a + l, 0), the inflection points B, C 
have as z-coordinate the greatest root of the equation z? — 3a?z + 2a(a? — I?) = 0. The area between 
the right branch and the asymptote is $ — oo. 

2. Left Branch: The asymptote is v = a. The vertex D is at (a — 1,0). The origin is a singular 
point, whose type depends on a and |: 


Case a) For | < a it is an isolated point (Fig. 2.62a). The curve has two further inflection points E 
and F, whose abscissa is the second greatest root of the equation Ὁ) — 3a?z + 2a(a? — I?) = 0. 


Caseb) For! > athe origin is a double point (Fig. 2.62b). The curve has a maximum and a minimum 


value at x = a — Val? . At the origin the slopes of the tangent lines are tana = —— — ——. Here the 
a 
IVE — a? 


radius of curvature is rg = 2 
a 


Case c) For l = a the origin is a cuspidal point (Fig. 2.62c). 


2.12.2 General Conchoid 


The conchoid of Nicomedes is a special case of the general conchoid. One gets the conchoid of a given 
curve by elongating the length of the position vector of every point by a given constant segment +1. 
Considering a curve in a polar coordinate system with an equation p = f(y), then the equation of its 
conchoid is 


p— f(q) xt. (2.234) 
So, the conchoid of Nicomedes is the conchoid of the line. 


2.12.3 Pascal's Limagon 
The conchoid of a circle is called the Pascal limaçon (Fig. 2.63) if in (2.232) the origin is on the 
perimeter of the circle, which is a further special case of the general conchoid (see 2.12.2, p. 98). The 
equations in the Cartesian and in the polar coordinate systems and in parametric form are the following 
(see also (2.246c), p. 105): 

(2? +y — az = P(a? y?) (a 0,0» 0), (2.235a) 


p=acosy+l (α»0, 1» 0), (2.235b) 
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x —acosp--lcosp, y=acosysing+Ilsing (a2 0, 12 0, 0 € o « 2x) (2.235c) 


with a as the diameter of the circle. The vertices A, B are at (a +1,0). The shape of the curve depends 

on the quantities a and {, as can be seen in Fig. 2.63 and Fig. 2.64. 

a) Extreme Points and Inflection Points: For a > l the curve has four extreme points C, D, E, F; 

—l + JP + δα 
4a 


for a < lit has two; they are at ( cosy = 


2a? + D? 
3al i 


) . Fora < l < 2a there exist two inflection 


points G and H at (s p=- 


P^ W4- ἴξ 
b) Double Tangent: For | < 2a, at the points J and K at τα QE 
α Λα 


) there is a double 


tangent. 
c) Singular Points: The origin is a singular point: For a < l it is an isolated point, for a > l it is a 


a<l<2a 


b) 
Figure 2.63 
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; : a?—l . , 

double point and the slopes of the tangent lines are tan a = +————_., here the radius of curvature is 
I 8 I 


1 


TQ = 7 a2 -- l2. 


For a = | the origin is a cuspidal point; then the curve is called a cardioid (see also 2.13.3, p. 103). 
va? , 
The area of the limaçon is S = E + Tl’, where in the case a > | (Fig. 2.63c) the area of the inside 


loop is counted twice. 


2.12.4 Cardioid 
The cardioid (Fig. 2.64) can be defined in two different ways, as: 
1. Special case of the Pascal limaçon with 
0P =0M +a, (2.236) 


where a is the diameter of the circle. 


2. Special case of the epicycloid with the same diameter a for the fixed and for the moving circle (see 
2.13.3, p. 103). The equation is 
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(a? + y?)? — 2ασ(α +y) =a?y? (a. 0), (2.2372) 
and the parametric form, and the equation in polar coordinates are: 
2 =acosp(l+cosy), y=asiny(1+ cosy) 
(a >0,0 € v « 2π), (2.237b) 
p=a(l+cosy) (a> 0). (2.237c) 
The origin is a cuspidal point. The vertex A is at (2a, 0); extreme points 
3/3 


It aa ς 9 
C and D are at cosy = 5 with coordinates E ; αρα) . The area 


= ; 3 5. Tw . . f 
a=1 is ὃ = σπα” , i.e., six times the area of a circle with diameter a. The 
Figure 2.64 length of the curve is L — 8a. 


2.12.5 Cassinian Curve 

The locus of the points P, for which the product of the distances from two fixed points F, and [ο 
with coordinates (c, 0) and (—c,0) resp., is equal to a constant a? 7 0, is called a Cassinian curve 
(Fig. 2.65): 


FP. PP = a’. (2.238) 
The equations in Cartesian and polar coordinates are: 
(a? + y?)? — 22? (x? — y?)=at- c, (a>0,c>0), (2.239a) 
P = cos2 + y ct cos? 2p + (at — ct) (a> 0,c » 0). (2.239b) 
y a 
G E 
c Q P A 
ofc x 
K I 
c«a«c42' a«c 


Figure 2.65 
'The shape of the curve depends on the quantities a and c : 
Case a > εν: For a > cV2 the curve is an oval whose shape resembles an ellipse (Fig. 2.65a). 
The intersection points A, C with the x-axis are (+ Va? + c? , 0), the intersection points B, D with the 
y-axis are (0, +Va? — c). 
Case a = cV/2: For a = cV2 the curve is of the same type with A, C (+ cV/3, 0) and B, D (0, c), 


where the curvature at the points B and D is equal to 0, i.e., there is a narrow contact with the lines 
y — cc. 


is) 
I 


Case c < a < cV/2: For c < a < cV2 the curve is a pressed oval (Fig. 2.65b). The intersection 


points with the axes are the same as in the case a > cv/2, also the extreme points B, D, while there are 


vAci — a a? 
further extreme points E, G, K, I at (« 7 (t3 and there are four inflection points P, L, 
iC C 
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iy a ο 9 m 1), ΞΕ 9 m n withn = 3e and m = 3 


Case a = c: For a = cthere is the lemniscate. 

Case a « c: For a < c there are two ovals (Fig. 2.65c). The intersection points A, C and P,Q 

with the x-axis are at (+ Va? +, 0) and (+ vc? — a?,0). The extreme points E, G, K, I are 
/ 2 

at (« κε A + =) . The radius of curvature is r = 

2c 2c 

coordinate representation. 


2a? p? 
c! — at 4 3p! 


, where p satisfies the polar 


2.12.6 Lemniscate 


The lemniscate (Fig. 2.66) is the special case a — c of the Cassinian curve satisfying the condition 


m 


FP. BP = ( (2.240) 


where the fixed points Fi, F> are at (+a, 0). The equation 
in Cartesian coordinates is 

(a? + y°)? — 2a?(x? - y?) 20. (a> 0) (2.241a) 
and in polar coordinates 


p=ay/2cos2p (a> 0). (2.241b) 


The origin is a double point and an inflection point at the 
same time, where the tangents are y = +2. Figure 2.66 


The intersection points A and C with the a-axis are at (+ αγΏ 0), the extreme points of the curve E, 
a3 
2^" 


a π 
G, K, I are at ( 3 . The polar angle at these points is y = E The radius of curvature is 


a , ; 
He. and the area of every loop is S = a? . 


2.13 Cycloids 
2.13.1 Common (Standard) Cycloid 


The cycloid is a curve which is described by a point of the perimeter of a circle while the circle rolls 
along a line without sliding (Fig. 2.67). The equation of the usual cycloid written in parametric form 
is the following: 


x —a(t — sint), y — a(1— cost) 

(a>0, —oo«t«oo) (2.242a) 
where a is the radius of the circle and t 
is the angle X PC, B in radian measure. 
In Cartesian coordinates 


x +4/y(2a — y) = aarc;, cos --- 
a 


(a>0, k=0,+1,+2,...) (2.242b) 
holds. 


Figure 2.67 
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The curve is periodic with period OgO; = 2ra. At Op, Οι. Os, .. ., Oy = (2kra, 0) there are cusps, the 
vertices are at Αχ = ((2k+1)ra, 2a (k = 0,+1,+2,...)). The arc length of OpP is L = 8a sin” (t/4), 
the length of one arch is Lo,4,0, = δα. The area of one arch is S = ὅπα”. The radius of curvature 
isr = Aasin 3t, at the vertices ra = 4a. The evolute of a cycloid (see 3.6.1.6, p. 254) is a congruent 
cycloid, which is denoted in Fig. 2.67 by the broken line. 


2.13.2 Prolate and Curtate Cycloids or Trochoids 
Prolate and curtate cycloids or trochoids are curves described by a point, which is inside or outside of a 
circle, fixed on a half-line starting from the center of the circle, while the circle rolls along a line without 
sliding (Fig. 2.68). 
'The equation of the trochoid in parametric form is 
x — a(t — Asint), (2.2432) 
y — a(1 — Acost), (2.243b) 


where a is the radius of the circle, t is the angle X: PCM, and Aa = Cj P. 

The case À > 1 gives the prolate cycloid and λ < 1 the curtate one. 

The period of the curve is OjO; = 27a, the maximum points are at Αι, A», ..., Appi = ((2k+1)ra, (1+ 
A)a), the minimum points are at Bo, B1, Β2,..., By = (2Κπα, (1 — λ)α) (k = 0,4£1,+2,...). The 


mn 


prolate cycloid has double points at Do, Di, Do,..., Dy = [πα τα (1 —4A2— D) | , where tọ is the 


smallest positive root of the equation t = 
Asint. 

The curtate cycloid has inflection points 
at Fy, E», saby Eesti = 

[a (arc; cos À — AV1 -- 12) ,a(1— x). 
The calculation of the length of one cy- 
cle can be done by the integral L = 


2T 
" / V1+X2—2)\costdt. The shaded 
0 


area in Fig. 2.68 is S = ma?(2 + X°). 
The radius of curvature 
. (1--A—2Acost)/? 
isr—a , which has 
A(cost — A) 
(1+ A)? 


the value ra = —a-——,—— at the max- 


λ 
(=a) 
λ 


ima and the value rg = a at the 


Figure 2.68 minima. 


2.13.3 Epicycloid 

A curve is called an epicycloid, if it is described by a point of the perimeter of a circle while this circle 
rolls along the outside of another circle without sliding (Fig. 2.69). The equation of the epicycloid in 
parametric form is 


A-ca A 
x = (A + a) cosy — a cos “9, y = (A + a)sin y — asin Ti (—oo < p < oo), (2.244) 
a a 


where A is the radius of the fixed circle, a is the radius of the rolling one, and φ is the angle X C0x. The 


. A 
shape of the curve depends on the quotient m — —. 
α 
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For m — 1 one gets the cardioid. 


Figure 2.69 


Figure 2.70 


Case m integer: For an integer m the curve consists of m identically shaped branches surrounding the 


fixed curve (Fig. 2.69a). The cusps Αι. A2, ..., Am are at | p = A, y = — (k=0,1,...,m— D) " 
m 


m 


2 1 
the vertices B4, Β....., Bm are at (» =A+2a, p= e (k f 5) } 


Case m a rational fraction: If m is a non-integer rational number, the identically shaped branches 
follow each other around the fixed circle, overlapping the previous ones, until the moving point P re- 
turns back to the starting-point after a finite number of circuits (Fig. 2.69b). 


Case m an irrational: For an irrational m the number of round trips is infinite; the point P never 
returns to the starting-point. 
8(A +a) 
m ` 
curve is Liota = 8(A + a). The area of the sector Ay B1 A2A; (without the sector of the fixed circle) is 
ο [3A + 2a 4a(A +a 4a(A+a 
g (A+a) (A+) 


Ay 
. The radius of curvature is r = —————— sin , at the vertices rg = 
A ) 2a+A 2a P aA 


The length of one branch is L4,5,4, = For an integer πι the total length of the closed 


2.13.4 Hypocycloid and Astroid 
A curve is called a hypocycloid if it is described by a point of the perimeter of a circle, while this circle 
rolls along the inside of another circle without sliding (Fig. 2.70). The equation of the hypocycloid, 
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the coordinates of the vertices, the cusps, the formulas for the arc-length, the area, and the radius of 
curvature are similar to the corresponding formulas for the epicycloid, only “+a” is to be replaced by 
“—q” The number of cusps for integer, rational or irrational m is the same as for the epicycloid (now 
m > 1 holds). 
Case m — 2: For m — 2 the curve is actually the diameter of the fixed circle. 
Case m = 3: For m = 3 the hypocycloid has three branches (Fig. 2.70a) with the equation 

x = a(2cos y + cos 29), y = a(2sin p — sin 2y) (2.2452) 
and Li = 16a, Si = 27a?. 
Case m, — 4: For m — 4 (Fig. 2.70b) the hypocycloid has four branches, and it is called an astroid 
(or asteroid). Its equation in Cartesian coordinates and in parametric form is 


p73 +y? = Αα (99450) r= Aco y, y= Asie (0Xqo«m) (2.245c) 
and Liota = 24a = 6A, S44 = ara. 
yA γα 


8) λ51, a>0 9) 


Figure 2.71 


a) -- a AS, a<0 


Figure 2.72 


2.13.5 Prolate and Curtate Epicycloid and Hypocycloid 

The prolate and curtate epicycloid and the prolate and curtate hypocycloid, which are also called the 
epitrochoid and hypotrochoid, are curves (Fig. 2.71 and Fig. 2.72) described by a point, which is inside 
or outside of a circle, fixed on a half-line starting at the center of the circle, while the circle rolls around 
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the outside (epitrochoid) or the inside (hypotrochoid) of another circle, without sliding. 
The equation of the epitrochoid in parametric form is 


A Aca 
x = (A + a) cosy — Aa cos ( = e) : y = (A + a)sin o — Aasin ( xis e) i (2.246a) 
a a 


where A is the radius of the fixed circle and a is the radius of the rolling one. For the hypocycloid “+a” 
is to be replaced by “—a”. For λα = CP one of the inequalities A > 1 or A < 1 is valid, depending on 
whether the prolate or the curtate curve is considered. 


For A = 2a, and for arbitrary \ Z 1 the hypocycloid with equation 
z=a(l+A)cosy, y=a(l1—A)sing (0< œ< 2π) (2.246b) 


describes an ellipse with semi-axes a(1 + λ) and a(1 — A). For A = a it results in the Pascal limaçon 
(see also 2.12.3, p. 98): 


2 =a(2cosy—Acos2y), y=a(2siny — Asin 2y). (2.246c) 


Remark: For the Pascal limaçon on 2.12.3, p. 98 the quantity denoted by a there is denoted by 2a 
here, and the / there is the diameter 2a here. Furthermore the coordinate system is different. 


2.14 Spirals 
2.14.1 Archimedean Spiral 


An Archimedean spiralis a curve (Fig. 2.73) described by a point which is moving with constant speed 
v on aray, while this ray rotates around the origin at a constant angular velocity w. The equation of 
the Archimedean spiral in polar coordinates is 


p= aj], απο (a >0,-00 < p< œ). (2.247) 


Figure 2.73 Figure 2.74 
The curve has two branches (p < 0, > 0) in a symmetric position with respect to the y-axis. Every 
ray OK intersects the curve at the points 0, Αι, Ag,...,An,...; their distance is A;A;,; = 27a. The 
es ] ; F . 2L 
arclength 0P is L = s (ev p? +14 Arsinh v) , where for large o the expression αρ tends to 1. The 
a, : : (Φ} ἠ- 13 
area of the sector P,OP, is S = ra — 1"). The radius of curvature is r = xy and at the 
p 
- a 
origin ro = 5. 
2.14.2 Hyperbolic Spiral 
The equation of the hyperbolic spiral in polar coordinates is 
p=— (a>0, -o<p<0,0<¢<oo). (2.248) 


ly 
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The curve of the hyperbolic spiral (Fig. 2.74) has two branches (p < 0, > 0) in a symmetric 
position with respect to the y-axis. The line y = a is the asymptote for both branches, and the origin is 


ΓΙ 1 a? 

an asymptotic point. The area of the sector P 0P, is S = — | — — — |, and lim S = — is valid. 
2471 Φα pao 2p 

3 


TE 


The radius of curvature is r = 


p 


2.14.3 Logarithmic Spiral 


The logarithmic spiralis a curve (Fig. 2.75) which intersects all the rays starting at the origin 0 at the 
same angle a. The equation of the logarithmic spiral in polar coordinates is 


p=ae? (a>0, -œ< 9< ορ), (2.249) 
where k = cota. The origin is the asymptotic point of the curve. The arclength P,P, is L = 


(p2 — pi), the limit of the arclength OP calculated from the origin is Lg = p. The 


k 


: k 
radius of curvature is r = V1 + k?p = Lok. 


Special case of a circle: For a = 3 holds k = 0, and the curve is a circle. 


2.14.4 Evolvent ofthe Circle 


The evolvent of the circle is a curve (Fig. 2.76) which is described by the endpoint of a string while 


rolling it off a circle, and always keeping it tight, so that AB= BP. The equation of the evolvent of the 
circle is in parametric form 
Tr -acosQ--apsing, y-asing — ay cos p, (2.250) 


where a is the radius of the circle, and y = X: Β0α. The curve has two branches in symmetric position 
with respect to the x-axis. It has a cusp at A(a, 0), and the intersection points with the x-axis are at 


a τ l . 
2 -- , where yo are the roots of the equation tan y = y. The arclength of AP is L = σάφ᾽. The 
COS Yo 


radius of curvature is r = ay = V2aL; the centre of curvature B is on the circle, i.e., the circle is the 
evolute of the curve. 


yA 


Figure 2.75 Figure 2.76 Figure 2.77 
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2.14.5 Clothoid 
The clothoid is a curve (Fig. 2.77) such that at every point the radius of curvature is inversely propor- 
tional to the arclength between the origin and the considered point: 


2 


r= - (a » 0). (2.2512) 


The equation of the clothoid in parametric form is 
t " t 5 
vt . TU ; 5 
z = ay T fos F dt, y=a/r [ 5» E dt with t= ——, s=0P. (2.251b) 
0 0 


The integrals cannot be expressed in terms of elementary functions; but for any given value of the pa- 
rameter t = to, t1,... it is possible to calculate them by numerical integration (see 19.3, p. 963), so the 
clothoid can be drawn pointwise. About calculations with a computer see the literature. 

The curve is centrosymmetric with respect to the origin, which is also the inflection point. At the inflec- 
tion point the x-axis is the tangent line. At A and B the curve has asymptotic points with coordinates 


an ayr an — a m 
( t 9^" H £) and ( ντ y £), The clothoid is applied, for instance in road construction, 


2 


where the transition between a line and a circular arc is made by a clothoid segment. 


2.15 Various Other Curves 
2.15.1 Catenary Curve 


The catenary curve is a curve wich has the shape of a homogeneous, flexible but inextensible heavy 
chain hung at both ends (Fig. 2.78) represented by a continuous line. The equation of the catenary 
curve is 


et/a de e t/a 


x 
y =acosh— =a 5 (a > 0). (2.252) 
a 
The parameter a determines the vertex A at (0, a). The curve is symmetric to the y-axis, and is always 
2 


higher than the parabola y = a + or! which is represented by the broken line in Fig. 2.78. The 
a 


αν T οἳ/α _ e t/a 
arclength of AP is L = asinh a 7 . The area of the region 0ΑΡΜ has the value S = 
a 
2 : 2 
82a Ὁ : . y 47 L 
aL = a? sinh —. The radius of curvature is r a cosh? a 
a a a 


Figure 2.78 Figure 2.79 
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The catenary curve is the evolute of the tractrix (see 3.6.1.6, p. 254), so the tractrix is the evolvent (see 
3.6.1.6, p. 255) of the catenary curve with vertex A at (0, a). 


2.15.2 Tractrix 


The £ractriz (the thick line in Fig. 2.79) is a curve such that the length of the segment PM of the 
tangent line between the point of contact P and the intersection point with a given straight line, here 
the x-axis, is a constant a. Fasting one end of an inextensible string of length a to a material point P, 
and dragging the other end along a straight line, here the x-axis, then P draws a tractrix. The equation 
of the tractrix is 


= aln — 
y 


The z-axis is the asymptote. The point A at (0, a) is a cusp. The curve is symmetric with respect to 


a?—4? (a>0,0<y<a). (2.253) 


a - 
a = aArcosh — + y a? 
y 


y? α 

the y-axis. The arclength of AP is L = aln —. For increasing arclength L the difference L — x tends 
y 

to the value a(1 — In2) ~ 0.307a, where x is the abscissa of the point P. The radius of curvature is 


r = acot — . The radius of curvature PC and the segment PE = b are inversely proportional: rb = a? . 


The evolute (see 3.6.1.6, p. 254) of the tractrix, i.e., the geometric locus of the centers of circles of 
curvature C, is the catenary curve (2.252), represented by the dotted line in Fig. 2.79. 


2.16 Determination of Empirical Curves 


2.16.1 Procedure 
2.16.1.1 Curve-Shape Comparison 


If there are only empirical data for a function y = f(x), it is possible to get an approximate formula in 
two steps. First choose a formula for an approximation which contains free parameters. Then calculate 
the values of the parameters. If there is no theoretical description for the type of formula, then first 
choose the approximate formula which is the simplest among the possible functions, comparing their 
curves with the curve of empirical data. Estimation of similarity by eye can be deceptive. Therefore, 
after the choice of an approximate formula, and before the determination of the parameters, it is to 
check whether it is appropriate. 


2.16.1.2 Rectification 

Supposing there is a definite relation between x and y and in the chosen approximate formula two 

functions X = (x,y) and Y = u(x, y) are introduced such that a linear relation of the form 
Y=AX+B (2.254) 


holds, where A and B are constant. Calculating the corresponding X and Y values for the given x and 
y values, and considering their graphical representation, it is easy to check if they are approximately 
on a straight line, or not. Then it can be decided whether the chosen formula is appropriate. 


x 
ΒΑ: Ifthe approximate formulais y = 
ax +b 


, then substituting X = x, Y = 2 one gets Y = aX +b. 
y 
1 1 
Another possible substitution is X = —, Y = —. Then Y = a + bX follows. 
T y 


W B: Using semi-logarithmic paper, 2.17.2.1, p. 116. 

W C: Using double logarithmic paper, 2.17.2.2, p. 117. 

In order to decide whether empirical data satisfy a linear relation Y = AX + B or not, one can use 
linear regression or correlation (see 16.3.4, p. 839). The reduction of a functional relationship to a linear 
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relation is called rectification. Examples of rectification of some formulas are given in 2.16.2, p. 109, 
and for an example discussed in detail, see in 2.16, p. 114. 


2.16.1.3 Determination of Parameters 

The most important and most accurate method of determining the parameters is the least squares 
method (see 16.3.4.2, p. 841). In several cases, however, even simpler methods can be used with success, 
for instance the mean value method. 

1. Mean Value Method 

The mean value method uses the linear dependence of the “rectified” variables X and Y, i.e., Y = 
AX +B as follows: The conditional equations Y; = AX;+ B for the given values Y;, X; are to be divided 
into two groups, which have the same size, or approximately the same size. By adding the equations 
in the groups one gets two equations, from which A and B can be determined. Then replacing X and 
Y by the original variables x and y again, one gets the connection between x and y, which is what one 
was looking for. 
If not all the parameters are to be determined, one can apply the mean value method again with a 
rectification by other amounts X and Y (see, e.g., 2.16.2.11, p. 113). 

Rectification and the mean value method are used above all when certain parameters occur in non-linear 
relations in an approximate formula, as for instance in (2.267b), (2.267c). 

2. Least Squares Method 
When certain parameters occur in non-linear relations in the approximation formula, the least squares 
method usually leads to a non-linear fitting problem. Their solution needs a lot of numerical calculations 
and also a good initial approximation. These approximations can be determined by the rectification 
and mean value method. 


2.16.2 Useful Empirical Formulas 

In this paragraph some of the simplest cases of empirical functional dependence are discussed, and 
the corresponding graphs are presented. Each figure shows several curves corresponding to different 
parameter values involved in the formula. The influence of the parameters upon the forms of the curves 
is discussed in the following sections. 

For the choice of the appropriate function, usually only that part of the corresponding graph is to be 
considered, which is used for the reproduction of the empirical data. Therefore, e.g., one should not 
think that the formula y = ax? + br + c is suitable only in the case when the curve of the empirical 
data have a maximum or minimum. 


y yA yA 


[αυ] 
xy 
οι 
κ 
[e] 
xy 


Figure 2.80 Figure 2.81 Figure 2.82 


2.16.2.1 Power Functions 
1. Type y = ααὖ: 


Typical shapes of curve for different values of the exponent b 
y =ar? (2.255a) 
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areshown in Fig. 2.80. The curves for different values of the exponent are also represented in Figs. 2.15, 
2.21, 2.24, 2.25 and Fig. 2.26. The functions are discussed on pages 66, 70 and 71 for the formula 
(2.44) as a parabola of order n, formula (2.45) as a reciprocal proportionality and formula (2.50) as a 
reciprocal power function. The rectification is made by taking the logarithm 

X —-logz, Y —logy: Y=loga+bX. (2.255b 
2. Typey = az? + c: 
The formula 

y =ar +c (2.256a 
produces the same curve as in (2.255a), but it is shifted by c in the direction of y (Fig. 2.82). If b is 
given, the following rectification can be used: 


X—s, Yoy: Y=aX+e. (2.256b 
If b is not known first c is to be determined, then the rectification can be done in accordance with 

X —logz, Y —log(y—- c): Y —loga-br. (2.256c 
In order to determine a first approach of c, one can choose two arbitrary abscissae x4, 20 and a third 
one, £3 = \/%1£ as well as the corresponding ordinates yi, yo, ya, so that now 

c= Hm y? (2.256d 


yn tye — 2ys 

holds. After having determined a and b, one can get a corrected c, namely as the average of the amounts 
b 

y — ax". 


yA ya 


Figure 2.83 Figure 2.84 


2.16.2.2 Exponential Functions 
1. Type y = ae": 
The characteristic shapes of the curves of the function 

y =ae™ (2.257a 
are shown in Fig. 2.81. The discussion of the exponential function (2.54) and its graph (Fig. 2.26) is 
presented in 2.6.1, p. 72. For the rectification one takes 

X=2, Y=logy: Y =loga+bloge. X. (2.257b 
2. Type y = ae +c: 
The formula 

y =a +c (2.258a 
produce the same curve as (2.257a), but it is shifted by c in the direction of y (Fig. 2.83). Begin with 
the determination of a first approach c for c then the rectification by logarithm: 

Y =log(y—a), X =a: Y=loga+bloge-X. (2.258b 
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In order to determine c as for (2.256d) two arbitrary abscissae x1, 2 are to be chosen and x3 = LAS 
yı + y» — 2ys 
and b one can get a corrected c, namely as the average of the amounts y — az". 


2.16.2.3 Quadratic Polynomial 
Possible shapes of curves of the quadratic polynomial 

y = ax? +br +e (2.259a 
are shown in Fig. 2.84. For the discussion of quadratic polynomials (2.41) and their curves Fig. 2.12 


(see 2.3.2, p. 64). Usually the coefficients a, b and c are determined by the least squares method; bu 
in this case also rectification is possible. Choosing an arbitrary point of data (x1, y1) one rectifies 


as well as the corresponding ordinates yi, Y2, Y3 to get c = . After having determined a 


y-W. 
=k 


X= Y Y = (b + axi) + aX. (2.259b 


If the given x values form an arithmetical sequence with a difference h, one rectifies 
Y=Ay, X=a: Y =(bh+ah?) + 2ahx. (2.259c 
In both cases after the determination of a and b from the equation 


Soy = aya? b rnc (2.259d 


cis to be calculated; n is the number of the given x values, for which the sum is calculated. 


2.16.2.4 Rational Linear Functions 


The rational linear function 


ax +b 
y= 2.260a 
dp" +d ( ) 
is discussed in 2.4 with (2.46) and graphical representation Fig. 2.17 (see p. 66). 
Choosing an arbitrary data point (x1, y1) the rectification is done by the formulas 


L- 21 
y-u' 
After determining the values A and B the relation can be written in the form (2.260c). Sometimes 
instead of (2.260a) the form (2.260d)is sufficient: 


Y 


X=r: Y=A+BX. (2.260b) 


2 — Tı z 1 
στ οι (2.260 - > y= . 2.260d 
A+ Ba’ 0909) Y= ad V V ard (272900) 


yur 


1 1 x 
Then in the first case the rectification can be made by X and Y or X = «and Y and by 
2 y y 


1; 
X = x and Y = — in the second case. 


2.16.2.5 Square Root of a Quadratic Polynomial 
Several possible shapes of curves of the equation 

yY =ar bxc (2.261) 
are shown in Fig. 2.85. The discussion of the function (2.52) and its graph Fig. 2.23 (see p. 71). 


If introducing the new variable Y = y?, the problem can be reduced to the case of the quadratic poly- 
nomial in 2.16.2.3, p. 111. 
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Figure 2.85 Figure 2.86 


2.16.2.6 General Error Curve 
The typical shapes of curves of the functions 


ba+ex? 


y =ae or logy = loga + ba loge + cx” loge (2.262) 
are shown in Fig. 2.86. The discussion of the function with equation (2.61) and its graph Fig. 2.31 
(see p. 75). 
Introducing the new variable Y = log y, the problem is reduced to the case of the quadratic polynomial 
in 2.16.2.3, p. 111. 


2.16.2.7 Curve of Order Three, Type II 
The possible shapes of graphs of the function 

1 
ax? + ba +c 
are represented in Fig. 2.87. The discussion of the function with equation (2.48) and with graphs 
Fig. 2.19 (see p. 67). 


y= (2.263) 


1 
By introducing the new variable Y — —, the problem is reduced to the case of the quadratic polynomial 
y 
in 2.16.2.3, p. 111. 
y yA 
0 X 
0 x 
Figure 2.87 Figure 2.88 


2.16.2.8 Curve of Order Three, Type III 
Typical shapes of curves of functions of the type 
2 


----------- 2.264 
az? + ὑπ +c (2.264) 


y= 
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are represented in Fig. 2.88. The discussion of the function with equation (2.49) and with graphs 
Fig. 2.20 (see p. 68). 


Introducing the new variable Y = Z the problem can be reduced to the case of the quadratic polynomial 
y 

in 2.16.2.3, p. 111. 

2.16.2.9 Curve of Order Three, TypeI 

Typical shapes of curves of functions of the type 
b a 

y=a+2+Ś4 (2.265) 

T X 


are represented in Fig. 2.89. The discussion of the function with equation (2.47) and with graphs 
Fig. 2.18 (see p. 67). 


1 
Introducing the new variable X — — the problem can be reduced to the case of the quadratic polynomial 
x 
in 2.16.2.3, p. 111. 
A 
ys y 
" d x 
0 x 
Figure 2.89 Figure 2.90 


2.16.2.10 Product of Power and Exponential Functions 
Typical shapes of curves of functions of the type 

y ασε (2.266a) 
are represented in Fig. 2.90. The discussion of the function with equation (2.62) and with graphs 
Fig. 2.31 (see p. 75). 
If the empirical values of x form an arithmetical sequence with difference h, the rectification follows in 
accordance with 
Y=Alogy, X-Alogz: Y —hcloge-- 6X. (2.266b) 
Here A log y and A log x denote the difference of two subsequent values of log y and log x respectively. 
If the x values form a geometric sequence with quotient q, then the rectification follows by 
X-—z, Y=Alogy: Y =blogq + c(q — 1)X log e. (2.266c) 
After b and c are determined the logarithm of the given equation is taken, and the value of loga is 
calculated like in (2.259d). 
If the given x values do not form a geometric sequence, but one can choose pairs of two values of x such 
that their quotient q is the same constant, then the rectification is the same as in the case of a geometric 
sequence of x values with the substitution Y = A, log y. Here Δι log y denotes the difference of the two 
values of log y whose corresponding x values result in the constant quotient q (see 2.16.2.12, p. 114). 


2.16.2.11 Exponential Sum 
Typical shapes of curves of the exponential sum 
y = ae" + ce™ (2.267a) 
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are represented in Fig. 2.91. The discussion of the function with equation (2.60) and with graphs 
Fig. 2.29 (see p. 74). 

If the values of x form an arithmetical sequence with difference h, and y, yi, y» are any three consecutive 
values of the given function, then the rectification is made by 


νο y="; yoo sexo ih uu (2.267b) 
y y 
After b and d are determined, follows again a rectification by 
Yaye*, ἁ- ει ὃς. Y=aX +c. (2.267c) 


2.16.2.12 Numerical Example 

Find an empirical formula to describe the relation between x and y for given values in Table 2.9. 
Choice of the Approximation Function: Comparing the graph prepared from the given data (Fig. 
2.92) with the curves discussed before, one can see that formulas (2.264) or (2.266a) with curves in 
Fig. 2.88 and Fig. 2.90 can fit the considered case. 


Determination of Parameters: Using the formula (2.264) to rectify are AŽ and x. The calculation 
: y 

shows, however, the relationship between x and A is far from linear. To verify whether the formula 

(2.266a) is suitable one plots the graph of the relation between Alog x and Alogy for h = 0,1 in 

Fig. 2.93, and also between A; logy and x for q = 2 in Fig. 2.94. In both cases the points fit a 


straight line well enough, so the formula y = az"e** can be used. 


Table 2.9 For the approximate determination of an empirically given function relation 


zy Δ lgs | lgy | Alex | Algy | Δι]βῳ | Yer 
0.1 | 1.78 0.056 0.007 | —1.000 0.250 .301 0.252 0.252 1.78 
0.2 | 3.18 0.063 0.031 | —0.699 0.502 .176 +0.002 —0.097 3.15 
0.3 | 3.19 0.094 0.063 | —0.523 0.504 0.125 —0.099 —0,447 3.16 
0.4 | 2.54 0.157 0.125 | —0.398 0.405 .097 —0.157 —0.803 2.52 
0.5 | 1.77 0.282 0.244 | —0.301 0.248 .079 —0.191 —1.134 1.76 
0.6 | 1.14 0.526 0.488 | —0.222 0.057 .067 —0.218 —1.455 1.14 


0.7 | 0.69 | 1.014 | 0.986 | —0.155 | —0.161 .058 | —0.237 = 0.70 
0.8 | 0.40 | 2.000 | 1.913 | —0.097 | —0.398 051 —0.240 = 0.41 
0.9 | 0.23 | 3.913 | 3.78 | —0.046 | —0.638 .046 | —0.248 -- 0.28 
1.0 | 0.13 | 7.69 8.02 0.000 | —0.886 041 —0.269 a 0.13 
1.1 | 0.07 | 15.71 | 14.29 0.041 | —1.155 .038 | —0.243 = 0.07 
1.2 | 0.04 | 30.0 = 0.079 | —1.398 = — - 0.04 


In order to determine the constants a, b and c, a linear relation between x and A, log y is to be searched 
by the method of mean values. Adding the conditional equations A, log y = blog 2 + cx loge in groups 
of three equations each, yields 

—0.292 = 0.903b + 0.2606c, --5.392 = 0.903b + 0.6514c, 


and from here b = 1.966 and c = —7.932 holds. To determine a, the equations of the form logy = 
loga + blogz + εἰορε: x are to be added, which yields —2.670 = 12 log a — 6.529 — 26.87, so from 
loga = 2.561, a = 364 follows. The values of y calculated from the formula y = 364x177" are 
given in the last column of Table 2.9; they are denoted by Yer, as an approximation of y. The error 
sum of squares is 0.0024. 

Using the parameters determined by rectification as initial values for the iterative solution of the non- 
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Figure 2.93 Figure 2.94 


linear least squares problem (see 19.6.2.4, p. 987) 


Sin — aa e^? = min! 


yields a = 396.601 986, b = 1.998 098, c = —8.000 0916 with the small error sum of squares 0.000 0916. 


2.17 Scales and Graph Paper 
2.17.1 Scales 


The base of a scale is a function y = f(x). The task is to construct a scale from this function so that 
on a curve, e.g. on a line, the function values of y are to be inserted as the values of the argument x. A 
scale can be considered as a one-dimensional representation of a table of values. 

The scale equation for the function y = f(x) is: 


y — l[f(x) — (απο). (2.268) 
The starting point of the scale is fixed at zo. The scale factor l takes into consideration that for a con- 
crete scale there it is only one given scale length. 


ΒΑ Logarithmic Scale: For | 10 cm and zo = 1 the scale equation is 
y = 10[lg x — lg 1] = 101g x (in cm). For the table of values 
x [1| 2 | 3 | 4 {5 | 6 | 7 | 8 | 9 10 
y = Iga | 0 | 0.30 | 0.48 | 0.60 | 0.70 | 0.78 | 0.85 | 0.90 | 0.95 | 1 


one gets the scale shown in Fig. 2.95. 
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Figure 2.95 


W B Slide Rule: The most important application of the logarithmic scale, from a historical view- 
point, was the slide rule. Here, for instance, multiplication and division were performed with the help 
of two identically calibrated logarithmic scales, which can be shifted along each other. 

From Fig. 2.96 one can read: y3 = yı + ys, i.e., lg x3 = lg xı +lg xo = lg x, x2, hence 73 = πι £2; Yı = 


. T3 T3 
Ua — Yo, Le., lg xı = lg £3 — Iga = lg — , so 21 = ; 

22 22 
W C Volume Scale on the lateral surface of a conical shaped funnel: A scale is to be marked on the 
funnel, so that the volume inside could be read from it. The data of the funnel are: Height Η = 15 cm, 
upper diameter D — 10 cm. 


Í; 
Taking in mind Fig. 2.97a gives the scale equation as follows: Volume V = gr πι, apothem s = 


τ. D/2 1 
vh? +r?, tana = z = 7; gs From these h = 3r, s 2 ry10, V = (y follows, so the scale 
i 3 
v10 


set 10 ,— . 3/5 : p , . 
equation is s = yu VV z 2.6V V. The following table of values contains the calibration marks on 
VL 


the funnel as in the figure: 


V|[0| 50 | 100 | 150 | 200 | 250 | 300 | 350 
s | 0] 7.96 | 10.03 | 11.48 | 12.63 | 13.61 | 14.46 | 15.22 | 
E 3007 
1 X Xs 10 
I I I ] 
X» 1 X, 10 H 
I I ] 
2 Ἴ 
M 
i i b) 
Figure 2.96 Figure 2.97 


2.17.2 Graph Paper 


The most useful graph paper is prepared so that the axes of a right-angle coordinate system are cali- 
brated by the scale equations 


x —h[g(u) —g(uo)], y — l[f(v) — /(υο)]. (2.269) 
Here lı and l5 are the scale factors; ug and vo are the initial points of the scale. 
2.17.2.1 Semilogarithmic Paper 


If the z-axis has an equidistant subdivision, and the y-axis has a logarithmic one, then one talks about 
semilogarithmic paper or about a semilogarithmic coordinate system. 


Scale Equations: 
x =I,[u— uo] (linear scale), y-—l»lgv —1lgvo] (logarithmic scale). (2.270) 
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The Fig. 2.98 shows an example of semilogarithmic paper. 


Representation of Exponential Functions: On semilogarithmic paper the graph ofthe exponential 
function 


y — ae? (α,β const) (2.271a) 
is a straight line (see rectification in 2.16.2.2, p. 110). This property can be used in the following way: 
If the measuring points, introduced on semilogarithmic paper, lie approximately on a line, it can be 
supposed a relation between the variables as in (2.271a). With this line, estimated by eye, one can 
determine the approximate values of a and 3: Considering two points Pi(z1,y1) and P2(x2, y2) from 
this line one gets 


u In yo — In yı 


8 and, e.g, a= yer". (2.271b) 
29 — Tı 
γα n : 
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2.17.2.2 Double Logarithmic Paper 
If both axes of a right-angle x, y coordinate system are calibrated with respect to the logarithm function, 
then one talks about double logarithmic paper or log-log paper or a double logarithmic coordinate system. 
Scale Equations: The scale equations are 

x = hflg u — lg uo], y = lo[lg v — lg vo]; (2.272) 
where l,l are the scale factors and uo, vo are the initial points. 
Representation of Power Functions (see 2.5.3, p. 71): Log-log paper has a similar arrangement 
to semilogarithmic paper, but the x-axis also has a logarithmic subdivision. In this coordinate system 
the graph of the power function 

y — aa^ (α,β const) (2.273) 
is a straight line (see rectification of a power function in 2.16.2.1, p. 109). This property can be used in 
the same way as in the case of semilogarithmic paper. 
2.17.2.3 Graph Paper with a Reciprocal Scale 


The subdivisions of the scales on the coordinate axes follow from (2.45) for the function of inverse 
proportionality (see 2.4.1, p. 66). 


a a 


Scale Equations: x —l[u — uj], y= lz | | (a const), (2.274) 


U U9 
where {1 and l5 are the scale factors, and ug, vo are the starting points. 


W Concentration in a Chemical Reaction: For a chemical reaction, the concentration denoted by 
c = c(t), has been measured as a function of the time t, giving the following results for c: 
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t/min | 5 | 10 | 20 | 40 
c: 108 /mol/T | 15.53 | 11.26 | 7.27 | 4.25 


It can be supposed that the reaction is of second order, i.e., the relation should be 


Hie «Ὁ ---- 
c(t) = Irak (co, k const). (2.275) 


; : ; 1 1 ; 
Taking the reciprocal value of both sides, one gets = = — + kt, i.e., (2.275) can be represented as a 
C Co 
line, if the corresponding graph paper has a reciprocal subdivision on the y-axis and a linear one on the 
Ρ : m 1 
x-axis. The scale equation for the y-axis is, e.g., y = 10: — cm. 
U 
It is obvious from the corresponding Fig. 2.99 that the measuring points lie approximately on a line, 
i.e., the supposed relation (2.275) is acceptable. 
From these points the approximate values of both parameters k (reaction rate) and co (initial concen- 
tration) can be determined. Choosing two points, e.g., P, (10, 10) and P2(30, 5), one gets: 
l/e — 1/eo 
T dd 


2.17.2.4 Remark 

There are several other possibilities for constructing and using graph papers. Although today in most 
cases there are high-capacity computers to analyze empirical data and measurement results, in every- 
day laboratory practice, when getting only a few data, graph papers are used quite often to show the 
functional relations and approximate parameter values needed as initial data for applied numerical 
methods (see the non-linear least squares method in 19.6.2.4, p. 987). 


k & 0.005, co zz 20 - 1073. 


2.18 Functions of Several Variables 
2.18.1 Definition and Representation 


2.18.1.1 Representation of Functions of Several Variables 
A variable value u is called a function of n independent variables £1, £2, . . ‚£n, if for given values of the 
independent variables, u is a uniquely defined value. Depending on how many variables there are, two, 
three, or n, one writes 

u-—f(zy), w= fryz); u= ftit tn) (2.276) 
Substituting given numbers for the n independent variables yields a value system of the variables, which 
can be considered as a point of the n-dimensional space. The single independent variables are called 
arguments; sometimes the entire n tuple together is called the argument of the function. 
Examples of Values of Functions: 
WB A: u= f(x,y) = xy? has for the value system x = 2, y = 3 the value f (2,3) = 2-3? = 18. 
WB B: u= f(x,y,z,t) = x In(y — zt) takes for the value system x = 3, y = 4, z = 3, t = 1 the value 
f(3,4,3, 1) = 31n(4 — 3- 1) = 0. 


2.18.1.2 Geometric Representation of Functions of Several Variables 


1. Representation ofthe Value System of the Variables 

The value system of an argument of two variables x and y can be represented as a point of the plane 
given by Cartesian coordinates x and y. A value system of three variables x, y, z corresponds to a point 
given by the coordinates x, y, z in a three-dimensional Cartesian coordinate system. Systems of four 
or more coordinates cannot be represented obviously in our three-dimensional imagination. 

Similarly to the three-dimensional case the system of n variables x4, £2,. .. ,r,, is to be considered as a 
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point of the n-dimensional space given by Cartesian coordinates 21, 29 „Zn. In the above example 
B, the four variables define a point in four-dimensional space, with coordinates x = 3, y = 4, z = 3 
andt= 1. 


u 2. Representation of the Function u = f(x,y) of Two 
Variables 
a) A function of two independent variables can be represented by a sur- 
0 face in three-dimensional space, similarly to the graph representation 
of functions of one variable (Fig. 2.100, see also 3.6.3, p. 261). Consid- 
ering the values of the independent variables of the domain as the first 
y two coordinates, and the value of the function u = f(x,y) as the third 
coordinate of a point in a Cartesian coordinate system, these points 
Figure 2.100 form a surface in three-dimensional space. 


Examples of Surfaces of Functions: 


E A:u—1-— : — 1 represents a plane (Fig. 2.101a, sce also 3.5.3.10, p. 218). 
Bi. Lu 
BB: u= > + T represents an elliptic paraboloid (Fig. 2.101b, see also 3.5.3.13, 5., p. 226). 


BC: u 16 -- 
b) The shape of the surface of the function u = f(x,y) can be pictured with the help of intersection 
curves, which can be get by intersecting the surface parallel to the coordinate planes. The intersection 
curves u = const are called level curves. 


ll 


— y? represents a hemisphere with r = 4 (Fig. 2.101c). 


Β In Fig. 2.101b,c the level curves are ellipses or concentric circles (not denoted in the figure). 
Remark: A function with an argument of three or more variables cannot be represented in three- 
dimensional space. Similarly to surfaces in three-dimensional space also the notion of a hyper surface 
in n-dimensional space is in use. 


Figure 2.101 


2.18.2 Different Domains in the Plane 
2.18.2.1 Domain of a Function 


The domain of definition of a function (or domain of a function) is the set of the system of values or points 
which can be taken by the variables of the argument of the function. The domains defined in this way 
can be very different. Mostly they are bounded or unbounded connected sets of points. Depending on 
whether the boundary belongs to the domain or not, the domain is closed or open. An open, connected 
set of points is called a domain. If the boundary belongs to the domain, it is called a closed domain, if 
it does not, sometimes it is called an open domain. 


2.18.2.2 Two-Dimensional Domains 


Fig. 2.102 shows the simplest cases of connected sets of points of two variables and their notation. Do- 
mains are represented here as the shaded part; closed domains, i.e., domains whose boundary belongs to 
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them, are bounded by thick curves in the figures; open domains are bounded by dotted curves. Includ- 
ing the entire plane there are only simply connected domains or simply connected regionsin Fig. 2.102. 


2.18.2.3 Three or Multidimensional Domains 

These are handled similarly to the two-dimensional case. It concerns also the distinction between sim- 
ply and multiply connected domains. Functions of more than three variables will be geometrically 
represented in the corresponding n-dimensional space. 


unbounded y 
closed 
domain 


unbounded 
open domain 


bounded 
open domain 


Figure 2.102 


2.18.2.4 Methods to Determine a Function 


1. Definition by Table of Values Functions of several variables can be defined by a table of values. 
An example of functions of two independent variables are the tables of values of elliptic integrals (see 
21.9, p. 1103). The values of the independent variables are denoted on the top and on the left-hand 
side of the table. The required value of the function is in the intersection of the corresponding row and 
column. It is called a table with double entry. 
2. Definition by Formulas Functions of several variables can be defined by one or more formulas. 


at+y forx>0, y>0, 

BMA: u= ry. BC: ue x—y forx>0, y<0, 
BB: u= x In(y — zt). u=] —o+y forx<0, y>0, 

—-r—yforz«0,gy«0. 
3. Domain of a Function Given by One Formula In the analysis mostly such functions are 
considered which are defined by formulas. Here the union of all value systems for which the analytical 
expression has a meaning is considered to be the domain, i.e., for which the expression has a unique, 
finite, real value. 
Examples for Domains: 
E A: u = 2? +y: The domain is the entire plane. 


1 y ; m } ; 
E B: u = ——— ——,——,: The domain consists of all value systems z, y, satisfying the inequality 


J16 — α) — 3? 


x? +y? < 16. Geometrically this domain is the interior of the circle in Fig. 2.103a, an open domain. 


WB C: u= arcsin(x + y): The domain consists of all value systems x, y, satisfying the inequality —1 < 
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x+y < 51, i.e., the domain of the function is a closed domain, the stripe between the two parallel lines 
in Fig. 2.103b. 

E D: u = arcsin(2z — 1) + V1— y? + vy + Inz: The domain consists of the value system x, y, 2, 
satisfying the inequalities 0 € x < 1,0 € y < 1, z > 0, i.e., it consists of all points of the three 
dimensional x, y, z space lying above a square with side-length 1 shown in Fig. 2.103c. 
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Figure 2.103 
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Figure 2.105 


y 
If from the interior of the considered part of the plane a point or a 
bounded, simply connected point set is missing, as shown in Fig. 2.104, 
it is called a doubly-connected domain or doubly-connected region. Mul- 
tiply connected domains are represented in Fig. 2.105. A non-connected 
6 > region is shown in Fig. 2.106. 


Figure 2.106 


2.18.2.5 Various Forms for the Analytical Representation of a Function 


Functions of several variables can be defined in different ways, just as functions of one variable. 
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1. Explicit Representation 
A function is given or defined in an explicit way if its value (the dependent variable) can be expressed 
by the independent variables: 

w= f nda, gus (2.277) 


2. Implicit Representation 
A function is given or defined in an implicit way if the relation between its value and the independent 
variables is given in the form: 

F(11,22,...,24,u) =0, (2.278) 
if there is a unique value of u satisfying this equality. 
3. Parametric Representation 
A function is given in parametric form if the n arguments and the function are defined by n new vari- 
ables, the parameters, in an explicit way, supposing there is a one-to-one correspondence between the 
parameters and the arguments. For a two-variable function, for instance 


x=(r,s), y-—w(r,s), u-x(r,s), (2.2792) 
and for a three-variable function 
z—«(rst), y-—w(rst), z-—x(nst), u= xk(r,s,t) (2.279b) 


etc. 
4. Homogeneous Functions 
A function f (a1, %2,...,£n) of several variables is called a homogeneous function if the relation 
f (Ax, λαο,..., Ars) = AF (21, ο0,...., Tn) (2.280) 
holds for arbitrary A. The number m is the degree of homogeneity. 


ΒΛ: For u(z, y) = a? — 3xy +y? 4 τ the degree of homogeneity is m — 2. 
y 


2 }- 2 
ο... , the degree of homogeneity is m = 0. 
2x — 3y ` 


2.18.2.6 Dependence of Functions 
1. Special Case of Two Functions 


Two functions of two variables u = f(x,y) and v = (x,y), with the same domain G, are called 
dependent functions if one of them can be expressed as a function of the other one u = F (v). For every 
point of the domain G of the functions the identity 

f(t,y) = Flle, y)) or Φ(}.ῳ) =0 (2.281) 
holds. If there is no such function F(p) or (f, p) , one speaks about independent functions. 
Bl u(x, y) = (a? «νι v = Va? +y? are defined in the domain x? + y? > 0, i.e., in the whole x, y 
plain, and they are dependent, because u = v! holds. 


WB B: Foru(r,y) = 


2. General Case of Several Functions 
Similarly to the case of two functions, the m functions u1, U2,. . . , Um of n variables x1, 15,. .. , x, in their 
common domain G are called dependent if one of them can be expressed as a function of the others, 
i.e., if for every point of the domain G the identity 

ui = f(uy,us,..., Ui—1, Ui+l;---;Um) Or (u1, uo,..., Um) = 0 (2.282) 


is valid. If there is no such functional relationship, they are independent functions. 


Β The functions u = πι + 32 ++ En, V = T1? +2 +- r,? and w = T182 + 3133 d: d 31044 
2 
2924 +- + $4 11, are dependent because v = u^ — 2w holds. 
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3. Analytical Conditions for Independence 
Suppose every partial derivative mentioned below exists. Two functions u = f(x, y) and 


v = q(r,y) are independent on a 
Of Of domain 7 their ο determi- 
ΠΣ, Πε . ] nant or Jacobian determinant is not 
as V. short (^ 9) Or 28, » (2.283a) identically zero here. Analogously, 
Bp ὃρ D(v, y) Dv, y) in the case of n functions of n vari- 
Or Oy ables uy = fi(zi,...,24),..., Un = 
fultus bn): 
Of, Of of 
Or, Ot. ^7 Oan i the number m of the e 
- aF - ions U1, U2, ..., Um is smaller 
5} 8 2 2E . Dífis fo; sd) : than the number of variables 
Or, ÖT Or, | = Ῥίτι,α....., En) 0 .(2.283b) T1, T2, ..., En, these functions 
: : : : eee are independent if at least one 
Of, Of, Of, subdeterminant of order m of 
dr, δα; ᾿ Om, the matrix (2.283c) is not iden- 
tically zero: 

Ou, Oui Qui The number of independent func- 

Oz, Ot. ^^ Om, tions is equal to the rank r of the ma- 

Bis Dus Aus trix (2.283c) (see 4.1.4, 7., p. 274). 

- T e Here these functions are indepen- 

Or, Φα; ὅσα |. (2.283c) dent, whose derivatives are the ele- 

: : : : ments of the non-vanishing determi- 

Ou, OUm Qu, nant of order r. If m > n holds, then 

Ox, Ot δε, among the given m functions at most 


n can be independent. 


2.18.3 Limits 

2.18.3.1 Definition 

A function of two variables u = f(x, y) has a limit A at x = a, y = bif when x and y are arbitrarily 
close to a and b, respectively, then the value of the function f(a, y) approaches arbitrarily closely the 
value A. Then one writes: 


lim f(z,y) Α. (2.284) 


yb 


The function may not be defined at (a, b), or if it is defined here, may not have the value A. 


2.18.3.2 Exact Definition 


A function of two variables u = f(x,y) has at the point (a, b) a limit 


A — lim f(v, y) (2.285a) 
yb 
if for arbitrary positive € there is a positive 7 such that (see Fig. 2.107) 
f(x,y) - AL <€ (2.285b) 
*" X holds for every point (x, y) of the square 
Figure 2.107 jc-—al<n, j|y-b| «nm. (2.285c) 


2.18.3.3 Generalization for Several Variables 


a) The notion of limit of a function of several variables can be defined analogously to the case of two 
variables. 


124 2. Functions 


b) The criteria for the existence of a limit of a function of several variables can be obtained by gener- 

alization of the criterion for functions of one variable, i.e., by reducing to the limit of a sequence or by 

applying the Cauchy condition for convergence (see 2.1.4.3, p. 53). 

2.18.3.4 Iterated Limit 

If for a function of two variables f (x, y) first the limit for x — a has been determined, i.e., lim f(x,y) 
x a 

for constant y, then for the function obtained, which is now a function only of y, one determines the 

limit for y > b, then the resulting number 


B = lim (lim f τμ) (2.286a) 
yob Nr—a 
is called an iterated limit. Changing the order of calculations generally yields an other limit: 
C — lim (tim f z,y)) ; (2.286b) 
αλα Ny—b 


In general B ¥ C holds, even if both limits exist. 


qe — y? + 73 + y? 


W For the function f(x,y) = 


and C = +1. 
Remark: If the function f(x, y) has a limit A = lim f(x, y), and both B and C exist, then B = C = A 


yb 
is valid. The existence of B and C does not follow from the existence of A. From the equality of the 
limits B — C the existence of the limit A docs not follow. 


2.18.4 Continuity 

A function of two variables f(x, y) is continuous at x = a, y = b, i.e., at the point (a, b), if 1. the point 
(a, b) belongs to the domain of the function and 2. the limit for x — a, y — b exists and is equal to the 
value, i.e., 


lim f(x,y) = f(a, 0). (2.287) 


yb 


for x + 0, y > 0 one gets the iterated limits B = —1 


r? +y 


Otherwise the function has a discontinuity at x = a, y = b. If a function is defined and continuous at 
every point of a connected domain, it is called continuous on this domain. Similarly can be defined the 
continuity of functions of more than two variables. 


2.18.5 Properties of Continuous Functions 


2.18.5.1 Theorem on Zeros of Bolzano 

If a function f(x, y) is defined and continuous in a connected domain, and at two points (x1, y1) and 
(x2, Y2) of this domain the values have different signs, then there exists at least one point (x3, y3) in this 
domain such that f(a, y) is equal to zero there: 


f(v3,y3) 50, if f(zı,yı)>0 and f(x, yo) <0. (2.288) 
2.18.5.2 Intermediate Value Theorem 


If a function f(x, y) is defined and continuous in a connected domain, and at two points (x1, y1) and 
(12, y) it has different values A = f(x1, yı) and B = f(x, y»), then for an arbitrary value C between 
A and B there is at least one point (3, y3) such that: 

f(z3,9)- C, A«C«B ο B«C«A. (2.289) 


2.18.5.3 Theorem About the Boundedness of a Function 


If a function f(x, y) is continuous on a bounded and closed domain, it is bounded in this domain, i.e., 
there are two numbers m and M such that for every point (x, y) in this domain: 


m< f(x,y) € M. (2.290) 
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2.18.5.4 Weierstrass Theorem (About the Existence of Maximum and 
Minimum) 

If a function f(x,y) is continuous on a bounded and closed domain, then it takes its maximum and 

minimum here, i.e., there is at least one point (2^, y^) such that all the values f(x, y) in this domain are 

less than or equal to the value f(x’, y’), and there is at least one point (x", y") such that all the values 

f(x,y) in this domain are greater than or equal to f(x", y"): For any point (x, y) of this domain 


fey) 2 f(v.y) 2 ία”. y") (2.291) 


is valid. 
2.19 Nomography 
2.19.1 Nomograms 


Nomograms are graphical representations of a functional correspondence between three or more vari- 
ables. From the nomogram, the corresponding values of the variables of a given formula — the key 
formula — in a given domain of the variables can be immediately read directly. Important examples of 
nomograms are net charts and alignment charts. 

Nomograms are still used in laboratories, even in the computer age, for instance to get approximate 
values or starting guesses for iterations. 


2.19.2 Net Charts 
To represent a correspondence between the variables x, y, z given by the equation 

F(z,y,z) 20 (2.292) 
(or in many cases explicitly by z = f(x, y)), the variables can be considered as coordinates in space. 
The equation (2.292) defines a surface which can be visualized on two-dimensional paper by its level 
curves (see 2.18.1.2, p. 118). Here, a family of curves is assigned to each variable. These curves form a 
net: The variables x and y are represented by lines parallel to the axis, the variable z is represented by 
the family of level curves. 
E Ohm’s law is U = R.I. The voltage U can be represented by its level curves depending on two 
variables. If and / are chosen as Cartesian coordinates, then the equation U — const for every con- 
stant corresponds to a hyperbola (Fig. 2.108). By looking at the figure one can tell the corresponding 
value of U for every pair of values R and J, and also | corresponding to every R,U, and also R corre- 
sponding to every J and U. Of course, the investigation is always to be restricted to the domain which 
is interpreted: In Fig. 2.108 holds 0 < R «10,0 < 7 < 10and 0 < U < 100. 


Remarks: 
1. By changing the calibration, the nomogram can be used for other domains. If, e.g., in (Fig. 2.108) 
the domain 0 < J < 1 should be represented but R should remain the same, then the hyperbolas of U 
are to be marked by U/10. 
2. By application of scales (see 2.17.1, p. 115) it is possible to transform nomograms with complicated 
curves into straight-line nomograms. Using uniform scales on the x and y axis, every equation of the 
form 

αφί2) + yu(z) + x(z) = 0 (2.293) 
can be represented by a nomogram consisting of straight lines. If function scales x = f(z.) and y = 
g(z2) are used, the equation of the form 


f(z2)e(z1) + g(22)0(3) + χ(αι) = Ὁ (2.294) 
has a representation for the variables σι, 22 and z3 as two families of curves parallel to the axis and an 
arbitrary family of straight lines. 


W By applying a logarithmic scale (see 2.17.1, p. 115), Ohm’s law can be represented by a straight-line 
nomogram. Taking the logarithm of R- J = U gives log R + log I = logU . Substituting x = log R 
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Figure 2.108 Figure 2.109 


and y = log / results in x + y = log U , i.e., a special form of (2.294). The corresponding nomogram is 
shown in Fig. 2.109. 


2.19.3 Alignment Charts 


A graphical representation of a relation between three variables z;, z2 and z3 can be given by assigning 
a scale (see 2.17.1, p. 115) to each variable. The z; scale has the equation 
αι vizi) yi = viu) (@=1,2,3). (2.295) 


The functions y; and wv; are chosen in such a manner that the values of the three variables σι, 22 and 
z3 satisfying the nomogram equation should lie on a straight line. To satisfy this condition, the area of 
the triangle, given by the points (x1, y1), (29. ys) and (23, y3) , must be zero (see (3.301), p. 195), i.e., 


ryt gilzr) Ψι(5ι} 1 
X9 Y2 l| = | poze) φο(σο) 1| =0 (2.296) 
T3 ya 1 p3(z3) Wa(z3) 1 


must hold. Every relation between three variables σι, 22 and z3, which can be transformed into the 
form (2.296), can be represented by an alignment nomogram . 
Next, follows the description of some important special cases of (2.296). 


2.19.3.1 Alignment Charts with Three Straight-Line Scales Througha Point 


If the zero point is chosen for the common point of the lines having the three scales z4, 20 or 24 then 
(2.296) has the form 
gı(21) magi) 1 
q2(22) mao (22) 1 
Q3 (23) m3p3(23) 1 
since the equation of a line passing through the origin has the equation y = ma. Evaluating the 
determinant (2.297), yields 


=i; (2.297) 


m-m, m-m m-m_ 0 2 2983 
φι(5ι) | (2(22) | ασ) ( ) 
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or 
a t E + 2a 0 with Ci + C5 - C3 — 0. (2.298b) 
φιίσι) v») Φε.) 


Here C1, Cz and C3 are constants. 
nosse ος : — : . . : 
W The equation — + nem f is a special case of (2.298b) and it is an important relation, for instance in 
α 


optics or for the parallel connection of resistances. The corresponding alignment nomogram consists 

of three uniformly scaled lines. 

2.19.3.2 ο. Charts with Two Parallel Inclined Straight-Line Scales 
and One Inclined Straight-Line Scale 


One of the scales is put on the y-axis, the other one on another line parallel to it at a distance d. The 
third scale is put on a line y = ma. In this case (2.296) has the form 


0 φι(σι) 1 
d ΠΟ 1-0. (2.299 
ασ) mip3(z3) 1 


Evaluation of the determinant by the first column yields 


d (mqa(z3) — Ψι(2ι)) + φα(σα) (Y1 (21) — Ψα(5α)) = 0. (2.300a 
Consequently: 
3(23) — d 
Φι(σι) zt P ) (W2(z2) — md) = 0 oder f(z1) - g(z3) — h(z2) = 0. (2.300b 
3 (23 
It is often useful to introduce measure scales E, and E; of the form 
E: 
Efla) IC) — Ezh(z2) = 0. (2.300c 
1 
d 
Then, ea(23) = ——z,—— holds. The relation E» : E, can be chosen so that the third scale is pulled 
I= g 909 
1 


near a certain point or it is gathered. Substituting m = 0, then E2h(z2) = w»(z2) and in this case, 
the line of the third scale passes through both the starting points of the first and of the second scale. 
Consequently, these two scales must be placed with a scale division in opposite directions, while the 
third one will be between them. 
W The relation between the Cartesian coordinates x and y of a point in the x, y plane and the corre- 
sponding angle ¢ in polar coordinates is: 

y? = acta o. (2.301) 
The corresponding nomogram is shown in Fig. 2.110. The scale division is the same for the scales of x 
and y but they are oriented in opposite directions. In order to get a better intersection with the third 
scale between them, their initial points are shifted by a suitable amount. The intersection points of the 
third scale with the first or with the second one are marked by y = 0 or y = 90° respectively. 
W For instance: x = 3 y = 3.5, gives y © 49.5°. 
2.19.3.3 Alignment Charts with Two Parallel Straight Lines and a 

Curved Scale 


If one of the straight-line scales is placed on the y-axis and the other one is placed at a distance d from 
it, then equation (2.296) has the form 


0 dia) 1 
d Va(23) 1-0. (2.302) 
(p3(23) 3(z3) 1 
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Consequently: 
(p3(23) w3(23) 
Φφι(σι) + Palza d = 
pilata Ὁ) d es(an) d — p3(23) 


Choosing the scale Æ; for the first scale and E» for the second one, then (2.303a) is transformed into 


(2.303a) 


Ef (zi) 4 πο h(z3) + Eyk(z3) = 0, (2.303b) 
where ψι(2ι) = Ei f(z), Vo(22) = Bog(z2) and 
Qa(23) = ο and wWs(z3) = BEES (s) (2.303c) 


- Ey + Eyh(z3) ΒΕ E,h(z3) 


holds. 
Β The reduced third-degree equation 2 + p*z + q* = 0 (see 1.6.2.3, p. 40) is of the form (2.303b). 
After the substitutions E; = By = 1 and f(z) = q* , g(z2) = p*, h(z3) = z, the formulas to calculate 


9 
d- αὖ 
1 -- and y = ψη(α) = mr 


In Fig. 2.111 the curved scale is shown only for positive values of z. The negative values one gets by 
replacing z by —z and by the determination of the positive roots from the equation z? 4- p*z — q* — 0. 
The complex roots u + iv can also be determined by nomograms. Denoting the real root, which always 
exists, by σι, then the real part of the complex root is u = —2;/2, and the imaginary part v can be 


the coordinates of the curved scale are x = y3(z) = 


2v +p -- 0. 


eI w 


determined from the equation 3u? — v? + p* = 


E For instance: y? + 2y — 5 = 0, i.e., p* = 2,q* = —5. One reads σι & 1.3. 


2.19.4 Net Charts for More Than Three Variables 


To construct a chart for formulas containing more than three variables, the expression is to decompose 
by the help of auxiliary variables into several formulas, each containing only three variables. Here, every 
auxiliary variable must be contained in exactly two of the new equations. Each of these equations is to 
be represented by an alignment chart so that the common auxiliary variable has the same scale. 


3 Geometry 


3.1 Plane Geometry 


3.1.1 Basic Notations 
3.1.1.1 Point, Line, Ray, Segment 
1. Point and Line 


Points and straight lines are not defined in today’s mathematics. The relations between them are de- 
termined only by axioms. The line is graphically imaginable as a trace of a point moving in a plane 
along the shortest route between two different points without changing its direction. 

A point is the intersection of two lines. 


2. Closed Half-Line or Ray, and Segment 

A ray is the set of points of a line which are exactly on one side of a given point O, including this point O. 

A ray is imaginable as the trace of a point which starts at O and moves along the line without changing 

its direction, like a beam of light after its emission until it is not led out of its way. 

A segment AB is the set of points of a line lying between two given points A and B of this line, including 

the points A and B. The segment is the shortest connection between the two points A and B ina 
— 

plane. The direction class of a segment is denoted by an arrowhead AB, or its direction starts at the 

first mentioned point A, and ends at the second B. 

3. Parallel and Orthogonal Lines 

Parallel lines run in the same direction; they have no common points, i.e., they do not move off and do 

not approach each other, and they do not have any intersection point. The parallelism of two lines g 

and g' is denoted by g||g' . 


Orthogonal lines form a right angle at their intersection, i.e., they are perpendicular to each other. 
Orthogonality and parallelism are mutual positions of two lines. 


3.1.1.2 Angle 


b 1. Notion of Angle 
B An angle is defined by two rays a and b starting at the same point S, so they can 
be transformed into each other by a rotation (Fig. 3.1). If A is a point on the 
5 a ray a and B is on the ray b, then the angle in the direction given in Fig. 3.1 is 
A denoted by the symbols (a, b) or by 4: AS B, or by a Greek letter. The point S is 
Figure 3.1 called the verter, the rays a and b are called sides or legs of the angle. 


In mathematics, an angle is called positive or negative depending on the rotation being counterclock- 
wise or clockwise respectively. It is important to distinguish the angle X AS B from the angle ¢ BS A. 
Actually, x ASB = — X BSA (0 € X ASB x 180°) and X ASB = 360° — X BSA (180° < X ASB < 
360^) holds. 

Remark: In geodesy the positive direction of rotation is defined by the clockwise direction (see 3.2.2.1, 
p. 144). 

2. Names of Angles 

Angles have different names according to the different positions of their legs. The names given in Ta- 
ble 3.1 are used for angles a in the interval 0 € a < 360° (see also Fig. 3.2). 
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Table 3.1 Names of angles in degree and radian measure 
Names of angles} Degree | Radian |Names of angles Degree Radian | 
round (full) angle |a? = 360°] «a —2-  |right angle o? = 90° a=7/2 | 
convex angle o? > 1805|π < a < 27 |acute angle 05 «o? < 90° [0° < a < 7/2] 
straight angle a = 180° απ [obtuse angle 90° <a < 180°|7/2<a<zT| 
TN δν «Σι Gg 0 
acute angle right obtuse straight convex round (full) 
angle angle angle angle angle 


Figure 3.2 
3.1.1.3 Angle Between Two Intersecting Lines 
At the intersection point of two lines gi, go there are four angles a, 3, γ,ὃ (Fig. 3.3). There are to be 
distinguished adjacent angles, vertex angles, complementary angles, and supplementary angles. 
1. Adjacent Angles are neighboring angles at the intersection point 
of two lines with a common vertex S, and with a common leg; both non- 
common legs are on the same line, they are rays starting from 5$ but in gı 
opposite directions, so adjacent angles sum to 180°. 


W In Fig. 3.3 the pairs (a, 8), (6,7), (η. ὃ) and (a, ὃ) are adjacent angles. ῥα 

2. Vertex Angles are the angles at the intersection point of two lines, VIE, 52 
opposite to each other, having the same vertex S but no common leg, and 

being equal. They sum to 180° by the same adjacent angle. Figure 3.3 


W In Fig. 3.3 (a, y) and (8,6) are vertex angles. 


3. Complementary Angles are two angles summing to 90°. 
4. Supplementary Angles are two angles summing to 180°. 
Β In Fig. 3.3 the pairs of angles (a, B) or (y, ὃ) are supplementary angles. 


3.1.1.4 Pairs of Angles with Intersecting Parallels 

Intersecting two parallel lines pi, pa by a third one g yields eight angles 
(Fig. 3.4). Besides the adjacent and vertex angles with the same vertex 
S , alternate angles, opposite angles, and corresponding (exterior-interior) 
angles are to be distinguished with different vertices. Figure 3.4 


1. Alternate Angles have the same size. They are on the opposite sides of the intersecting line g 
and of the parallel lines pı, po. The legs of alternate angles are in pairs oppositely oriented. 

E In Fig.3.4, e.g., (a1, 19). (£1, 62), (γι, αὐ). (δι, 82) are alternate angles. 

2. Corresponding or Exterior-Interior Angles have the same size. They are on the same sides 
of the intersecting line g and of the parallel lines Ῥι, po. The legs of corresponding angles are oriented 
in pairs in the same direction. 

W In Fig. 3.4 the pairs of angles (αι, a2), (81, 82), (γι. γα). and (01,62) are corresponding angles. 

3. Opposite Angles are on the same side of the intersecting line g but on different sides of the 
parallel lines pı, pa. They sum to 180°. One pair of legs has the same orientation, the other one is 
oppositely oriented. 
W In Fig. 3.4, e.g., the pairs of angles (αι, 2), (£1, 9). (γι. 82), and (δι, a2) are opposite angles. 
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3.1.1.5 Angles Measured in Degrees and in Radians 

In geometry, the measurement of angles is based on the division of the full angle into 360 equal parts 
or 360° (degrees). This is called measure in degrees. Further division of degrees is not a decimal one, 
it 15 sexagesimal: 1° = 60’ (minute, or sexagesimal minute), 1’ = 60" (second, or sexagesimal second). 
For grade measure see 3.2.2.2, p. 146 and the remark below. 

Besides measure in degrees the radian measure is used to define the size of an angle. The size of the 


central angle a of an arbitrary circle (Fig. 3.5a) is given as the ratio of the corresponding arc-length / 
and the radius of the circle r: 
a= 3 (3.1) | 1 rad = 57° 17' 44.8" = 57.2958°, 
a 1° = 0.017453 rad, 
The unit of radian measure is the radian (rad), i.e., the central | 1’ ^ = 0.000291 rad, 
angle belonging to an arc with arclength | equal to the radius | 1" = 0.000005 rad. 


r. In the table you will find approximate conversion values. 
If the measure of the angle is a° in degrees and a in radian measure, then for conversion 
ae π 180° 

= a° with o= : 3.2 
o 180° d" (3:2) 
holds. In particular: 360° = 27, 180° = 7, 90° = 7/2, 270° = 37/2, etc. Formulas (3.2) refer to 
decimal fractions and the following examples show how to make calculations with minutes and seconds. 


a 
o? = ga = 180°-, a= 
π 


W A: Conversion of an angle given in degrees into radian measure: 
52° 37' 23" = 52 - 0.017453 + 37 - 0.000291 + 23 - 0.000005 = 0.918447 rad. 
W B: Conversion of an angle given in radians into degrees: 
5.645 rad = 323 - 0.017453 + 26 - 0.000291 + 5 - 0.000005 = 323° 26' 05". 
The result is getting from: 
5.645:0.017453 = 323 + 0.007611 
0.007611 : 0.000291 = 26 + 0.000025 
0.000025 : 0.000005 = 5. 
The notation rad is usually omitted if it is obvious from the text that the number refers to the radian 
measure of an angle (see also p. 1055). 
Remark: In geodesy a full angle is divided into 400 equal parts, called grades. This is called measure 
in grades. A right angle is 100 gon. A gon is divided into 1000 mgon. 
On calculators the notation DEG is used for degree, GRAD for grade, and RAD for radian. For con- 
version between the different measures see Table 3.5, p. 146. 


9.1.2 Geometrical Definition of Circular and Hyperbolic 
Functions 


3.1.2.1 Definition of Circular or Trigonometric Functions 


1. Definition by the Unit Circle 
The trigonometric functions of an an- 
gle o are defined for both the unit circle 
with radius R = 1 and the acute angles 
of a right-angled triangle (Fig. 3.5a,b) 
with the help of the adjacent side b, op- 
posite side a, and hypotenuse c. In the 
unit circle the measurement of an angle 
is made between a fixed radius O A (with 
length 1) and a moving radius OC coun- 
terclockwise (positive direction): Figure 3.5 
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: f πα. 9 ; : Ap ὃ 
sine : sina = BC = Pu (3.3) cosine : cosa = OB = -, 
: ο 
re -— aw 8 
tangent: tana = AD = b (3.5) cotangent: cota = EF — —, 
a 
= [e —ÀÁ C 
secant : seca — OD — P (3.7) cosecant: | csca = OF — —. 
α 


2. Signs of Trigonometric Functions 


(3.4) 
(3.6) 


(3.8) 


Depending in which quadrant of the unit circle (Fig. 3.5a) the moving radius OC is, the functions 


have well-defined signs which can be taken from Table 2.2 (see p. 79). 


3. Definition of Trigonometric Functions by Area of Circular Sectors 
The functions sin a, cos a, tan a, cot a are de- 
fined by the segments BC, OB, AD of the 
unit circle with R = 1 (Fig. 3.6), where the 
argument is the central angle a = + AOC. 
For this definition one could use also the area 
x of the sector COK, which is denoted in 
Fig. 3.6 by the shaded area. With the cen- 


tral angle 2a measured in radians, one gets 
μας 

m 52a = a for the area with R = 1. 

Therefore, one has the same equations for 

sing = BC, cosx = OB, tanx = AD as 

in (3.3, 3.4, 3.5). Figure 3.6 Figure 3.7 


3.1.2.2 Definitions of the Hyperbolic Functions 


In analogy with the definition of the trigonometric functions in (3.3), (3.4), (3.5) now instead of the 
area of a sector of the unit circle with the equation x? +y? = 1 here the corresponding area of the sector 
of the hyperbola is used with the equation x? — y? = 1 (only the right branch in Fig. 3.7). Denoting 
by x the area of COK, the shaded area in Fig. 3.7, the defining equations of the hyperbolic functions 


are: 


sinh z = BC, (3.9) coshx = OB, (3.10) tanha = AD. 


(3.11) 


Calculating the area x by integration and expressing the results in terms of BC, OB, and AD yields 


x =In(BC 4 VBC’ 4 1) 2 In(OB 4 OB )- in 5, 


(3.12) 


and so, from now on, the hyperbolic functions can be expressed in terms of exponential functions: 


BCs — — sinhz, (3.13a) OB- τς = cosh x, 
AD = pet tanh z. 
et + et 


These equations represent the most popular definition of the hyperbolic functions. 


3.1.3 Plane Triangles 
3.1.3.1 Statements about Plane Triangles 


1. The Sum of Two Sides of a plane triangle is greater than the third one (Fig. 3.8): 


b+e>a. 


(3.13b) 


(3.136) 


(3.14) 
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2. The Sum ofthe Angles of a plane triangle is 

a+ +y= 180°. (3.15) 
3. Unique Determination of Triangles A triangle is uniquely determined by the following data: 
e by three sides or 
ο by two sides and the angle between them or 
e by one side and the two angles on it. 
If two sides and the angle opposite one of them are given, then they define two, one or no triangle (sce 
the third basic problem in Table 3.4, p. 145). 


4. 


b 


Figure 3.8 Figure 3.9 Figure 3.10 


4. Median ofa Triangle is a line connecting a vertex of the triangle with the midpoint of the 
opposite side. The medians of the triangle intersect each other at one point, at the center of gravity of 
the triangle (Fig. 3.10), which divides them in the ratio 2 : 1 counting from the vertex. 

5. Bisector ofa Triangle isa line which divides one of the interior angles into two equal parts. 
The bisectors intersect each other at one point. 

6. Incircle is the circle inscribed in a triangle, i.e., all the sides are tangents of the circle. Its center 
is the intersection point of the bisectors (Fig. 3.9). The radius of the inscribed circle is called the 
apothem or the short radius. 

7. Circumcircle is the circle drawn around a triangle, i.e., passing 

through the vertices of the triangle (Fig. 3.11). Its center is the inter- 


section point of the three right bisectors of the triangle. 
8. Altitude of a Triangle is the perpendicular line that starts at a a a 


vertex and is perpendicular to the opposite side. The altitudes intersect 
each other at one point, the orthocenter. a, 
9. Isosceles Triangle In an isosceles triangle two sides have equal 


length. The altitude, median, and bisector of the third side coincide. For a 
triangle the equality of any two of these sides is enough to make it isosceles. Figure 3.11 


10. Equilateral Triangle In an equilateral triangle with a — b — c the centers of the incircle and 
the circumcircle, the center of gravity, and the orthocenter coincide. 

11. Median isa line connecting two midpoints of sides of a triangle; it is parallel to the third side 
and has the half of length as that side. 

12. Right-Angled Triangle is a triangle that has a right angle (an angle of 90°) (Fig. 3.31, see 
p. 142). 


3.1.3.2 Symmetry 
1. Central Symmetry 


A plane figure is called center symmetric if by a rotation of the plane by 180° around the central point 
or the center of symmetry S it exactly covers itself (Fig. 3.12). Because the size and shape of the 
figure do not change during this transformation, it is called a congruent mapping. Also the sense class 
or orientation class of the plane figure remains the same (Fig. 3.12). Because of the same sense class 
such figures are called directly congruent. 
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The orientation of a figure means the traverse of the boundary of a figure in a direction: positive direc- 
tion, hence counterclockwise, negative direction, hence clockwise (Fig. 3.12, Fig. 3.13). 


C rotation of the plane by 180° A C rotation in the space__C’ 
by 180' around a 
line g 


AY -£E__Vav 


Figure 3.12 Figure 3.13 
2. Axial Symmetry 


A plane figure is called axially symmetric if the corresponding points cover each other after a rotation in 
space by 180? around a line g (Fig. 3.13). The corresponding points have the same distance from the 
axis g, the axis of symmetry. The orientation of the figure is reversed for axial symmetry with respect 
to the line g. Therefore, such figures are called indirectly congruent. This transformation is called a 
reflection in g. Because the size and the shape of the figures do not change, it is also called a indirect 
congruent mapping. The orientation class of the plane figure is reversed under this transformation 
(Fig. 3.13). 

Remark: For space figures hold the analogous statements. 

3. Congruent Triangles, Congruence Theorems 

a) Congruence: Plane figures are called congruent if their size and shape coincide. Congruent fig- 
ures can be transformed into a position of superimposition by the following three transformations, by 
translation, rotation, and reflection, and combinations of these. 

It is to distinguish between directly congruent figures and indirectly congruent figures. Directly con- 
gruent figures can be transformed into a covering position by translation and rotation. Because the 
indirectly congruent figures have a reversed sense class, an axially symmetric transformation with re- 
spect to a line is also needed to transform them into a covering position. 

W Axially symmetric figures are indirectly congruent. To transform them into each other all three 
transformations are needed. 


b) Laws of Congruence for Triangles: Two triangles are congruent if they coincide for: 

e three sides (SSS) or 

e two sides and the angle between them (SAS), or 

e one side and the interior angles on this side (ASA), or 

e two sides and the interior angle being opposite to the longer one (SSA). 

4. Similar Triangles, Similarity Theorems 

Plane figures are called similar if they have the same shape without having the same size. For similar 
figures there is a one-to-one mapping between their points such that every angle in one figure is the 
same as the corresponding angle in the other figure. An equivalent definition is the following: In similar 
figures the length of segments corresponding to each other are proportional. 


a) Similarity of figures requires either the equality of all the corresponding angles or the equality of 
the ratio of all corresponding segments. 

b) Area The areas of similar plane figures are proportional to the square of the ratio of corresponding 
linear elements such as sides, altitudes, diagonals, etc. 

c) Laws of Similarity For triangles the following laws of similarity hold. Triangles are similar if they 
coincide for: 
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e two ratios of sides, 

e two interior angles, 

e the ratio of two sides and the interior angle between them, 

e the ratio of two sides and the interior angle opposite of the longer one. 

Because in the laws of similarity only the equality of ratios of sides is required and not the equality of 
length of sides, therefore the laws of similarity require less than the corresponding laws of congruence. 


5. Intercept Theorems 
The intercept theorems are a consequence of the 
laws of similarity of a triangle. 
1. First Intercept Theorem If two rays start- 
ing at the same point S are intersected by two par- 
allels pı, p2, then the segments of one of the rays 
(Fig. 3.14a) have the same ratio as the correspond- 
ing segments on the other one: 


SP, SP, 
5Qi| |9Q» 
Consequently, every segment on one of the rays is 


proportional to the corresponding segment on the 
other ray. Figure 3.14 


(3.16) 


2. Second Intercept Theorem If two rays starting at the same point S are intersected by two par- 
allels pı, p» , then the segments of the parallels have the same ratio as the corresponding segments on 
the rays (Fig. 9.144): 

| SP | συ SP, | AP 
SQi} |Q1Q» 5Q3|  |Q1Q» 
'The intercept theorems are also valid in the case of intersecting lines at the point S, if the point S is 
between the parallels (Fig. 3.14b). 


(3.17) 


9.1.4 Plane Quadrangles 


3.1.4.1 Parallelogram 

Α quadrangle is called a parallelogram (Fig. 3.15) if it has the following properties: 

e the sides opposite to each other have the same length, 

e the sides opposite to each other are parallel, 

e the diagonals intersect each other at their midpoints, 

e the angles opposite to each other are equal. 

Supposing only one of the previous properties for a quadrangle, or supposing the equality and the 
parallelism of one pair of opposite sides, then all the other properties follow from it. 

The relations between diagonals, sides, and area are the following: 


dj +d3=2(a? +b), (318) h = bsina, (3.19) S=ah. (3.20) 
δα p? 
[à X P VA a 
a a " 


Figure 3.15 Figure 3.16 Figure 3.17 
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3.1.4.2 Rectangle and Square 

A parallelogram is a rectangle (Fig. 3.16), if it has: 

e only right angles, or 

e the diagonals have the same length. 

Only one of these properties is enough, because either of them follows from the other. It is sufficient 
to show that one angle of the parallelogram is a right angle, then all the angles are right angles. If a 
quadrangle has four right angles, it is a rectangle. 

The perimeter U and the area S of a rectangle are: 


U — 2(a 4- 0), (3.21a) S — ab. (3.21b) 


If a = b holds (Fig. 3.17), the rectangle is called a square, and the following formulas 


a ὦ 
d=aV/2~1.4l4a, (9.22) a= ue m 0.707d, (3.33) Sg 3 (3.24) 


3.1.4.3 Rhombus 


A rhombus (Fig. 3.18) is a parallelogram in which 

e all the sides have the same length, or 

e the diagonals are perpendicular to each other, or 

e the diagonals are bisectors of the angles of the parallelogram. 

Any of the previous properties is enough alone; all the others follow from it. For the rhombus 


dy = 2a cos 5, (3.25) d; = 2a sin τ, (3.26) d +d? =4a?. — (337) 


dida 


; (3.28) 


2. 
S — ah — a^sina = 


3.1.4.4 Trapezoid 

A quadrangle is called trapezoid if it has two parallel sides (Fig. 3.19). The parallel sides are called 
bases. With the notation a and b for the bases, h for the altitude and m for the median of the trapezoid 
which connects the midpoints of the two non-parallel sides 


_ h(a + 2b) 


a+b (a +b)h 3(a-- b) 
--------- = mh, (3.30) hs = 3(a+b) ` 


m= 9, (4339) s= 


(3.31) 


The centroid is on the connecting segment of the midpoints of the parallel basis a and b, at a distance hg 
(3.31) from the base a. For the calculation of the coordinates of the centroid by integration see 8.2.2.3, 
p. 506. 

For an isosceles trapezoid with d = c 


S = (a—ccosy)csiny = (b + ccos y)esin y. (3.32) 
P EN 
X X 

a a 

Figure 3.18 Figure 3.19 Figure 3.20 


3.1.4.5 General Quadrangle 
A closed plane figure bounded by four straight line segments is called a general quadrangle. If the 
diagonals lie fully inside the quadrangle, it is convex, otherwise concave. The general quadrangle is 
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divisible by two diagonals dı, d» in two triangles (Fig. 3.20). Therefore, in every quadrangle the sum 
of the interior angles is 2 - 180° = 360°: 


4 
Ya; = 360°. (3.33) 
i=1 


The length of the segment m connecting the midpoints of the diagonals (Fig. 3.20) is given by 
HPHHP +P = BHE +m. (3.34) 
The area of the general quadrangle is 


Y 
S= 7414 sina. (3.35) 


3.1.4.6 Inscribed Quadrangle 

A quadrangle which can be circumscribed by a circumcircle is called an inscribed quadrangle (Fig. 
3.21a) and its sides are chords of this circle. A quadrangle is an inscribed quadrangle if and only if the 
sums of its opposite angles are 180°: 


a+ty=8+6 = 180°. (3.36 
Ptolemy’s theorem is valid for the insribed quadrangle: 
ac + bd = dida. (3.37 
The radius of the circumcircle of an inscribed quadrangle is 
a 
1 
F E R= τον (ab + cd)(ac + bd)(ad + cb) . (3.38 
The diagonals can be calculated by the formulas 
(ac + bd)(ab + cd) i 
; me ad + bc (8:398, 
b) (ac + bd)(ad + be 
dy ; (3.39b 
Figure 3.21 abea 


1 
The area can be expressed in terms of the half-perimeter of the quadrangle s = σία +b+c+d): 


S = /(s—a)(s—b)(s—0)(s—d). (3.40) 


If the inscribed quadrangle is also a circumscribing quadrangle (see Fig. 3.21 and 3.1.4.7), then 
S = Vabcd. (3.41) 
3.1.4.7 Circumscribing Quadrangle 


If a quadrangle has an inscribed circle (Fig. 3.21b), then it is called a circumscribing quadrangle, and 
the sides are tangents to the circle. A quadrangle has an inscribed circle if and only if the sum of the 
lengths of the opposite sides are equal, and this sum is also equal to the half-perimeter s: 


s=5(atbte+d)=ate=b4d (3.42) 


The area of the circumscribing quadrangle is 


S = (a + c)r = (b + d)r, (3.43) 
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where r is the radius of the inscribed circle. 
3.1.5 Polygons in the Plane 
3.1.5.1 General Polygon 


A closed plane figure bounded by straight-line segments as its sides, can be decomposed into n — 2 
triangles (Fig. 3.22). The sums of the exterior angles β;, and of the interior angles γι, and the number 
of diagonals D are 


n 


ΣΤΗ, = 360°, (344) Στη = 180°(n — 2), (3.45) pa Nee 


i=1 i=1 2 


(3.46) 


Figure 3.22 Figure 3.23 


3.1.5.2 Regular Convex Polygons 

Regular convex polygons (Fig. 3.23) have n equal sides and n equal angles. The intersection point of 
the mid-perpendiculars of the sides is the center M of the inscribed and of the circumscribed circle with 
radii r and R, respectively. The sides of these polygons are tangents to the inscribed circle and chords 
of the the circumcircle. They form a circumscribing polygon or tangent polygon for the inscribed circle 
and a inscribed polygon in the circumcircle. The decomposition of a regular convex n gon (regular 
convex polygon) results in τι isosceles congruent triangles around the center Μ. 


ὙΠΟ 9 
Central Angle yn = 200 . (3.47) Base Angle απ = (1 - $) - 90°. (3.48 
n 
i ; 360° : o 
Exterior Angle Bn = . (3.49) Interior Angle γι = 180° — bn. (3.50 
n 
n p 5 1, 
Circumcircle Radius R= a , Bart im δ (3.51 
2sin 
n 1805 180° 
Inscribed Circle Radius r= cot m R cos τ : (3.52 
2 n n 
Side Length a, = 2V R? — r? = 2Rsin = = 2r tan 5 .3.53) | Perimeter U = παν. (3.54 
. απ 3 a2, 
Side Length of the 2n gon dyn = Ry} 2— 24/1 (5 ) > Gn d». 4 ml (3.55 
1 2,..¥n 1 p. l 2 ᾧπ 
Area of the n gon Sn = -na,r = nr*tan z jnE sin Yn = πας cot 3 (3.56 


2 
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nR? 452 $2, 
Area of the 2n gon Son = Ji h Vp Sn = δυη 1 — air . (3.57) 


3.1.5.3 Some Regular Convex Polygons 

The properties of some regular convex polygons are collected in Table 3.2. 

'The pentagon and the pentagram deserve special attention since it is presumed that Hippasos of Meta- 
pontum (ca. 450 BC) recognized irrational numbers by the properties of these polygons (see 1.1.1.2, 
p. 2). A discussion follows in the example: 


Β The diagonals of a regular pentagon (Fig. 3.24) form an inscribed 
pentagram. Its sides enclose a regular pentagon again. In a regular pen- A 
tagon, the proportion of a diagonal and a side is equal to the proportion 
of a side and the (diagonal minus side): αρ: αι = αι : (ag — αι) = a1 : a2, 
where ay = ag — αι. 

Considering smaller and smaller nested pentagons with a3 = a; — a2, a4 = 


dg — à3,... and az < d4,03 < 43,04 < a3..., yields ag : ay αι : a9 = 
ü2 : dg = a3 : a4 = +. The Euclidean algorithm for αρ and a; never 
breaks off, since ay = 1-a, +a, a, = 1- a2 +43, a2 = 1-a3+4a4,..., hence 


qn = 1. The side αι and diagonal αρ of the regular pentagon are incom- 
mensurable. The continued fraction determined by αρ: αι is identical to 
the golden section in BI B, 1.1.1.4, 3., p. 4, i.e., it results in an irrational Figure 3.24 
number. 


3.1.6 The Circle and Related Shapes 
3.1.6.1 Circle 


The circle is the locus of the points in a plane which are at the same given distance from a given point, 
the center of the circle. The distance itself, and also the line segment connecting the center with any 
point of the circle, is called the radius. The circumference or periphery of the circle encloses the area of 
the circle. A line segment connecting two points of the circle is called a chord. A line passing through 
two points of a circle is called a secant. Lines having exactly one common point with the circle are called 
tangent lines or tangents of the circle. 


Chord Theorem (Fig. 3.26) ΑΟ: AD = ΑΒ. AE = r? — πι. 
Secant Theorem (Fig. 3.27 AB. AE = AC. ΑΡ = m — r°. 


Secant-Tangent Theorem (Fig. 3.27) AT’ = AB. AE = AC- AD = m? — r°. 
Perimeter U —2zr26,283r, U — de 3,142d, U —2vmS ~ 3,545 V/ S. 


ΤΑ 
" Ç g LA 
ον, κ 7 


D 


Figure 3.25 Figure 3.26 Figure 3.27 
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Table 3.2 Properties of some regular polygons 


Gn R T S 
. c " a3 2 a3 R 1 a2 - 3R? 
3gon | az = RV3 = 2rv3 3 V3 = 2r 3^ 6 V3 2 3^ 1 v3 1 V3 
_ ag i 2 93,2, fz 
h= ον ὃ ο ὁ ΠΕ 
R 5 5 3 
5 gon as = 510 - 2V5 = 1650 + 10V5 = T8V25--10V8 | = "E25. 10V5 
R 5R? 
=2ry5- 2V5 | --τ(νὸ -- 1) = vor) ae 10 +2V5 
= 5r?\/5 — 25 
2 2 2 2 
6 gon | ag = ἀν ut 3 = £ 3 = TA VS = E V3 
= 253 
8gon | ag = Ry2— V2 =F 4422 = (241) = 2a2(v2 + 1) 
—2r(y2—1) |=ry4- 2V2 BEN ENT. =2R?V/2 
= δν (/2 -1) 
10 gon ao = (V8 1) = Fis F1) => 54-25 = 5+ 2V5 
= z V25 10/8 | = T 50— 10/5 - v1 -2/5  |- 77-10-25 
5 F 
= 2r?y25 — 10ν5 
, j P : 1 
πο ο M, s= cove, S. (3.62 
4 4 
; U . 
Radius r— ΞΡ 0.159U. (3.63) Diameter d= 2r= af e 1128/9. (3.64 
π 
For the following formulas with angles see the definition of the angle in 3.1.1.2, p. 129. 
ies Τ 1 
Angle of Circumference (Fig. 3.25) a=5 BC= 5 xB0C = 39- (3.65a 
A Special Case is the Theorem of Thales (see p. 142) y = 180°, ie, a = 90°. (3.65b 
lc Ἱ 
Angle Between a Chord and a Tangent (Fig. 9.25) 8 = 5 AC= 7 ΧΟΌΑ. (3.06 
LY: ae: i 
Interior Angle (Fig. 3.26) y- (CB + ED) = σα BOC + X EOD). (3.67 
1. αφ. I 
Exterior Angle (Fig. 3.27) a= (DE - BC)= a(t E0C — COB). (3.68 


Lee 73 i 
Angle Between Secant and Tangent (Fig. 3.27) 6 = σα ΕΤΕ) = 5 (t TOE — X BOT).(3.69 
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Inscribed Angle (Fig. 3.28), D and E are arbitrary points on the arcs to the left and to the right. 


a= 5(BDC CEB) = σα BOC ~ Xx C0B) 
z 5 (360° XCOB — XCOB) = 180° — X COB. (3.70) 


3.1.6.2 Circular Segment and Circular Sector 
Defining quantities: Radius r and central angle a (Fig. 3.29). The amounts to determine are: 


Chord a = 2V2hr — h? = 2r sin 5 . (3.71 
Central Angle a = 2arcsin = (α measured in degrees). (3.72 
= 
i : x a α a a 
Height of the Circular Segment h =r r? "eid (1 cos z) E tan "E (3.73 
2πτα τ r ; 
Arc Length l = 360 ^ 0.01745ra (ain radian measure) (3.74a 
b—a 1 
TN 5 T oor lx eam. 


i ~ τν 


Figure 3.28 Figure 3.29 Figure 3.30 


tra 


Area of the Sector S= sey ~ 0.00873r?a. (3.75) 


2 
Area of Circular Segment S = 7 (= sin a) x: ids a(r—h), S% Js 4-8b). (3.76) 
2 X180 2 15 


3.1.6.3 Annulus 


Defining quantities of an annulus: Exterior radius R, interior radius r and central angle ϱ (Fig. 3.30). 


Exterior Diameter D —2R. (3.77) Interior Diameter d — 2r. (3.78 
p RET " ς 

Mean Radius pe (3.79) Breadth ofthe Annulus  ó— R-—r. (3.80 

Area of the Annulus S = T(R? — r°) = το" d?) = 21 po. (3.81 


Area of an Annulus Sector for a Central Angle ¢ (shaded area in Fig. 3.30) 


Sy = = (m di = τη (D° d?) = D (yin radian measure). (3.82 
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3.2 Plane Trigonometry 


3.2.1 Triangles 
3.2.1.1 Calculations in Right-Angled Triangles in the Plane 


1. Basic Formulas 

Notation (Fig. 3.31): 

c hypotenuse; a, b other sides, or legs of the right angle; a and 8 
the angles opposite to the sides a and b respectively; h altitude; 
p, q hypotenuse segments; S area. 


Sum of Angles a+ 8+7=180° with y = 90°, (3.83) 
Calculation of Sides a = csina = ccos B 

= btana = bcot B, (3.84) 
Pythagoras Theorem a? 4 — c. (3.85) 


'Thales Theorem The vertex angles of all triangles in a semicir- 

cle with hypotenuse as the base are right angles, i.e., all angles 

at circumference in this semicircle are right angles (see Fig. 3.32 

and (3.65b), p. 140). 

Euclidean Theorems h?=pq, a?=pc, b --ης, (3.86) 
ab a? 


2 
Area S- wc zta B= sin 20. (3.87) Figure 3.32 


2. Calculation of Sides and Angles of a Right-Angled Triangle in the Plane 
In a right-angled triangle among the 
six defining quantities (three angles 
a, 3,7 and the sides opposite to them 
a,b,c, which are not all independent, of 
course), one angle, in Fig. 3.31 the an- 


Table 3.3 Defining quantities of a right-angled triangle 
in the plane 


Given Calculation of the other quantities 


Jsa Ἱειρίσρη neon? a 
gle 7, is given 88 90°. . eg. a,a | 8 = 90° — a | b=acota | c= — 
A plane triangle can be determined by sina 
three defining quantities but these can- . ctus - Ob 

: RONDA j e.g. b,a | 8 = 90° — a | a = btana | c = 
not be given arbitrarily (see 3.1.3.1, cosa 


p. 133). So, in the case of a right-angled 
triangle only two more quantities can be 
given. The remaining three quantities 
can be determined from Table 3.3 and 
(3.15) and (3.83). 


3.2.1.2 Calculations in General (Oblique) Triangles in the Plane 


1. Basic Formulas 
Notation (Fig. 3.33): a, b, c sides; a, 3, y the angles opposite to them; S area; R radius of the circum- 


, i -— a+b+c . 
circle; r radius of the incircle; s — —— half of the perimeter. 


e.g. ca | 8 =90°-a|a=csina |b = ccosa 


a a 
-= tana c= — B=90°-a 
b sina 


e.g. a,b 


Cyclic Permutation 
Because an oblique triangle has no distinguishing side or angle, from every formula containing the 


3.2 Plane Trigonometry 145 


Κ᾽ 
Se 


C 
Figure 3.33 Figure 3.34 Figure 3.35 LE 


sides and angles it is possible to get two further formulas by cyclic permutation the sides and angles 
according to Fig. 3.34. 


o 


pM 


CO. 


( 


3 b sng > si 
B From 2 = = (sine law) one gets by cyclic permutation: - = a ] f- EN 
b sing c sing? a sina 
b c 
Sine Law E te ccu LH (3.88 
sina sinf siny 
Projection Rule (see Fig. 3.35) c= acos +bcosa. (3.89 
Cosine Law or Pythagoras Theorem in General Triangles œ = a? + b? — 2ab cos y . (3.90 
Mollweide Equations 
ν--β ν--β 
(a+b) sin 3 = 60058 (5 5 ) , (391a) (a — b) cos =csin (< 5 ) . (391b 
ia ac B 
b an 
Tangent Law = = (3.92 
i tan 
—b)(s— 
Half-Angle Formulas tan E $n) (3.93 
s(s—aq) 
in 8 ji 
Tangent Formula tana = iau Soe, (3.94 
c—acosB b—acosy 
Additional Relations 
(3.95a) du em (3.95b 
2 bc 
Height Corresponding to the Side a ha = bsin y = csinB. (3.96 
1 
Median of the Side a Ma = ον ἐλ + c? -- 2becosa. (3.97 
2bc cos Ž 
Bisector of the Angle a jm. (3.98 
bcc 
: ‘ x a b c 
Radius of the Circumcircle R = (3.99 


2sinoa 2538  2siny- 
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s—a)(s—b)(s—c ay B8 
Radius of the Incircle r — (= aleae) stan : tan Ξ tan t (3.100) 
8 
(B 
AR sin s sin 5 sin; (3.101) 
1 : 
Area S= jabsin y = 2R’sinasinBsiny=rs= Vs(s a)(s — b)(s— c). (3.102) 


The formula S = γα a)(s — b)(s — c) is called Heron’s formula. 


2. Calculation of Sides, Angles, and Area in General Triangles 
According to the congruence theorems (see 3.1.3.2, p. 134) a triangle is determined by three independent 
quantities, among which there must be at least one side. 


From here follow the four so-called basic problems. If from the six defining quantities (three angles 
a, β.η and the sides opposite to them a, b, c) three independent quantities are given, one can calculate 
the remaining three with the equations in Table 3.4, p. 145. 
In contrast to spherical trigonometry (see the second basic problem, Table 3.9, p. 172) in a plane 
triangle there is no way to get any side only from the angles. 


3.2.2 Geodesic Applications 
3.2.2.1 Geodesic Coordinates 


In geometry usually right-handed coordinate systems are used to determine points in plane or space 
(Fig. 3.170). In contrast with this, in geodesy left-handed coordinate systems are in use. 


1. Geodesic Rectangular Coordinates 

In a plane left-handed rectangular coordinate system (Fig. 3.37) the x-axis of abscissae is shown up- 
ward, the y-axis of ordinates is shown to the right. A point P has coordinates yp, xp. The orientation 
of the x-axis follows from practical reasons. When measuring long distances, for which mostly the 
Soldner- or the Gauss-Krueger coordinate systems are in use (see 3.4.1.2, p. 162), the positive x-axis 
points to grid North, the y-axis oriented to the right points to East. The enumeration of the quadrants 
follows a clockwise direction in contrast with the usual practice in geometry (Fig. 3.37, Fig. 3.38). 


If besides the position of a point in the plane also its altitude is to be considered, one can use a three- 
dimensional left-handed rectangular coordinate system (y, x, z), where the z-axis points to the zenith 


(Fig. 3.36). 


Figure 3.36 Figure 3.37 Figure 3.38 


2. Geodesic Polar Coordinates 

In the left-handed plane polar coordinate system of geodesy (Fig. 3.38) a point P is given by the 
directional (azimuthal) angle t between the axis of abscissae and the line segment s, and by the length of 
the line segment s between the point and the origin (called the pole). In geodesy the positive orientation 
of an angle is the clockwise direction. 

To determine the altitude the zenith distance ¢ can be used or the vertical angle respectively the angle 
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of tilt a. In Fig. 3.36 is shown that in a three-dimensional rectangular left-handed coordinate system 
(see also left and right-handed coordinate systems 3.5.3.1, 2., p. 209), the zenith distance is measured 
between the zenith axis z and the line segment s, the angle of tilt between the line segment s and its 
perpendicular projection on the y, x plane. 


Table 3.4 Defining quantities of a general triangle, basic problems 


Given Formulas for calculating the other quantities 
sin 8 
l. lsideand2 y = 180? — o — f, = mE A 
angles (a, a, B) asiny sina 
ε-----, S=-—absiny. 
sina 2 
α-β a-b at B 1 
2. 2sides and the tan -= 5 cot UA —=90°-=7; 
angle between 2 at 2 2 2 
them (a, b, y) o and B come from a + B and a — f, 
sinq 1 
gamn S= -absiny. 
sina 2 
; ; bsina 
3. 2sides and the sin f = ; 
angle opposite . de utes . 
one of them If a > b holds, then 8 < 90° and is uniquely determined. If 
(a, b, α) a « b holds, the following cases occur: 


1. B has two values for bsina < a (82 = 180° — βι). 
2. B has exactly one value (90°) for bsin a = a. 


3. For bsina > a there is no such triangle. 


Si 1 
y = 180? — (a+ 8), ja S-—c-absiny. 
sina 2 
s—a)(s—b)(s—c 
4. 3 sides (a,b, c) r= ΕΘ > 
E 
a T B r y r 
tan = = , tan== , tan== ; 
2 s-a 2 s-—b 2 s—c 


S=rs= ass a)(s — b)(s — c). 


3. Scale 
In cartography the scale factor M is the ratio of a segment sj in a coordinate system K with respec 
to the corresponding segment sxo in another coordinate system K. 
1. Conversion of Segments With m as a modulus or scale and N as an index for the nature and K 
as the index of the map holds: 

M-—1:m-sg:sw. (3.103a 
For two segments sj, $i» with different moduli m, mg yields: 

Ski: SK2 = mo: Mı. (3.103b 
2. Conversion of Areas If the areas are calculated according to the formulas Fx = agbg, Fy = 
ayby, then: 

Fy = Fym?. (3.104a 
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For two areas F1, F> with different moduli mi, m2 : 
Fri: Fro = m2 : må. (3.104b) 


3.2.2.2 Angles in Geodesy 


1. Gradeor Gon Division 

In geodesy, in contrast to mathematics (see 3.1.1.5, p. 131) the gon measure is in use as a unit for angles. 
The perigon or full angle corresponds here to 400 grade or gon. The conversion between degrees and 
gons can be performed by the formulas in Table 3.5: 


'Table 3.5 Grade and Gon Division 


lFulangle =360° --2πτοὶ = 400gon 
lrightangle =90° = 5 rad = 100 gon 

π 
1 gon = 200 rad = 1000 mgon 


2. Directional Angle 

The directional angle t at a point P gives the direction of an oriented line segment with respect to a 
line passing through the point P parallel to the x-axis (see point A and the directional angle {αρ in 
Fig. 3.39). Because the measuring of angles in geodesy is made in a clockwise direction (Fig. 3.37, 
Fig. 3.38), the quadrants are enumerated in the opposite order to the right-handed Cartesian coordi- 
nate system of plane trigonometry (Table 3.6). The formulas of plane trigonometry are valid without 
change. 


Table 3.6 Directional angle in a segment with correct sign for arctan 


| Quadrant I II ΠῚ IV 

| Sign of numerator + - = + 
A 

| =e tan > 0 tan < 0 tan > 0 tan < 0 
Δα 

| Directional angle { to gon | to + 200 gon | to + 200 gon | to + 400 gon 


3. Coordinate Transformations 


1. Calculation of Polar Coordinates from Rectangular Coordinates For two points A(yA, 4) 
and B(yg, vg) in a rectangular coordinate system (Fig. 3.39) with the segment s4g oriented from A 
to B and the directional angles t 4p, tga the following formulas are valid: 


UB — | Δι 
πο (3.105a) san = V Aga + Ain, (3.105b) 
tp—x,4 Aap * d 


AYAB 
ATAB ᾿ 


{απ {αρ = 


(3.105c) ἐπα = tap + 200 gon. (3.105d) 


N 
The quadrant of the angle t depends on the signs of Ay4p and Azr4g. If using a calculator A is 
1 


punched in with the correct signs for Ay and Az, then one gets an angle tọ by pressing the arctan 
button to which is to add the gon-value given in Table 3.6 according to the corresponding quadrant. 
2. Calculation of Rectangular Coordinates from Distances and Angles In a rectangular co- 
ordinate system are to determine the coordinates of a point C by measuring in a local polar system 
(Fig. 3.40). 
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Given: ya,v4;ye,vp. Measured: a,sgc. Find: yc, rc. 


Solution: 
Ayap i 
tanta4g = (3.106a) tec = tap +a+200gon, (3.106b 
AX AB 
Uc = YB t SBC sin tgc, (3.106c) πο Tp + Spc cos tege. (3. 06d 


If also s4p is measured, then the difference between the locally measured distance and the distance 
computed from the coordinates can be considered by multiplying with the scale factor q, where q mus 
be very close to 1: 


calculated distance 


measured distance 


yo = yn + Sgoqsintpo, (3.107b) 


Figure 3.39 Figure 3.40 


3. Coordinate Transformation Between Two Rectangular Coordinate Systems In order to 
locate a given point on a country-map the local system y’, x’ is to be transformed into the system y, x of 
coordinates of the map (Fig. 3.41). The system y’, x’ is rotated into y, x by an angle o and is translated 
parallel by yo, xo. The directional angles in the system y’, 2’ are denoted by Ὁ. The coordinates of A and 
B are given in both systems and the coordinates of a point C in the z^, y'-system. The transformation 
is given by the following relations: 


sap = y Aug + Aag , (3.1082) Sp = y Ayfg + Aag , (3.108b 


E 
c= — (3.108c) p —tap — Uap, (3.108d 
SAB 
Ay, Δι, 
tantap = ea (3.108e) tan Jap = Um (3.108f 
Yo = ya — qxasin y — qya cosy, (3.108g) To = tA +qyasinp—qracosy, (3.108h 
yo = ya + qsin (vc — 2/4) + q cos e(yo — Ya), (3.108i 
zo = z4 + qcose(at — 24) — qsin e(t — ya): (3.108j 
Remark: The following two formulas can be used as a check. 
Uc = VAT qS4c sin(p4- 4c), (3.108k) zo = vAdqs4ocos(p4- 4c). (3.1081 
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If the segment AB is on the z'-axis, the formulas reduce to 


A Är 

u= -= = qsin p, (3.109a) b= = = qcos 9, (3.109b) 
UB Tg 

yo = ya tare + bye, (3.109c) To = T4 + bto — ayo, (3.109d) 

Uc = Ayacb — Ataca, (3.1096) vg A-Acb + Ayaca. (3.109f) 


Figure 3.41 Figure 3.42 


3.2.2.3 Applications in Surveying 

The determination of the coordinates of a point N to be fixed by triangulation is a frequent measuring 
problem in geodesy. The methods of solving it are intersection, three-point resection, arc intersection, 
free stationing and traversing. 'The last two methods are not discussed here. 

1. Intersection 

1. Intersection of Two Oriented Lines or first fundamental problem of triangulation: To deter- 
mine a point N by two given points A and B with the help of a triangle ABN (Fig. 3.42). 

Given: y4, x4; Yg, xp. Measured: a, D, if it is possible also y or y = 200 gon —a — b. Find: yy, zw. 
Solution: 


ΔΙ 2 ; 
tantapg = A (3.110a) SAB = V Aun T Az pg = |A^yAB sin {αμ + [Ax AB cos tag] ,(3.110b) 


TAB k 
sina sina (3.110c) sin 8 sin 8 (3.1104) 
SBN — 58 =s , (3.110c SAN = SAB——— = SAB— , (3.1100 
BN PAB iny ^P sin (a +)’ ΡΝ γ "AB Sin (a+ B) 
tan —lap— Q, (3.1106) tgn = tga + B = tag + B + 200 gon, (3.110f) 
UN = Ya + san Sin tan = YB + SBN Sintpw, (3.110g) 
YN = πα + SAN COStan = vB + 5px COStBN. (3.110h) 


2. Intersection Problem for Non-Visible B Ifthe point B cannot be seen from A, the directional 
angles tay and tgy are to be determined with respect to reference directions to other visible points D 
and E whose coordinates are known (Fig. 3.43). 

Given: y4, t4; YB, B; YD, Xp; Ye, tg. Measured: ὁ in A, £ in B, and if it is possible, also 7. 

Find: YN, TN. 
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Figure 3.43 


Solution: Reducing to the first fundamental problem, calculating tant4g, according to (3.110b 


yields: 
A 
tantap = AL (3.1112) 
AD 
tan = tap + Ô, (3.111c) 
a=tyap- tan, (3.111e) 
ΔΙ 
tantay = m (3.111g) 
‘NA 
Ayspat+xatantan — Tg tantgy 
XN = 


tantan — tan tgn 


2. Three-Point Resection 


Figure 3.44 


tantpe — DUEB. (3.111b 
ATEB 

tpn =tpeté, (3.111d 

B=tpn — tpa, (3.111 

tantpy = Aye : (3.111h 
ATyNB 


UN = yp + (zy — £g) tantgy. (3.111j) 


1. Snellius Problem of Three-Point Resection or to determine a point N by three given points 
A, B,C; also called the second fundamental problem of triangulation (Fig. 3.44): 


Given: y4, t4; Up, £B; Yc, zo. Measured: δι, δ in N. Find: yy, £y. 


Solution: 


Ayac 
Azac’ 


tantac = 


Ayac Ax Ac 
a= κ. 


sin tac cos tac i 


Ύ tca — toB = tac — tec, (3.112e) 


(3.112a) 


(3.112c) 


Δψρο 

tantgo = reum (3.112b) 
Δι Arse 

jo He ee (3.1124) 
sin tge cos tge 

TERN, |. db 

eae — 189 — 7+ - a (3.1121) 


The equality (3.112f) is a first condition to determine and v. A second condition one gets from 


(2.114) and (2.115), p. 83: 


sin y + sin v T 
? τωρ PP. cot 


e-v 


sin y — sin v 2 2 


(3.112g) 
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With the sine law (3.88) follows 


sin y SCN sin v SON 
= us = 3.1121 
sin δι a sin δρ b ( 1) 
putting into (3.112g) gives 
a v ne +y. Ein p= sin Y S +y. Bein ói—a sin δ) (3.1121) 
2 sin y + sin v 2 bsinó; + asin δ) 
pa 
From (3.1121) one gets PY and together with (3.112f) 
perth e-v , οτι φ-ύΨ ; 
ied 4 T5 p = 5 2 (3.1125) 
From here one can determine the following line segments and points: 
Le b 
SAN — sin(ó ; 3.112k SBN = — —- sin(6s + Y 
η sin(, + €), ( ) SBN πρ βἰη(δ» + v), (3.1121) 
a, A 
SCN sin siny, (3.112m) 
inó, in δρ 
XN = XA-d-SAwCOSÍAN = TB + Sgy COSÉpN, (3.112n) 
YN = YA + SAN sin tAN = Yg + SBN sin tgy. (3.1120) 


2. Three-Point Resection by Cassini 


Given: y4, x4; YB, tp; yc, tc. Measured: δι, ὃν in N. Find: yy, ry. 


For this method two reference points P and Q are to be used, which are on the reference circles passing 


through A, C, P and B, C,Q so that 
circles Hı and H» are at the intersec 


both are on a line containing N (Fig. 3.45). The centers of the 
ion points of the mid perpendicular of AC and of BC with the 


segments PC and QC . The angles δι, 62 measured at N appear also at P and Q as angles of circum- 


ference. 
Solution: 


UP = ya + (xc — v4) cot ô, 


Yo = YB + (xp — πο) cot da, 


UN = yp + (ty — vp) tan tpo 


YN = yo — (αν — πο) cot tpo 


Dangerous Circle: When choosing 


3.113a) zp = xa + (yc —ya)cotô, (9.199) 
3.113c) tq = £g + (yg — yc) cot ôz,  (3.113d) 
3.1130) ay = ap 10 9h (fo trotter (S agr 


tantpo + cot tpo 


(tan tpo < cot tpo), (3.113g) 
(3.113h) 


the points it is to be ensured that they do not lie on one circle, 


(cot tpo < tan tpo) 


because then there is no solution; one talks about a so-called dangerous circle. The closer the points 
are near to the dangerous circle, the less gets the accuracy of the method. 


3. Arc Intersection 


With this method a so-called new point N is to be determined as intersection point of two arcs around 
two points A and B with known coordinates and with measured radii say and sgy (Fig. 3.46). The 
unknown line segment s 4g is calculated and the angles are to be calculated from the now already known 


three sides of the triangle ABN. 


A second solution — not discussed here — starts from the decomposition of the general triangle into two 
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Figure 3.45 Figure 3.46 


right-angled triangles. 
Given: y4, $4 : yg, xp. Measured: say; spy. Find: s4p,yn,tn- 


Solution: 
2 Ρ AYAB 
sap = \Ayip t+ Δαλῃ, (3.114a) tantas = , (3.114b 
° g ATAB 
tga = tap + 200 gon, (3.114c 
2 2 2 3 53. οἳ 
cosa = TAN SABT SEN — qo dail cosg = PNT TAB SAN {3γ14ε 
25ANSAB 25BNSAB 
tan =tap—a, (3.114f) tay — tpA — P, (3.114g 
UN = Yat SAN Sintan, (3.114h) ΖΝ = TA + SAN COSÍAN, (3.114i 
UN = YBT SBN SIN ΗΝ. (3.114j) XN = tpg-spNCOSÍpBwN. (3.1 4k 
3.3 Stereometry 
3.3.1 Lines and Planes in Space 
Z. 1. Two Lines Two lines in the same plane have either one or no common 
point. In the second case they are parallel. If there is no plane such that 


it contains both lines, they are skew lines. The angle between two skew 

linesis defined by the angle between two lines parallel to them and passing 

through a common point (Fig. 3.47). The distance between two skew 

lines is defined by the segment which is perpendicular to both of them. 

(There is always a unique transversal line which is perpendicular to both 
Figure 3.47 skew lines and intersects them, too.) 


2. Two Planes Two planes intersect each other in a line or they have no common point. In this 
second case they are parallel. If two planes are perpendicular to the same line or if both are parallel to 
every intersecting pair of lines in the other, the planes are parallel. 
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3. Line and Plane A line can be incident to a plane, it 
can have one or no common point with the plane. In this last 
case it is parallel to the plane. The angle between a line and a 

plane is measured by the angle between the line and its orthog- 
/ «| — onal projection on the plane (Fig. 3.48). If this projection is 
only a point, i.e., if the line is perpendicular to two different 
intersecting lines in the plane, then the line is perpendicular or 
Figure 3.48 orthogonal to the plane. 


| 3.3.2 Edge, Corner, Solid Angle 


1. Edge or dihedral angle is a figure formed by two infinite half-planes starting at the 
same line (Fig.3.49). In everyday terminology the word edge is used for the intersec- 
tion line of the two half-planes. As a measure of edges, one uses the edge-angle ABC, 
CoB the angle between two half-lines lying in the half-planes and being perpendicular to 
^ the intersection line DE at a point B. 
2. Corner or polyhedral angle OABC DE (Fig. 3.50) is a figure formed by several 
E planes, the lateral faces, which go through a common point, the vertex 0, and intersect 
L^ each other at the lines 0A, 0B, .... 
Two lines which bound the same lateral face form a plane angle, while neighboring 
Figure 3.49 faces form a dihedral angle. 


Polyhedra are equal to each other, i.e., they are congruent, if they are superposable. For this the corre- 
sponding elements, i.e., the edges and plane angles at the vertex must be coincident. If the correspond- 
ing elements at the vertex are equal, but they have an opposite order of sequence, the corners are not 
superposable, and they are called symmetric corners, because they can be brought into a symmetric 
position as shown in Fig. 3.51. 

A convex polyhedral angle lies completely on one side of each of its faces. 

The sum of the plane angles AOB + BOC + ... + E0A (Fig. 3.50) is less than 360° for every convex 
polyhedron. 


3. Trihedral Angles are congruent if they coincide in the following elements: 
in two faces and the corresponding dihedral angle, 

in one face and both dihedral angles belonging to it, 

in three corresponding faces in the same order of sequence, 


9 
9 
9 
e in three corresponding dihedral angles in the same order of sequence. 


B 


Solid Angle A pencil of rays starting from the same point (and intersecting a closed curve) forms 
a solid angle in space (Fig. 3.52). It is denoted by Q and calculated by the equality 
S 
Q= Ho (3.115a) 
Here S means the piece of the spherical surface cut out by the solid angle from a sphere whose radius is 
r, and whose center is at the vertex of the solid angle. The unit of solid angle is the steradian (sr) (see 
also p. 1055): 


1n? 


lsr= 25 
Im“ 


(3.115b) 


i.e., a solid angle of 1 sr cuts out a surface area of 1 m? of the unit sphere (r = 1m). 

Bl A: The full solid angle is Q = 4rr?/r? = 4r. 

W B: A cone with a vertex angle (also called an apex angle) a = 120° defines(determines) a solid angle 
Q = 2zr?(1— cos(a/2))/r? = v, where the formula for a spherical cap (3.163)has been used. 
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SA. 


Figure 3.50 Figure 3.51 Figure 3.52 Figure 3.53 
3.3.3 Polyeder or Polyhedron 


In this paragraph the following notations are used : V volume, 5 total surface, M lateral area, h alti- 
tude, Ag base area. 


1. Polyhedron is a solid bounded by plane polygons. 
2. Prism (Fig. 3.53) is a polyhedron with two congruent bases, and having parallelograms as ad- 
ditional faces. A right prism has edges perpendicular to the base, a regular prism is a right prism with 
a regular polygon as the base. For the prism 

V = Ach, (3.116) M — pl, (3.117) S=M+2Ag. (3.118) 
holds. Here p is the perimeter of the section perpendicular to the edges, and / is the length of the edges. 
If the edges are still parallel to each other but the bases are not, the lateral faces are trapezoids. If the 
bases of a triangular prism are not parallel to each other, its volume can be calculated by the formula 
(Fig. 3.54): 
(a+b+c)Q 
NES NÉ 
where Q is a perpendicular cut, a, b, and c are the lengths of the parallel edges. If the bases of the prism 
are not parallel, then its volume is 


V -1Q, (3.120) 


where | is the length of the line segment BC connecting the centers of gravity of the bases, and Q is the 
cross-cut perpendicular to this line. 


qi 
Lf} 
A 


Figure 3.54 Figure 3.55 Figure 3.56 


V= (3.119) 


3. Parallelepiped is a prism with parallelograms as bases (Fig. 3.55), i.e., it is bounded by six 
parallelograms. In a parallelepiped all the four body diagonals intersect each other at the same point, 
the midpoint, and halve each other. 

4. Rectangular Parallelepiped or block is a right parallelepiped with rectangles as bases. In a 
block (Fig. 3.56) the body diagonals have the same length. If a, b, and c are the edge lengths of the 
block and d is the length of the diagonal, then 
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ἆἲ -- αἲ Ευ” +c, (3121) V=abe, (3.122) S —2(ab--bc-- ca). (3.123) 
5. Cube (or regular hexahedron) is a block with equal edge lengths: a — b — c, 
d? = 3a’, (3.124) γ-- αὖ, (3.125) S -- θα”. (3.126) 


6. Pyramid (Fig. 3.57) is a polyhedron whose base is a polygon and its lateral faces are triangles 
with a common point, the vertex. A pyramid is called right if the foot of the perpendicular from the 
vertex to the base Ag is at the midpoint of the base. It is called regular if it is right and the base is a 
regular polygon (Fig. 3.58), and n faced if the base is an n gon. Together with the base the pyramid 
has (n 4- 1) faces. For the volume 


= - 3.127 
s (3.127) 
holds. For the lateral area of the regular pyramid 
1 
M = ahs (3.128) 


holds with p as the perimeter of the base and h, as the altitude of a face. 


7. Frustum ofa Pyramid or truncated pyramid is a pyramid whose vertex is cut away by a plane 
parallel to the base (Fig. 3.57, Fig. 3.59). Denoting by 50 the altitude of the pyramid, i.e., the 
perpendicular from the vertex to the base, then 


SA, SB, 5C, 50, (3.129) 

AA BB QC ᾿ 0,0’ l 
Area ABCDEF S0 (3.130 
Area ABCDEF \S0,) © 290) 


holds. If Ap and Ag are the upper and lower bases, resp., h is the altitude of the truncated pyramid, 
i.e., the distance between the bases, and ap and ag are corresponding sides of the bases, then 


. 1 1 ap ap? 
V - gh [Ac +Ap+ VAcAp] = zhác [ +24 (2) | . (3.131) 
The lateral surface of a regular truncated pyramid is 
M= poten, (3.132) 


where pp and pg are the perimeters of the bases, and h, is the altitude of the faces. 


B 


Figure 3.57 Figure 3.58 Figure 3.59 
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a 
VA S 
a 
Figure 3.60 Figure 3.61 Figure 3.62 
8. Tetrahedron is a triangular pyramid (Fig. 3.60). With the notation 
0A =a, 0B =b, 0C = c, CA = q, BC = p and AB = r the following holds: 
0 r? n a? 1 
1 r? 0 pi 
Ves 288 q po cil. (3.133) 
Clg b? 20 1 
11110 


9. Obelisk is a polyhedron whose lateral faces are all trapezoids. In the special case in Fig. 3.61 
the bases are rectangles, the opposite edges have the same inclination to the base but they do not have 
a common point. If a,b and αι, bı are the sides of the bases of the obelisk and h is the altitude of it, 
then: 


V= " [ία + αι) b + (αι + a) bı] í [ab + (a + αι) (b + b1) + abı]. (3.134) 


10. Wedge is a polyhedron whose base is a rectangle, its lateral faces are two opposite isosceles 
triangles and two isosceles trapezoids (Fig. 3.62). For the volume holds 


1 
V= g (24 +a,)bh. (3.135) 


50Booe 


a) d) 
Figure 3.63 


11. Regular Polyeder have congruent regular polyeders as faces and congruent regular corners. 
'The five possible regular polyheders are represented in Fig. 3.63; Table 3.7 shows the corresponding 
data. 

12. Euler's Theorem on Polyeders Ife is the number of vertices, f is the number of faces, and k 
is the number of edges of a convex polyhedron then 


e-k+f=2. (3.136) 
Examples are given in Table 3.7. 
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Table 3.7 Regular polyeders with edge length a 


Number and Number of Total area Volume 

Name torm ortaca Edges | Corners S/a? V/a 

; y2 
Tetrahedron 4 triangles 6 4 νᾶ = 1.7321 12 — 0.1179 
Cube 6 squares 12 8 6 — 6.0 1210 

v2 
Octahedron 8 triangles 12 6 2/3 = 3.4641 —— = 0.4714 

15 15 
5+ 
Dodecahedr 12 pentagons 2 4/5 2/5) = 20.64 — — —— = 7.6631 
odecahedron pentagons | 30 0 3y5(5 + 2V5) 0.6457 τ 7.663 
zb. [s 

Icosahedron 20 triangles 30 12 5/3 = 8.6603 Be) = 2.1817 


3.3.4 Solids Bounded by Curved Surfaces 


In this paragraph the following notations are used: V volume, S total surface, M lateral surface, h 
altitude, Ag base. 
1. Cylindrical Surface is a curved surface which can be got by parallel translation of a line, the 
generating line or generator along a curve, the so-called directing curve (Fig. 3.64). 
2. Cylinder is a solid bounded by a cylindrical surface with a closed directing curve, and by two 
parallel bases cut out from two parallel planes by the cylindrical surface. For every arbitrary cylinder 
(Fig. 3.65) with the base perimeter p, with the perimeter s of the cut perpendicular to the apothem, 
whose area is Q, and with the length / of the apothem the following is valid: 

V-2Ach-Ql, (3.137) M=ph=sl. (3.138 


3. Right Circular Cylinder has a circle as base, and its apothems are perpendicular to the plane 
of the circle (Fig. 3.66). With a base radius R 
V =7 Rh, (3.139) M =2r Rh, (3.140) S —2n R(R +h). (3.141 


4. Obliquely Truncated Cylinder (Fig. 3.67) 


yos t (3.142) M =r B nci f; (3.143 
j hy — hy 2 
S=rR |h +h + R+, R? + 7 ; (3.144 
5. Ungula of the Cylinder With the notation of Fig. 3.68 and with a = y/2 in radians 

H P : g / 3 si 3, 

eS S [a(3 R? — a2) +3 R?(b R)a] = E (sn a a ^ — acos a) ; (3.145) 
2Rl 

Μ- [ο - R)a +a], (3.146) 


where the formulas are valid even in the case b > R, o » m. 
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generating line 


directing curve 


Figure 3.64 Figure 3.65 Figure 3.66 Figure 3.67 


6. Hollow Cylinder With the notation R for the outside radius and r for the inside one, ô = R—r Κα 
R+r 
2 


V =7h(R? — r°) = s hó(2 R — ὃ) = x hó(2r + ô) = 2m hóp. (3.147) 


for the mean radius (Fig. 3.69) 


for the difference of the radii, and o — 


Figure 3.68 Figure 3.69 
7. Conical Surface arises by moving a line, the generating line, along a curve, the direction curve 
so that it always goes through a fixed point, the vertex (Fig. 3.70). 
8. Cone (Fig. 3.71) is bounded by a conical surface with a closed direction curve and a base cut 


out from a plane by the surface. For an arbitrary cone 


bes (3.148) 


3 


(A curve 


vertex 


Figure 3.70 Figure 3.71 Figure 3.72 


9. Right Circular Cone has a circle as base and its vertex is right above the centre of the circle 
(Fig. 3.72). With 1 as the length of the apothem and R as the radius of the base 


l ua 
V = um RPh, (3.149) M=n7Rl=7RVR?2+h?2, (3150 | S—m R(R-I). (3.151) 


10. Frustum of Right Cone or Truncated Cone (Fig. 3.73) 
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a ` 
| a 
Figure 3.73 


l= n? 4 (R— η)”, (3.152) 


th 


V=—(R?+r?+Rr), (3.154) 


3 
11. Conic Sections see 3.5.2.11, p. 206. 


Figure 3.74 


M —nl(R-r), (3.153) 


hr 


R-r 


H=h+ (3.155) 


12. Sphere (Fig. 3.74) with radius R and diameter D = 2R. Every plane section of it is a circle. 
A plane section through the center results in a great circle (see 3.4.1.1, p. 160) with radius R. Only 
one great circle can be fitted through two surface points of the sphere if they are not the endpoints of 
the same diameter. The shortest connecting surface curve between two surface points is the arc of the 


great circle between them (see 3.4.1.1, p. 160). 


Formulas for the surface and for the volume of the sphere: 


S = 4r R? z 12.57 R?, 


S = V36 r V? ~ 4.836 VV?, | (3.156c) 


(3.156a) 


p? l 
V= - & 0.5236 D3, (3.157b) 
1 
R- AE = 0.2821 VS, (3.1582) 


13. Spherical Sector (Fig. 3.75) 
S=nR(2h+a), (3.159) 
14. Spherical Cap (Fig. 3.76) 


a? =h(2R—h) (3.161) 


M -27 Rh s (a) +h’) ; (3.163) 


15. Spherical Layer (Fig. 3.77) 


a? =b = h? 


2_ 2 
Riad È 2h 


) , (3.165) 


M=27Rh, (3.167) 


S =n D? = 3.142 D’, (3.156b) 
4 i s 

V= πω zz 4.189 R?, (3.1572) 
1 S3 - f 

V = gu ~ 009403 v S3, (3.157c) 


[E τ ~ 0.62044 V. (81580) 
T 


. 11 a 2,22) loa 
yz g^ ^ (3a +h ) = 37h (3 R — h), (3.162 


S=n(2Rh | a?) =r (h? H 2a”). (3.164 


B 2 972 2 
V = gh (3a +367 +h), (3.166 


S=n(2Rh+a?+b’). (3.168 


If Vj is the volume of a truncated cone written in a spherical layer (Fig. 3.78) and { is the length of its 
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apothem, then 


V -V = παλ, (3.169) 
Figure 3.75 Figure 3.76 Figure 3.77 


16. Torus (Fig. 3.79) is the solid which can be generated by rotating a circle around an axis which 
is in the plane of the circle but does not intersect it. 


S = 4r? Rr ~ 39.48 Rr, (3.170a) S =7° Dd 9.870 Dd, (3.170b) 
V -- απ” Rr’ 19.74Rr*, — (3.171a) V= iD d? ~ 2.467 Dd?.  (3.171b) 
d 
«πως 
SG 
Figure 3.78 Figure 3.79 Figure 3.80 


17. Barrel (Fig. 3.80) arises by rotation of a generating curve; a circular barrel by rotation of 
a circular segment, a parabolic barrel by rotation of a parabolic segment. For the circular barrel the 
following approximation formulas hold, 


V e 0.262h (2 D? + d?) (3.172a) or V ~ 0.0873 h(2 D +d)’, (3.172b) 


and for the parabolic barrel 


V= d (2p? +Dd+ jd ) ~ 0.05236 (8 D? -- 4 Dd - 3d?) . 


(3.173) 
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3.4 Spherical Trigonometry 


For geodesic measures which extend over great distances the spherical shape of the Earth is taken into 
consideration. Therefore the spherical geometry is necessary. In particular one needs formulas for 
spherical triangles, i.e., triangles lying on a sphere. This was also realized by the ancient Greeks, so 
besides the trigonometry of the plane the trigonometry of the sphere has been developed, and nowadays 
Hipparchus (around 150 BC) is considered as the founder of spherical geometry. 


3.4.1 Basic Concepts of Geometry on the Sphere 
3.4.1.1 Curve, Arc, and Angle on the Sphere 
1. Spherical Curves, Great Circle and Small Circle 


Curves on the surface of a sphere are called spherical curves. Important spherical curves are great 
circles and small circles. They are intersection circles of a plane passing through the sphere, the so- 
called intersecting plane (Fig. 3.81): 
If a sphere of radius R is intersected by a plane K at distance h from the center O of the sphere, then 
for radius r of the intersection circle holds 

r=VR-h? (0<h< R). (3.174) 
For h = 0 the intersecting plane goes through the center of the sphere, and r takes the greatest possible 
value. In this case the intersection circle g in the plane T is called a great circle. Every other intersection 
circle, with 0 < h « R, is called a small circle, for instance the circle k in Fig. 3.81. For h = R the 
plane K has only one common point with the sphere. Then it is called a tangent plane. 


Figure 3.81 Figure 3.82 Figure 3.83 


E On the Earth the equator and the meridians with their countermeridians — which are their reflections 
with respect to the Earth's axis — represent great circles. The parallels of latitude are small circles (see 
also 3.4.1.2, p. 162). 

2. Spherical Distance 

"Through two points A and B of the surface of the sphere, which are not opposite points, i.e., they are 
not the endpoints of the same diameter, infinitely many small circles can be drawn, but only one great 
circle (with the plane of the great circle η). Consider two small circles Κι, ko through A and B and 
turn them into the plane of the great circle passing through A and B (Fig. 3.82). The great circle has 
the greatest radius and so the smallest curvature. So the shorter arc of the great circle is the shortest 
connection between A and B. It is the shortest connection between A and B on the surface of the 
sphere, and it is called the spherical distance. 


3. Geodesic Lines 
Geodesic lines are the curves on a surface which are the shortest connections between two points of the 


surface (see 3.6.3.6, p. 268). 
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W In the plane the straight lines, on the sphere the great circles, are the geodesic lines (see also 3.4.1.2, 
p. 162). 

4. Measurement of the Spherical Distance 

The spherical distance of two points can be expressed as a measure of length or as a measure of angle 
(Fig. 3.83). 

1. Spherical Distance as a Measure of Angle is the angle between the radii 0A and 0B measured 
at the center 0. This angle determines the spherical distance uniquely, and hereafter it is denoted by a 
lowercase Latin letter. The notation can be given at the center or on the great circle arc. 

2. Spherical Distance as a Measure of Length is the length of the great circle arc between A 


and B. It is denoted by AB (arc AB). 
3. Conversions from Measure of Angle into Measure of Length and conversely can be done 
by the formulas 


a è P 
AB= Rarce = RÉ, (3.175a) e =AB D (3.175b) 
ϱ 
Here e denotes the angle given in degrees and arc e denotes the angle in radian (see radian measure 
3.1.1.5, p. 131). The conversion factor ϱ is equal to 


ο 


1 
o = lrad a 57.2958° = 3438’ = 206265”. (3.175c) 
T 


'The determinations of the distance as a measure of length or angle are equivalent but in spherical 
trigonometry the spherical distances are given mostly as a measure of angle. 

W A: For spherical calculations on the Earth's surface usually a sphere is considered with the same 
volume as the biaxial reference ellipsoid of Krassowski. This radius of the Earth is R = 6371.110 km, 
and consequently holds 1° 4 111.2 km, 1’ £ 1853.3m = 1 oldseamile. Today 1 seamile = 1852 m. 


W B: The spherical distance between Dresden and St. Petersburg is AB = 1433 km or 
1433km _ 

2 .3° = 12.89? = 12°53’. 

e= 37km 57.3 89 53 


Figure 3.84 


5. Intersection Angle, Course Angle, Azimuth 

The intersection angle between spherical curves is the angle between their tangent lines at the intersec- 
tion point Pj. If one of them is a meridian, the intersection angle with the curve segment to the north 
from P, is called the course angle a in navigation. To distinguish the inclination of the curve to the east 
or to the west, a sign is to be assigned to the course angle according to Fig. 3.84a,b restricting it to 
the interval —90° < a < 90°. The course angle is an oriented angle, i.e., it has a sign. It is independent 
of the orientation of the curve — of its sense. 

The orientation of the curve from P; to Py, as in Fig. 3.84c, can be described by the azimuth ô: It is 
the intersection angle between the northern part of the meridian passing through P, and the curve arc 
from ΡΙ to Py. The azimuth is restricted to the interval 0° < ὁ < 360°. 
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Remark: In navigation the position coordinates are usually given in sexagesimal degrees; the spherical 
distances and also the course angles and the azimuth are given in decimal degrees. 


3.4.1.2 Special Coordinate Systems 
1. Geographical Coordinates 


To determine points P on the Earth's surface geographical coordinates are in use (Fig. 3.85), i.e., 
spherical coordinates with the radius of the Earth, the geographical longitude λ and the geographical 
latitude vp. 

To determine the degree of longitude the surface of the Earth is subdivided by half great circles from the 
north pole to the south pole, by so-called meridians. The zero meridian goes through the observatory 
of Greenwich. From here one counts the east longitude with the help of 180 meridians and the west 
longitude with 180 meridians. At the equator they are at a distance of 111 km of each other. East 
longitudes are given as positive, west longitudes are given as negative values. So —180° < A < 180°. 


—X 


Figure 3.85 Figure 3.86 


Το determine the degree of the latitude the Earth's surface is divided by small circles parallel to the 
equator. Starting from the equator 90 degrees of latitude are to be counted to the north, the northings, 
and 90 southern latitudes. Northings are positive, southern latitudes are negative. So —90° < y < 90°. 
2. Soldner Coordinates 
The right-angled Soldner coordinates and Gauss- Krüger coordinates are important in wide surface sur- 
veys. To map parts of the curved Earth's surface onto a right-angled coordinate system in a plane, 
distance preserving in the ordinate direction, according to Soldner the z-axis is to be placed on a merid- 
jan (it is called a central meridian), and the origin at a well-measured center point (Fig. 3.86a). The 
ordinate y of a point P is the segment between P and the foot of the spherical orthogonal (great circle) 
on the central meridian. The abscissa x of the point P is the segment of a circle between P and the 
main parallel passing through the center, where the circle is in a parallel plane to the central meridian 
(Fig. 3.86b). 
Transferring the spherical abscissae and ordinates into the plane coordinate system the segment Ax is 
stretched and the directions are distorted. The coefficient of elongation a in the direction of the abscissa 
- Δα γιὰ y? 
= Dr 9527 
Το moderate the stretching, the system may not be extended more than 64km on both sides of the 
central meridian. A segment of 1 km length has an elongation of 0.05 m at y = 64 km. 
3. Gauss-Krüger Coordinates 
In order to map parts of the curved Earth's surface onto the plane with an angle-preserving (conformal) 
mapping, at Gauss-Krüger system first a partition into meridian zones is prepared. For Germany these 
mid-meridians are at 65, 0”, 12?, and 15? east longitude (Fig. 3.87a). The origin of every meridian 
zone is at the intersection point of the mid-meridian and the equator. In the north-south direction the 
total range is to be considered, in the east-west direction a 1°40! wide strip on both sides. In Germany 
it amounts ca. 3100 km. The overlap is 20’, which is here nearly 20 km. 


R = 6371 km. (3.176) 
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rig salue. H gh alue y-elongation The coefficient o elongation a 
x in the abscissa direction 
x - elon- (Fig. 3.87b) is the same as in 


Î gation [Ἢρ Soldner system (3.176). To 
keep the mapping angle-preser- 


Ba: | ving, the quantity b must be 
a) |-- 500 km 4 b) added to S S 
b= (3.177) 
Figure 3.87 6 R? 


3.4.1.3 Spherical Lune or Biangle 


Suppose there are two planes ΓῚ and Τὸ passing through the endpoints A and B of a diameter of the 
sphere and enclosing an angle a (Fig. 3.88) and so defining two great circles gı and gy. The part of 
the surface of the sphere bounded by the halves of the great circles is called a spherical lune or biangle 
or spherical digon. The sides of the spherical biangle are defined by the spherical distances between A 
and B on the great circles. Both are 180°. 

As the angles of the biangle one defines the angles between the tangents of the great circles g, and gz at 
the points A and B. They are the same as the so-called dihedral angle a between the planes Γι and Το. 
If C and D are the bisecting points of both great circle arcs A and B , the angle a can be expressed as 
the spherical distance of C and D. The area Aj of the spherical biangle is proportional to the surface 
area of the sphere just as the angle α to 360? . Therefore the area is 


4r Ra 2R?a 
360 ο 


—2Rarco with the conversion factor o as in (3.175c). (3.178) 


Figure 3.88 Figure 3.89 Figure 3.90 


3.4.1.4 Spherical Triangle 


Consider three points A, B, and C on the surface of a sphere, not on the same great circle. Connecting 
every two of them by a great circle yields a spherical triangle ABC (Fig. 3.89). 

The sides of the triangle are defined as the spherical distances of the points, i.e., they represent the 
angles at the center between the radii 0A, 0B, and 0C. They are denoted by a, b, and c, and hereafter 
they are given in angle measure, independently of whether they are denoted at the center as angles or 
on the surface as great circle arcs. The angles of the spherical triangle are the angles between every two 
planes of the great circles. They are denoted by a, 3, and y. 

The order of the notation of the points, sides, and angles of the spherical triangle follows the same 
scheme as for triangles of the plane. A spherical triangle is called a right-sided triangle if at least one 
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side is equal to 90°. There is an analogy with the right-angled triangles of the plane. 
3.4.1.5 Polar Triangle 


1. Poles and Polar The endpoints of a diameter P, and P are called poles, and the great circle g 
being perpendicular to this diameter is called polar (Fig. 3.90). The spherical distance between a pole 
and any point of the great circle g is 90°. The orientation of the polar is defined arbitrarily: Traversing 
the polar along the chosen direction there is a left pole to the left and a right pole to the right. 

2. Polar Triangle A'B'C' of a given spherical triangle ABC is a spherical triangle such that the 
vertices of the original triangle are poles for its sides (Fig. 3.91). For every spherical triangle ABC 
there exists one polar triangle A'D'C'. If the triangle A'B'C" is the polar triangle of the spherical 
triangle ABC , then the triangle ABC is the polar triangle of the triangle A'B'C". The angles of a 
spherical triangle and the corresponding sides of its polar triangle are supplementary angles, and the 
sides of the spherical triangle and the corresponding angles of its polar triangle are supplementary 
angles: 


a 2180 —o, Wf =180°-8,  c-180—, (3.1792) 


a! = 180? — a, θ' = 180? — b, y = 180° -- c. (3.179b) 


Figure 3.91 Figure 3.92 


3.4.1.6 Euler Triangles and Non-Euler Triangles 


The vertices A, B, C of a spherical triangle divide every great circle into two, usually different parts. 
Consequently there are several different triangles with the same vertices, e.g., also the triangle with 
sides a’, b, c and the shadowed surface in Fig. 3.92a. According to the definition of Euler one should 
always choose the arc which is smaller than 180° as a side of the spherical triangle. This corresponds 
to the definition of the sides as spherical distances between the vertices. Considering this, all spherical 
triangles of whose sides and angles are less than 180° are called Euler triangles, otherwise they are called 
non-Euler triangles. In Fig. 3.92b there is an Euler triangle and a non-Euler triangle. 


3.4.1.7 Trihedral Angle 


This is a three-sided solid formed by three edges sa, Sp, s. starting at a vertex O (Fig. 3.93a). The 
angles a, b, c are defined as sides of the trihedral angle, every of them is enclosed by two edges. The 
regions between two edges are called the faces of the trihedral angle. The angles of the trihedral angle 
are a, B, and y, the angles between the faces. If the vertex of a trihedral angle is at the center O of 
a sphere, it cuts out a spherical triangle of the surface (Fig. 3.93b). The sides and the angles of the 
spherical triangle and the corresponding trihedral angle are coincident, so every theorem derived for a 
trihedral angle is valid for the corresponding spherical triangle, and conversely. 
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Figure 3.93 


3.4.2 Basic Properties of Spherical Triangles 
3.4.2.1 General Statements 


For an Euler triangle with sides a,b,c, whose opposite angles are a, 3,7, the following statements are 
valid: 
1. Sum ofthe Sides The sum of the sides is between 0° and 360°: 


0° <a+b+c< 360°. (3.180 
2. Sum of two Sides The sum of two sides is greater than the third one, e.g., 
a b c. (3.181 


3. Difference of Two Sides The absolute value of the difference of two sides is smaller than the 
third one, e.g., 


|a — b| < c. (3.182 
4. Sum ofthe Angles The sum of the angles is between 180° and 540°: 

180? < a + B +y < 540°. (3.183 
5. Spherical Excess The difference 

ce=a+ß +y- 180° (3.184 


is called the spherical excess. 
6. Sum of Two Angles The sum of two angles is less than the third one increased by 180°, e.g., 
a 4- B < y+ 180°. (3.185 

7. Opposite Sides and Angles Opposite to a greater side there is a greater angle, and conversely. 
8. Area The area Ar of a spherical triangle can be expressed by the spherical excess e and by the 
radius of the sphere R with the formula 

" R 2e 
180° 
Here ϱ is the conversion factor (3.175c). From the theorem of Girard, with Ag as the surface area of the 
sphere, holds 


As 
e. 1860] 
2 (3.186b) 


If the sides are known and not the excess, then e can be calculated by the formula of L'Huilier (3.201). 


Ap =eR?- = R? arce. (3.186a) 


T= 


3.4.2.2 Fundamental Formulas and Applications 


The notation for the quantities of this paragraph corresponds to those of Fig. 3.89. 
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1. Sine Law 


si n à s " b T. B ο] d si 
sina sina (3.1872) sino . sin| , (3187b) HEC SHE (3.187c) 


sinc siny sina sina’ 


sinb — sing" 
The equations from (3.1872) to (3.187c) can also be written as proportions, i.e., in a spherical triangle 
the sines of the sides are related as the sines of the opposite angles: 


sina sinb sinc 


= = ἱ 3.187d 
sina  sinf siny ( ) 
'The sine law of spherical trigonometry corresponds to the sine law of plane trigonometry. 
2. Cosine Law, or Cosine Law for Sides 
cosa = cosbcosc + sinbsinccosa, (3.188a) cosb = cosccosa + sincsinacos B, (3.188b) 
cos c = cosa cos b + sin a sin bcos y. (3.188c) 


The cosine law for sides in spherical trigonometry corresponds to the cosine law of plane trigonometry. 
From the notation one can see that the cosine law contains the three sides of the spherical triangle. 
3. Sine-Cosine Law 
sina cos β = cos bsin c — sin b cos c cos q, (3.189a 
sin a cos y = cos csin b — sinc cos bcos a. (3.189b 
One can get four more equations by cyclic change of the quantities (Fig. 3.34). 
The sine-cosine law corresponds to the projection rule of plane trigonometry. Because it contains five 
quantities of the spherical triangle it is not used directly for solving problems of spherical triangles, bu 
it is used for the derivation of further equations. 


4. Cosine Law for Angles of a Spherical Triangle 


cosa = — cos D cos y + sin B sin y cos a, (3.190a 
cos B = — cos y cosa + sin y sin a cos b, (3.190b 
cosy = — cosa cos f + sina sin f cos c. (3.190c 


'This cosine rule contains the three angles of the spherical triangle and one of the sides. With this 
law one can easily express an angle by the opposite side with the angles on it, or a side by the angles; 
consequently every side can be expressed by the angles. Contrary to this, for plane triangles the third 
angle is calculated from the sum of 180°. 
Remark: It is not possible to determine any side of a plane triangle from the angles, because there are 
infinitely many similar triangles. 
5. Polar Sine-Cosine Law 

sin a cos b = cos f sin y + sin cos y cos a, (3.191a) 


Sino cosc = cos y sin B + sin ^ cos B cos a. (3.191b) 


Four more equations can be get by cyclic change of the quantities (Fig. 3.34). 
Just as for the cosine law for angles, also the polar sine-cosine law is not usually used for direct calcu- 
lations for spherical triangles, but to derive further formulas. 
6. Half-Angle Formulas 
Το determine an angle of a spherical triangle from the sides one can use the cosine law for sides. The 
half-angle formulas allow us to calculate the angles by their tangents, similarly to the half-angle for- 
mulas of plane trigonometry: 

α 


sin(s — b) sin(s — sin(s — c)sin(s — a) 


(3.192b) 


tan 


sin ssin(s — a) sin s sin(s — b) 
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sin(s — a) sin(s — b) 


(3.194) 


3.192c s —————— 
sin ssin(s — c) ( ) 5 
If from the three sides of a spherical triangle all the three angles should be determined, then the follow- 
ing calculations are useful: 
k β k 


α B 
tan 9 = sin(s — a)’ (3.193a) tan 3 = sin(s — D) — by (3.193b) 


y k : 
tanc = —————— νι” .193c 
DE "PE with (3.193c) 
_ ou a) sin(s b) sin(s o) (3.193d) -f H + c (3.1930) 
sins 


7. Half-Side Formulas 
With the half-side formulas it is possible to determine one side or all three sides of a spherical triangle 
from its three angles: 


ΜΝ cos(a — B) cos(o — η) (3.1942) eet _ | cos(a — 7) cos(o = a) 3.194b 
2 — cosa cos(o — a) 2 — cosa cos(a — p) 
: :os(o — a) cos(o — £ (EB 
cot E = | E8 = α) cos(a B) 3.194c) z= ο ης 3 1944 
2 — cosa cos(a — η) 2 
or 
a k' b κ’ 
= = ——_ 1955 ι--------, 1951 
cot 5 cic =a 3.195a) cot 5 e cm 3.195b 
T ENDE itl (3.195c 
CO 2  cos(s— 4) with Ἢ ο 
" = a) cos(a — B) cos(a » (3.195d) _ at B+ Y (3.195e 
— cosa 2 
Since for the sum of the angles of a spherical triangle according to (3.183): 
180° < 20 < 540° or 90° < ø< 270° (3.196 


holds, cosa < 0 must always be valid. Because of the requirements for Euler triangles all the roots are 
real. 

8. Applications of the Fundamental Formulas of Spherical Geometry 

With the help of the given fundamental formulas, for instance distances, the azimuth, and course angles 
on the Earth can be determined. 


W A: Determine the shortest distance between Dresden (Ay = 13°46’, φι = 51°16’) and Alma Ata 
(λο = 76°55’, p2 = 43°18’). 

Solution: The geographical coordinates (λι, 1), (As, %2) and the north pole N (Fig. 3.94) result in 
two sides of the triangle P| P,N a = 90? — y2 and b = 90° — yı lying on meridians, and also the angle 
between them y = Ay — λι. For c = e it follows from the cosine law (3.188a) 


cosc = cosa cosb + sin asin bcos y, 
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Figure 3.94 Figure 3.95 


cose = cos(90° — φι) cos(90° — ϱο) + sin(90° — y1) sin(90° — p2) σοβ(λο — A1) 
sin y1 sin Y2 + cos 1 cos p2 cos(À — A1), (3.197) 


i.e., cose = 0.53498 + 0.20567 = 0.74065,e = 42.213. The great circle segment P,P, has length 
4694 km using (3.1752). 

W B: Calculate the course angles δι and 62 at departure and at arrival, and also the distance in sea miles 
of a voyage from Bombay (A, = 72°48’, pı = 19*00") to Dar es Saalam (Ay = 39°28’, ο = —6°49') 
along a great circle. 

Solution: The calculation of the two sides a = 90? — οι = 71°00',b = 90° — yo = 96549’ and the 
enclosed angle y = λι — As = 33?20' in the spherical triangle P, PN with the help of the geographical 
coordinates {λι, %1), (A2, ρα) (Fig. 3.95) and the cosine law (3.188c) cosc = cose = cosacosb + 


sina sin b cos y yields P, P5— e = 41.777°, and because 1’ ~ 1 sm follows P, P5z: 2507 sm. 
With the cosine law for sides (3.188a) 
OS a — cos bcos c 3b — cosa cos € 
a = arccos τος = 51.248° and f = arccos = 125.018". 
sin bsinc sinasinc 

Therefore, the results are 

δι = 360° — 8 = 234.982° and δὲ = 180° + a = 251.248". 
Remark: It makes sense to use the sine law to determine sides and angles only if it is already obvious 
from the problem that the angles are acute or obtuse. 


3.4.2.3 Further Formulas 


1. Delambre Equations 
Analogously to the Mollweide formulas of plane trigonometry the corresponding formulas of Delambre 
are valid for spherical triangles: 


α-β . a+b . q—f . a—b 
cos sin sin sin 
— --ᾱ--, (3.198a) 2 — (3.198b) 
sin 9 sin 9 cos 9 sin 9 
a+ a+b a+b a—b 
cos cos Sin cos 
— 2... (3.198c) 2 2 (3.198d) 
sın 9 cos 9 cos 9 cos 9 


Since for every equation two more equations exist by cyclic changes, altogether there are 12 Delambre 
equations. 
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2. Neper Equations and Tangent Law 


-α-ὖ m e b 
a—g sin 3 a+8 S= η 
tan 5 cee cot 5 (3.1992) tan 5 ap cot 2" (3.199b) 
sin cos 
. a—f co α-β 
a-b ΠΠ ς a+b Ότο c 
tan gj ~ Bam tan 2’ (3.199c) tan μμ... tan 5: (3.199d) 
sin cos — 


These equations are also called Neper analogies. From these one can derive formulas analogous to the 
tangent law of plane trigonometry: 


a—b α-β b-c β- γ 
tan tan 9 tan tan 9 
" Ξ a (3.2002) " δα = : Hos (3.200b) 
an jan 2 jan 28 an 2 
tan — tan 2 7 Ξ 
= z a (3.200c) 
tan T tar EM 


3. L'Huilier Equations 

The area of a spherical triangle can be calculated with the help of the excess e , either from the known 
angles a, B, ^; according to (3.184), or if the three sides a, b, c are known, with the formulas about the 
angles from (3.193a) to (3.193e). The L’Huilier equation makes possible the direct calculation of e from 
the sides: 


: s, s$— S—b, « 
tant Ξ Tm - 3 7 tan? tan? (3.201) 


ο 
2 2 
This equation corresponds to the Heron formula of plane trigonometry. 


3.4.3 Calculation of Spherical Triangles 


3.4.3.1 Basic Problems, Accuracy Observations 

'The different cases occurring most often in calculations of spherical triangles are arranged into so-called 
basic problems. For every basic problem for acute-angled spherical triangles there are several ways to 
solve it, and it depends on whether the calculations are based only on the formulas from (3.187a) to 
(3.191b) or also on the formulas from (9.192) to (3.201), and also whether one is looking for only one 
or more quantities of the triangle. 
Formulas containing the tangent function yield numerically more accurate results, especially in com- 
parison to the determination of defining quantities with the sine function if they are close to 90°, and 
with the cosine function if their value is close to 0° or 180°. For Euler triangles the quantities calculated 
with the sine law are bi-valued, since the sine function is positive in both of the first quadrants, while 
the results obtained from other functions are unique. 


3.4.3.2 Right-Angled Spherical Triangles 

1. Special Formulas 

In a right-angled spherical triangle at least one of the angles is 90°. The sides and angles are denoted 
analogously to the plane right-angled triangle. If as in Fig. 3.96 η is a right angle, the side c is called 
the hypotenuse, a and b the legs and a and £ are the leg angles. From the equations from (3.187d) to 
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(3.191b) it follows for y = 90°: 


sina = sin a sin c, (9.202. sin b = sin f sin c, (3.202b 
60856 = cos a cos b, (3.202c cosc = cot a cot B, (3.202d 
tana = cos B tan c, (3.202e tanb = cosa tan c, (3.202 
tanb = sina tan β, (3.202g tana = sin btana, (3.202h 
cosa = sin P cosa, (3.202i cos B = sina cos b. (3.202j 


z 
909-b 90-a 


- 


Figure 3.96 Figure 3.97 


If in certain problems other sides or angles are given, for instance instead of a, β, y the quantities b, y, a, 
the necessary equations can be get by cyclic change of these quantities. For calculations in a right- 
angled spherical triangle one usually starts with three given quantities, the angle y = 90° and two 
other quantities. There are six basic problems, which are represented in Table 3.8. 


Table 3.8 Defining quantities of a spherical right-angled triangle 


Basic Given defining Number of the formula to 
problem | quantities determine other quantities 
1 Hypotenuse and a leg c, a a (3.202a), 8 (3.202e), b (3.202c) 
2 Two legs a, b a (3.202h), 8 (3.202g), c (3.202c) 
3. Hypotenuse and an angle c, a a (3.202a), b (3.2021), 8 (3.2024) 
4 Leg and the angles on it a, 3 c (3.2020), b (3.202j), o (3.2021) 
5. Leg and the angle opposite to it a, a | b (3.202h), c (3.2022), 8 (3.2021) 
6. Two angles a, ϐ a (3.2021), b (3.202j), c (3.202d) 
2. Neper Rule 
The Neper rule summarizes the equations (3.202a) to (3.2027). If the five determining quantities of a 


right-angled spherical triangle, not considering the right angle, are arranged along a circle in the same 
order as they are in the triangle, and if the legs are replaced by their complementary angles 90° — a, 
90° — b, (Fig. 3.97), then the following is valid: 
1. The cosine of every defining quantity is equal to the product of the cotangent values of its neighboring 
quantities. 


2. The cosine of every defining quantity is equal to the product of the sine values of the non-neighboring 
quantities. 


tanb 
W A: cosa = cot(90° — b) cot c = € (see (3.202f) . 
an 


W B: cos(90° — a) = sincsina = sina (see (3.202a)). 


W C: Map the sphere given by a grid, onto a cylinder which contacts the sphere along a meridian, the 
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so-called central meridian. This meridian and the equator form the axis of the Gauss-Kriiger system 


(Fig. 3.98a,b). 


x^ geogr. 


ΠΠ grid 


Figure 3.98 Figure 3.99 


Solution: A point P of the surface of the sphere will have the corresponding point P’ on the plane. 
The great circle g passing through the point P perpendicular to the central meridian is mapped into 
a line g' perpendicular to the x-axis, and the small circle k passing through P parallel to the given 
meridian becomes a line k’ parallel to the x-axis (grid meridian). The image of the meridian m through 
P is not line but a curve m’ (true meridian). The upward direction of the tangent of m’ at P' gives the 
geographical north, the upward direction of Κ' gives the grid north direction. The angle y between the 
two north directions is called the meridian convergence. 

In a right-angled spherical triangle QPN with c = 90° — p, and b = η one gets y from a = 90° — η. 
tan 
tan(90° — ϱ) 
y and η are mostly small, one can consider siny ~ y, tan” & η; consequently y = 1 tan y is valid. 
The length deviation y of this cylinder sketch is pretty small for small distances η, and so one can 
substitute η = y/ R, where y is the ordinate of P. This yields y = (y/ R) tanq. The conversion of y 
from radian into degree measure yields a meridian convergence of y = 0.018706 or y = 1*04'19" at 

ip = 50°, y = 100 km. 


3.4.3.3 Spherical Triangles with Oblique Angles 


For three given quantities, there is to be distinguished between six basic problems, just as it was done 
for right-angled spherical triangles. The notation for the angles is a, 8, γ and a,b,c for the opposite 
sides (Fig. 3.99). 

Tables 3.9, 3.10, 3.11, and 3.12 summarize, which formulas should be used for which defining quan- 
tities in the case of the six basic problems. Problems 3, 4, 5, and 6 can also be solved by decomposing 
the general triangle into two right-angled triangles. Το do this for problems 3 and 4 (Fig. 3.100, 
Fig. 3.101) one can use the spherical perpendicular from B to AC to the point D, and for problems 5 
and 6 (Fig. 3.102) from C to AB to the point D. 


. tanb . 
The Neper rule yields cosa = " or cos(90° — 3) = ,siny = tanņtan y. Because 
anc 


Figure 3.100 Figure 3.101 Figure 3.102 
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In the headings of Tables 3.9, 3.10, 3.11, and 3.12 the given sides and angles are denoted by S and 
W respectively. This means for instance SWS: Two sides and the enclosed angle are given. 


Table 3.9 First and second basic problems for spherical oblique triangles 


First basic problem Second basic problem 
Given: 3 sides a, b, c SSS Given: 3 angles a, B, γ WWW 
Conditions: Conditions: 
0° < a+b + c « 360°, 180° < a + B + y < 540°, 
a+b>c,a+c>b,b+c>a. a+ B < 180? +7y,œa +y < 180? + B, 
B τ s 180? + a. 
Solution 1: Required a. Solution 1: Required a. 
cosa — cosbcosc cos a + cos f cos y 
cosa = ———— — ——— or cosa = ——— ———— —- 
sinbsinc sin sin y 
tang- sin(s - b) sin(s — c) -— cos(a — B) cos(o — η) 
2 3 2 — cosa cos(c — a) 
ac By 
icc ens Να τε καὶ 
Solution 2: Required a, 3, y. Solution 2: Required a, b, ο. 
k = (ae — a)sin(s — b) sin(s — c) pu = — a) cos(o — B) cos(o — η) 
sins , — cosa i 
tan € k i B k κα κ’ b k! 
an = , tan- = , |cot- = , cot- = ; 
2. si(s—a)' 2 sin(s — b) 2 cos(a—a)’ 2  cos(c — B) 
tan A i t $ EE oo 
ang = —— ———. cot = = i 
2  sin(s—c) 2  cos(c — 3) 
Checking: (s a) t (s b) + (s c) =g Checking: (σ α) } (σ B) | (σ 7) σ, 
mE MP OE EA „a b 6 "TM 
tan P tan 5 tan 7 Sins = k. cot — cot 7 cot -{-- cosa) = k’. 


Wi A Tetrahedron: A tetrahedron has base ABC and vertex S 
(Fig. 3.103). The faces ABS and BCS intersect each other at an angle 
B = 74°18’, BCS and CAS at an angle η = 63°40’, and CAS and ABS 
at a = 80°00’. How big are the angles between every two of the edges 
AS, BS, and CS? 

Solution: From a spherical surface around the vertex S of the pyramid, 
the trihedral angle (Fig. 3.103) cuts out a spherical triangle with sides 
a, b, c. 

The angles between the faces are the angles of the spherical triangle, 
the angles between the edges, which are searched for, are the sides. The 
determination of the angles a, b, c corresponds to the second problem. 
Solution 2 in Table 3.9 yields: 

c = 108°59’, o — a = 28°59, c — B = 34°41’, o — y = 4519’, k' 
1.246983, cot(a/2) = 1.425514, cot(b/2) = 1.516440, cot(c/2) 
Figure 3.103 1.773328. 


Β B Radio Bearing: In the case of a radio bearing two fixed stations Pj(À1, φι) and Ρη(λο, 3) 
receive the azimuths δι and δὲ via the radio waves emitted by a ship (Fig. 3.104). The task is to 
determine the geographical coordinates of the position Pp of the ship. The problem, known by marines 
as the shore-to-ship bearing, is an intersection problem on the sphere, and it will be solved similar to the 
intersection problem in the plane (see 3.2.2.3, p. 148). 

1. Calculation in triangle P; PN: In triangle P, P5N the sides ΡΙΝ = 90° — 1, ΕΝ = 90° — e» and 
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the angle X PiN P = A2 — Ay = AA are given. The calculations of the angle Ζει, €2 and the segment 
P, P, = e correspond to the third basic problem. 


Table 3.10 Third basic problem for spherical oblique triangles 


'Third basic problem SWS 
Given: 2 sides and the enclosed angle, e.g., a, b, 7 
Condition: none 
. a=b 
. : α- 8 sin 
Solution 1: Required c, or canda. | tan LA cot 
cosc = cosa cos b + sin a sin b cos y, 2 kia d +6 2 
, sina sin 7 
sina = ———.. α-β 
bei pan (—90° < 5 — « 90°) 
a can >e in quadrant I. or II. a+8 d "T up 
We apply the theorem: a 5 i B- 3 : 
Larger angle Is opposite Solution 4: Required a, 3, c. 
to the larger side or db P 
checking calculation: , a48 099—055 Z 
s Jb cose 0 a in q. I. tan + ; 
cosa " cos b cos p zU- ain q. IL. os a sin Y 1 
Solution 2: Required a, or a and c. 2 2 
a- 
tanu = vana goy α-- sin - cos 2 z 
tan ysin u tan = = a = t7 
tang = — > 2 a ᾱ- Ν 
sin(b — u) sin —— sin 5 
tan(b — u oc B 5 
tanc = tanb =u) (—90? < < 90°) 
; cosa, , , f 2 f 
Solution 3: Required o and (or) 2. gi +8 a-B ga ot B a-B 
-— a—b ΠῚ 2 2 
a+ adi / à , 1 
tan == 2 cot, οὓς = zi Sine Z 
a+b 2 . a+b’ 9 .α-β᾽ 
2 sin sin 
Checking: Double calculation of c. 


2. Calculation in the triangle Pj; P; Py: Because £1 = 
δι — £1, & = 360? — (05 + εὐ), the side e and the angles 
lying on it € and 6ο are known in P, Pp P3. Calculations 
of the sides εἰ and eg correspond to the fourth basic prob- 
lem, third solution. The coordinates of the point P can 
be calculated from the azimuth and the distance from P, 
or Pz. 

3. Calculation in the triangle NP; Py: In the triangle 
N P, Ps there are given two sides N P, = 905--φι, Pi Po = 
ei and the included angle δι. The side N Py = 90°— po and 
the angle Άλι are calculated according to the third basic 
problem, first solution. For checking one can calculate in 
the triangle N P5 P5 for a second time N P) = 90°— yo, and 
also AAs. Consequently, the longitude Ag = Ay + Άλι = 
Figure 3.104 λ2 — A2» and the latitude qo of the point Py are known. 
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3.4.3.4 Spherical Curves 

Spherical trigonometry has a very important application in navigation. One basic problem is to deter- 
mine the course angle, which gives the optimal route. Other fields of application are geodesic surveys 
and also robot-movement planning. 


Table 3.11 Fourth basic problem for spherical oblique triangles 


Fourth basic problem WSW 
Given: One side and two adjacent angles, e.g., a, B, c 
Condition: none 
Solution 1: Required η, or η and a . 
cos y = — cosa cos 3 + sino sin B cosc, 
: : a—b 
: sin ¢ sin a (—90° < < 90°), 
sina = ———__.. i 
Sin y i a+b α-ὐ a+b a—b 
a can be in quadrant I or II. We ap- a= H , b= 9 5 
ply the theorem: The larger side is in | Solution 4: Required a, b, y. ^ 
opposite to the larger angle or checking a-B.c 
calculation: +b «55 sm> 7 
aing. I tan = 8 2 πμ 
cosa + cos B cosy Z0 — win q. IL. 2 cos ΤΕ "TE 
Solution 2: Required a, or a and y. Δ 2 E E 
tanccos sin ———— sin — 
cot u = tana cosc, tana = ; E i α-ὺ M du D EA 
cos(B = n) tan c Q B C Ν’ 
οοί(ϐ — u) sin OS = 
πρ, pb 
cosa o o 
Solution 3: Required a and (or) b. (90° < < 90°) 
oa α-β a+b  a—b b a+b a—b 
p. c9 o og = avs 
tan mr = — 3 tan i y 2 2 2 2 
2 a4 8 2 y 7 γ Z 
os sinc = , col = i 
2 2 . a+b’ 2 . a—b 
.α-β sin sin 
a-b  9m79 ς . . 
tan zm. tan 5 Checking: Double calculation of y. 
sin —— 


1. Orthodrome 
1. Notion The geodesic lines of the surface of the sphere — which are curves, connecting two points 
A and B by the shortest path — are called orthodromes or great circles (see 3.4.1.1, 3., p. 160). 
2. Equation ofthe Orthodrome Moving on an orthodrome — except for meridians and the equator 
— needs a continuous change of course angle. These orthodromes with position-dependent course angles 
a can be given uniquely by their point closest to the north pole Py(An, qx), where yy > 0° holds. The 
orthodrome has the course angle ay = 90° at the point closest to the north pole. The equation of the 
orthodrome through Py and the running point Q(A, p), whose relative position to Py is arbitrary, can 
be given by the Neper rule (3.4.3.2,2., p. 170) according to Fig. 3.105 as: 

tan qw cos(A — Ax) = tan y. (3.203) 
Point Closest to the North Pole: The coordinates of the point closest to the north pole Px(Aw, ὧν) 
of an orthodrome with a course angle a4 (a4 # 0°) at the point A(A4, pa) (ya 90”) can be calcu- 
lated by the Neper rule (3.4.3.2,2., p. 170) considering the relative position of Py and the sign of a4 
according to Fig. 3.106 as: 


tanya 
arccos ———.|. 


ὧν = arccos(sin|a4|cosy4) (3.204a) and Ay = λα sign(a4) - (3.204b) 
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Remark: 


If a calculated geographical distance A is not in the domain —180° < A < 180°, then for 


A Æ +k -180° (k € IN) the reduced geographical distance Area is: 


λ 
Area = 2 arctan (ων 3) : (3.205) 
This is called the reduction of the angle in the domain. 
Table 3.12 Fifth and sixth basic problems for a spherical oblique triangle 
Fifth basic problem SSW Sixth basic problem WWS 


Given: 2 sides and an angle opposite 
to one of them, e.g., a, b, « 


Given: 2 angles and a side opposite 
to one of them, e.g., a, a, B 


Conditions: See distinction of cases. 


Conditions: See distinction of cases. 


Solution: Required is any missing quantity. 
; sin bsina 
sin b = == 
sina : 
Let 84 be acute and Gy = 180? — βι obtuse. 
Distinction of cases: 
sinbsina s 
————— 0 solution. 
sina 
sinbsina , 
— — 1 solution 8 = 90°. 
sina 
sinbsina 
sina ' 
sina > sinb: 
1 6 < 90° 
2. b > 90° 
3.2. sina <sinb: 
a < 90? , a < 90° 
a > 90° ,a > 90° 
; να» 90° 
3.2.2. a > 90°, < 90° 
Further calculations with one angle 
or with two angles 8: 


1 solution f. 
1 solution £5. 


2 solutions 


By, Bo. 


0 solution. 


2 values 61, £5 are possible. 


Solution: Required is any missing quantity. 
sinasin 3 


sinb — 2 values bı, by are possible. 


sina 
Let b; be acute and by = 180? — βι obtuse. 
Distinction of cases: 
sina sin 8 . 
——— »]1 Osolution. 
sina 


sina sin f 
PER ec 1 I1solution b = 90°. 


sina 
sina sin 8 
sina 
3.1. sino > sin f. 
3.11. 8 «90? LIsolution b. 
3.12. 8 >90° 1 solution bə. 


3.2. sina «sinf: 


391, 2 < 90? , a < 90? | 2 solutions 
UU" a »90*,o > 905 by, bo. 
ə a<90°,a > 90° ate 
3.2.2. a> 905 a < 90° } 0 solution. 


Further calculations with one side 
or with two sides b: 


Method 1: Method 2: | 

tanu = tanbcosa, tan’ ntt p cos a : 4 Tm 2t Boos : 5 : 
anu = os É an = = tan ——~——_4—_., jan — — cot ; 

= ' = a i l 2 2 cos 2 z 8 2 sin arh 

cot p = cosbtana, . a+ .a—b 

cot? = cosa tan f, =o a— psn 9. . -cot a — Bsn 2 

Y -etv. m 2 " 


Checking: Double calculation of : and ; 


Intersection Points with the Equator: The intersection points Pg, (Ag, , 0?) and Pg, (Ag,, 0°) of the 
orthodrome with the equator can be calculated from (3.203) because tan py cos(Ag, — An) = 0 (v = 


1,2) must hold: 
λε, = An F 90° 


(v = 1,2). 


(3.206) 
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Figure 3.105 Figure 3.106 


Remark: In certain cases an angle reduction is needed according to (3.205). 
3. Arclength If the orthodrome goes through the points A(A4, ya) and B(Ag, pB), the cosine law 


for sides yields for the spherical distance d or for the arclength between the two points: 
d = arccos|sin y4 sin yg + cos 4 cos og cos(Ap — A4)]. (3.2072) 
This central angle can be converted into a length considering the radius of the Earth R: 
; Ρ TR 
d = arccos|sin y4 sin yg + cos q4 cos pp cos(Ag — A4)]- (3.207b) 


180° 
4. Course Angle Using the sine and cosine laws for sides to calculate sin a4 and cos a4, gives the 
final result for the course angle a4 after division: 


COS pA cos pp sin(Àg — λα) 


QA — arctan 


3.208 
sin yg — sin y4 cos d ( ) 
Remark: With the formulas (3.207a), (3.208), (3.204a) and (3.204b), the coordinates of the point Py 
closest to the north pole can be calculated for the orthodrome given by the two points A and B. 

5. Intersection Point with a Parallel Circle For the intersection points X4(Ax,, x) and Χο(λχ,, 
px) of an orthodrome with the parallel circle y = yx one gets from (3.203): 


tan yx 


Ax, = An F arccos 


1,2). 3.209 
τ. τ 2) (3.209) 
From the Neper rule (3.4.3.2,2., p. 170) used for the intersection angles αχ, and ax,, by which an 
orthodrome with a point Pxy(An, Yn) closest to the north pole intersects the parallel circle y = yx: 


CEN. νεα, (3.210) 
Ux 


[ax | = arcsin 


The argument in the arc sine function must be extremal with respect to the variable yx for the minimal 
course angle |as;,|. This gives: sin ον =0 — yx = 0, i.e., the absolute value of the course angle is 
minimal at the intersection points of the equator: 

[a Xmin| = 90° — py. (3.211) 
Remark 1: Solutions of (3.209) exist only for |gx| € ex. 
Remark 2: In certain cases a reduction of the angle is needed according to (3.205). 
6. Intersection Point with a Meridian For the intersection point Y (Ay, yy) of an orthodrome 
with the meridian A = Ay according to (3.203) is given by 

py = arctan[tan ex cos(Ay — Ax)]. (3.212) 
2. Small Circle 
1. Notion The definition of a small circle on the surface of the sphere must be discussed here in more 
detail than in 3.4.1.1, p. 160: The small circle is the locus of the points of the surface (of the sphere) 
at a spherical distance r (r < 90°) from a point fixed on the surface M (Am, Ym) (Fig. 3.107). The 
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spherical center is denoted by M; r is called the spherical radius of the small circle. 

The plane of the small circle is the base of a spherical cap with an altitude h (see 3.3.4, p. 158). The 
spherical center M is above the center of the small circle in the plane of the small circle. In the plane 
the circle has the planar radius of the small circle ro (Fig. 3.108). Hence, parallel circles are special 
small circles with ymy = 15905. 

W For r — 90° the small circle tends to an orthodrome. 


Sp 


small circle plane 


Figure 3.107 Figure 3.108 


2. Equations of Small Circles As defining parameters either M and r can be used or the small 
circle point Py(An, pn) nearest the north pole and r. If the running point on the small circle is Q(A, v), 
one gets the equation of the small circle with the cosine law for sides corresponding to Fig. 3.107: 


cosr = sin sin yy + cos y cos Ym cos(A — Ay). (3.213a) 
From here, because of oy = YN — r and Ag = An, 

cosr = sinysin(yy — r) + cos y cos(py — τ) cos(A — Ax). (3.213b) 
ΒΑ: For qj = 90° the parallel circles are obtained from (3.213a), since cosr = sing = sin(90° — 
T) — sing > y = const. 
W B: For r — 90° follow orthodromes from (3.213b). 
3. Arclength The arclength s between two points A(A4, ρα) and B(Ag, op) on a small circle k can 
2πτρ 


^ 360 


9 I2 
cos d = cos? r + sin? r cos o 


be calculated corresponding to Fig. 3.108 from the equalities E 
and rg = Rsinr: 7 
cosd—cos*r nR 

sin? r 180° ` 
"i r — 90? the small circle becomes an orthodrome, and from (3.214) and (3.207b) it follows that 
4. Course Angle According to Fig. 3.109 the orthodrome passing through A(A4, pa) and M (Ay, 


(pm) intersects perpendicularly the small circle with radius r. For the course angle aou, of the ortho- 
drome after (3.208) 


(3.214) 


S = sin r arccos 


COS p4 COS Ym sin(Àm — λα) (3.215a) 


Qorth = arctan - - 
sin Ym — singacosrT 


is valid. So, one gets for the required course angle αλ of the small circle at the point A: 
aa = (|Aortn| — 90°) sign(aortn)- (3.215b) 


5. Intersection Points with a Parallel Circle For the geographical longitude of the intersection 
points Χι(λχ,, ex) and X3(Ax,, Yx) of the small circle with the parallel circle o = yx follows from 


178 3. Geometry 


Figure 3.109 Figure 3.110 
(3.213a): 


cosr — si sin 
Ax, = Àm F arccos ΕΚΕ (v = 1,2). (3.216) 
COS (px COS YM 


Remark: In some cases an angle reduction is needed according to (3.205). 

6. Tangential Point The small circle is touched by two meridians, the tangential meridians, at the 
tangential points Ti(Xr,, pr) and To(Ar,, pr) (Fig. 3.110). Because for them the argument of the arc 
cosine in (3.216) must be extremal with respect to the variable yx, holds: 

cosr — sin gx sin Ym 


ἜΜ. (3.217a) Àr, = Am F arccos (v = 1,2). (3.217b) 
Sr 


. Si 
¢ = arcsin 
ΜΠ cos cos Y yx COS YM 


Remark: In certain cases an angle reduction is needed according to (3.205). 

7. Intersection Points with a Meridian The calculation of the geographical latitudes of the in- 
tersection points Yi (Ay, yy,) and Yo(Ay, Yy) of the small circle with meridian À = Ay can be done 
according to (3.213a) with the equations 


C+ BVA? + B2 — C2 


(v =1,2), (3.218a) 


y, = arcsin 


ΑΣ + B? 
where the following notations are used: 
A — singy, B = cos yyy cos(Ay — àm), C = — cosr. (3.218b) 


For A? + B? > C?, in general, there are two different solutions, from which one is missing if a pole is 
on the small circle. 

If A? + B? = C? holds and there is no pole on the small circle, then the meridian touches the small 
circle at a tangential point with geographical latitude yy, = Yy, = vr. 


3. Loxodrome 

1. Notion A spherical curve, intersecting all meridians with the 
same course angle, is called a loxodrome or spherical helix. So, par- 
allels (a = 90°) and meridians (a = 0°) are special loxodromes. 

2. Equation of the Loxodrome Fig. 3.111 shows a loxodrome 
with course angle a through the running point Q(A, p) and the in- 
finitesimally close point P(A+dA, p+d). The right-angled spher- 
ical triangle QC P can be considered as a plane triangle because of 
Figure 3.111 its small size. Then: 


tana = aes dà = tan ody : (3.219a) 
Rdy COS (p 


3.4 Spherical Trigonometry 179 


Considering that the loxodrome must go through the point A(A4, ya), therefore, the equation of the 
loxodrome follows by integration: 


tan e t 5) 1805 
tan (45° + 24) 2 
2 
In particular if A is the intersection point Pg(Ag, 0°) of the loxodrome with the equator, then: 
QN 180° 
3 
Remark: The calculation of Ag can be done with (3.224). 
3. Arclength From Fig. 3.111 one can find the differential relation 
Rdy Rdy 


cosa = — ds = aaa (3.220a) 


A — àa = tanaln (a # 90°). (3.219b) 


(a # 90°). (3.219¢) 


A — Ap = tana ln tan e H 


Integration with respect to ọ results in the arclength s of the arc segment with the endpoints A(A4, Φα) 
and B(Ag, op): 


"m leg — αἱ TR 

cosa 180° 

If A is the starting point and B is the endpoint, then from the given values A, a and s can be determined 
step-by-step first Yg from (3.220b), then Ag from (3.219b). 

Approximation Formulas: According to Fig. 3.111, with Ω = A and P = B one can get an 


approximation for the arclength | with the arithmetical mean of the geographical latitudes with (3.221a) 
and (3.221b): 


(a # 90°). (3.220b) 


pg esa NES cos £47 5 B 
sina = ες IN onis l Ap 0 o. η. 
ος [ ie 19421) sma ο πο lh) 


4. Course Angle For the course angle a of the loxodrome through the points Α(λα, ρα) and 
Π(λη.ρῃ). or through A(A4, pa) and its equator intersection point Pg(Ag, 0°) according to (3.219b) 
and (3.219c) the following holds: 

(A B= λα) . T 
tan (us + £2) 180° 


tan (ο: + ga) 


,(8.222a) 


a = arctan 


(Aa = àe) 2T 
Intan (ο: a 2a) 180° 


Q = arctan 


In (3.222b) 


5. Intersection Point with a Parallel Circle Suppose a loxodrome passes through the point 
Α(λα, pa) with a course angle a. The intersection point X (Ax, yx) of the loxodrome with a paral- 
lel circle o = yx is calculated from (3.219b): 


οι, PX 
tan (45 aj £x) 180° 
tan (ο: + ga) * 
2 
With (3.223) the intersection point with the equator Pg(Ag, 0°) is calculated as 


180° 
λε = àa — tana: Intan (s + ga) . E 


Ax = àa tana ln (a # 90°). (3.223) 


(a £ 90°). (3.224) 


π 
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Remark: In certain cases an angle reduction is needed according to 
(3.205). 

6. Intersection Point with a Meridian Loxodromes - except par- 
allel circles and meridians — wind in a spiral form around the pole 
(Fig. 3.112). The infinitely many intersection points Y,(Ay, Yy, ) 
(v € Z) of the loxodrome passing through A(A4, ya) with course angle 
a with the meridian À = Ay can be calculated from (3.219b): 


Ay — Aa v: 360° OT 
tana 180° 


yy, = 2arctan [ον | 


ΠῚ e " z -9» (νετ). (3.225) 


Figure 3.112 


If A is the equator intersection point Pg(Ag, 0?) of the loxodrome, then simply holds 
Ay —Ag-c v:360 π 
tana 180° 


py, = 2 arctan exp | | 90° (v € 2). (3.226) 
4. Intersection Points of Spherical Curves 


1. Intersection Points of Two Orthodromes Suppose the considered orthodromes have points 
PN, (An,, 9n,) and Py, (Anz, Yno) closest to the north pole, where Py, # Pw, holds. Substituting the 
intersection point S(As, ys) in both orthodrome equations gives the system of equations: 


tan yy, cos(Ag — λνι) = tan ys, (3.227a) tan yy, cos(Ag — AN,) = tan ys. (3.227b) 
Elimination of ys and using the addition law for the cosine function yields: 


tan yn, cos Ay, — tan pn, COS AN, 


tan Às = (3.228) 


tan py, sin An, — tan yy, sin An, 
The equation (3.228) has two solutions As, and Ag, in the domain —180° < A € 180° of the geographical 
longitude. The corresponding geographical latitudes can be got from (3.227a): 

ps, = arctan[tan yy, cos(As, — An, )} (v = 1,2). (3.229) 
The intersection points Sı and S» are antipodal points, i.e., they are the mirror images of each other 
with respect to the centre of the sphere. 
2. Intersection Points of Two Loxodromes Suppose the considered loxodromes have equator in- 
tersection points Pg, (Ag, , 0?) and Pg, (Ag,, 0?) and the course angles αι und a» (αι ¥ αὐ). Substituting 
the intersection point S(As, ys) in both loxodrome equations gives the system of equations: 


180° 
As — Ag, = tana, -Intan e | 55) E (αι # 90°), (3.230a 
180* 
As — An, = tana: ln tan e £s) . (αυ # 90°). (3.230b 
π 


Elimination of As and expressing ys gives an equation with infinitely many solutions: 
λει = AE, +v- 360° π 


tan as — tana, 180° 


Ps, = 2 arctan exp | | 90° (v € 2). (3.231 


The corresponding geographical longitudes As, can be found by substituting qs, in (3.2302): 
Ps, 180° 
es) . 


(αι 90), (ν εἴ). (3.232 
2 π 


Remark: In certain cases an angle reduction is needed according to (3.205). 


As, = Ag, + tana; Intan e } 
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3.5 Vector Algebra and Analytical Geometry 


3.5.1 Vector Algebra 
3.5.1.1 Definition of Vectors 


1. Scalars and Vectors 

Quantities whose values are real numbers are called scalars. Examples are mass, temperature, energy, 
and work (for scalar invariant see 3.5.1.5, p. 185, 1., p. 214 and 4.3.5.2, p. 288). 

Quantities which can be completely described by a magnitude and by a direction in space are called 
vectors. Examples are power, velocity, acceleration, angular velocity, angular acceleration, and electri- 
cal and magnetic force. We represent vectors by directed line segments in space. 

In this book the vectors of three-dimensional Euclidean space are denoted by a, and in matrix theory 
by a (see also 4.1.3, p. 271). 

2. Polar and Axial Vectors 

Polar vectors represent quantities with magnitude and direction in space, such as speed and acceler- 
ation; axial vectors represent quantities with magnitude, direction in space, and direction of rotation, 
such as angular velocity and angular acceleration. In notation they are distinguished by a polar or by 
an axial arrow (Fig. 3.113). In mathematical discussion they are treated in the same way. 

3. Magnitude or Absolute Value and Direction in Space 

For the quantitative description of vectors à or a, as line segments between the initial and endpoint A 
and B resp., are used the magnitude, i.e., the absolute value |a], the length of the line segment, and the 
direction in space, which is given by a set of angles. 


4. Equality of Vectors 
Two vectors & and b are equal if their magnitudes are the same, and they have the same direction, i.e., 
if they are parallel and oriented identically. 
Opposite and equal vectors are of the same magnitude, but oppositely directed: 
— — — — 
AB=a, BA--a but |AB| = |BA]. (3.233) 


Axial vectors have opposite and equal directions of rotation in this case. 


B B zA Pxyz) C 
D € 
a I z d 
b p 
a)'A b)/A ο) ΖΑ 8 
Figure 3.113 Figure 3.114 Figure 3.115 


5. Free Vectors, Bound or Fixed Vectors, Sliding Vectors 

A free vector is considered to be the same, i.e. its properties do not change, if it is translated parallel to 
itself, so its initial point can be an arbitrary point of the space. If the properties of a vector belong to 
a certain initial point, it is called a bound or fixed vector. A sliding vector can be translated only along 
the line it is already in. 

6. Special Vectors 

a) Unit Vector a? = 6 isa vector with length or absolute value equal to 1. With it the vector a can 
be expressed as a product of the magnitude and of a unit vector having the same direction as a: 


ᾱ-- ela]. (3.234) 


The unit vectors i, j, k or €;, €j, δι (Fig. 3.114) are often used to denote the three coordinate axes in 
the direction of increasing coordinate values. 
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In Fig. 3.114 the directions given by the three unit vectors form an orthogonal triple. These uni 
vectors define an orthogonal coordinate system because for their scalar products 


eje; = ee, = eje = 0 (3.235 
is valid. Furthermore also 
eje; = eje; = exe = 1, (3.236 


holds, i.e. it is an orthonormal coordinate system. (For more about scalar product see (3.248).) 
b) Null Vector 0 or zero vector is the vector whose magnitude is equal to 0, i.e., its initial and endpoin 
coincide, and its direction is not defined. 


= 
c) Radius Vector r or position vector of a point P is the vector OP with the initial point at the origin 
and endpoint at P (Fig. 3.114). In this case the origin is also called a pole or polar point. The poin 
P is defined uniquely by its radius vector. 

d) Collinear Vectors are parallel to the same line. 

e) Coplanar Vectors are parallel to the same plane. They satisfy the equality (3.260). 


3.5.1.2 Calculation Rules for Vectors 


1. Sum of Vectors 


— — = 
a) The Sum of Two Vectors AB = ἃ and AD = b can be represented also as the diagonal of the 


E 
parallelogram ABC D, as the vector AC = c in Fig. 3.115b. The most important properties of the 
sum of two vectors are the commutative law and the triangle inequality: 


a+b=b+a, |a+b|<|al+|bl. (3.237a) 


— E — 
b) The Sum of Several Vectors à, b,c,...,& is the vector f = AF, which closes the broken line 
composed of the vectors from a to € as in Fig. 3.115a. For n vectors a; (i = 1,2,...,n) holds: 


Yü- f. (3.23 Tb 
i=1 


Important properties of the sum of several vectors are the commutative law and the associative law ο 
addition. For three vectors holds: 


a+b+¢=G+b+a,  (&4-b)46—&-4 (ba c). (3.237¢ 
c) The Difference of Two Vectors à — b can be considered as the sum of the vectors à und —b, i.e., 
a—b=a+(-b)=d (3.2374 


which is the other diagonal of the parallelogram (Fig. 3.115b). The most important properties of the 
difference of two vectors are: 


a—a=0 (null vector) |a—b|>|\a|—|b]]. (3.237e 


Ao AA 


Figure 3.116 
2. Multiplication of a Vector by a Scalar, Linear Combination 
The products aa and aa are equal to each other and they are parallel (collinear) to a. The length 
(absolute value) of the product vector is equal to |a||à|. For a > 0 the product vector has the same 
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direction as a; for a < 0 it has the opposite one. The most important properties of the product of 
vectors by scalars are: 


αὔ-ᾶα, αβᾶ-βαᾶ, (αβ)ᾶ--  αᾶ- Bü, a(€+b)=aa+ab. (3.238a) 
The linear combination of the vectors a, b, C ei _d with the scalars a, B,...,ó is the vector 
k=od4+6b+-:-+6d. (3.238b) 


3. Decomposition of Vectors 

In three-dimensional space every vector à can be decomposed uniquely into a sum of three vectors, 

which are parallel to the three given non-coplanar vectors üi, V, w (Fig. 3.116a,b): 
a=ati+fv+ yw. (3.239a) 

The summands au, 8V and yw are called the components of the decomposition, the scalar factors a, 

B and y are the coefficients. When all the vectors are parallel to a plane one can write 

a=at+fv (3.239b) 
with two non-collinear vectors u and v being parallel to the same plane (Fig. 3.116c,d). 
3.5.1.3 Coordinates of a Vector 

E 
1. Cartesian Coordinates According to (3.239a) every vector AB — 
a can be decomposed uniquely into a sum of vectors parallel to the basis 
vectors of the coordinate system i,j, K or δι, €j, 8: 

α ari + αμ] + a.k = a,€; + a6; + α-θι, (3.240a) 
where the scalars az, a, and a, are the Cartesian coordinates of the vector 
a in the system with the unit vectors &;, €; and δι. Also is written 

A= {az,ay,az} or las; Oyz): (3.240b) 
The three directions defined by the unit vectors form an orthogonal direc- 


tion triple. The components of a vector are the projections of this vector 
Figure 3.117 on the coordinate axes (Fig. 3.117). 


The coordinates of a linear combination of several vectors are the same linear combination of the co- 
ordinates of these vectors, so the vector equation (3.238b) corresponds to the following coordinate 
equations: 
= Qar + Bb, +: +de, 
ky = aay + Bb, ἡ i 
k, = aaz + Bbz +- -δά.. 
For the coordinates of the sum and of the difference of two vectors 
é=atb (3.242a 
the equalities 


> 
3 
| 


(3.241 


Gp = Ay D ο αι E by; C= a, 0; (3.242b 
are valid. The radius vector r of the point P(x, y, 2) has the Cartesian coordinates of this point: 


=o, Wu duum ri+yj+zk. (3.243 


2. Affine Coordinates are a generalization of Cartesian coordinates with respect to a system o 
linearly independent but not necessarily orthogonal vectors, i.e., to three non-coplanar basis vectors 
E1, €2,63. The coefficients are a!,a?,a?, where the upper indices are not exponents. Similarly to 


(3.240a,b) for a holds 
a=a'é,+a°&+0°@, (9.244) or a= {a', a, a’ or a (a, a’, a?) i (3.244b) 


This notation is especially suitable as the scalars a, a”, αὖ are the contravariant coordinates of a vector 
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(see 3.5.1.8, p. 188). For δι = i, & = J, & = k the formulas (3.244a,b) become (3.240a,c). For the 
linear combination of vectors (3.238b) just as for the sum and difference of two vectors (3.242a,b) in 


analogy to (3.241) the same coordinate equations are valid: 
k! = aa! - βθὶ + +d, 
k? = aa? + Bl 4 sd, (3.245) 
k? = aa +8 ++. +d; 
οἳ -- αἲ -Ε δν, ας --αἳ b, σ -- αἳ +B. (3.246) 


3.5.1.4 Directional Coefficient 


The directional coefficient of a vector a along a vector b is the scalar product 
ay = &b? = |&| cosy, (3.247) 
x b 4 5 E 
where b? — 5 is the unit vector in the direction of b and ᾧ is the angle between a and b. 
The directional coefficient represents the projection of a on b. 


E [nthe Cartesian coordinate system the directional coefficients of the vector a along the x, y, z axes are 
the coordinates az, a,, az. This statement is usually not true in a non-orthonormal coordinate system. 


3.5.1.5 Scalar Product and Vector Product 
1. Scalar product 


The scalar product or dot product of two vectorsaand — , 
b is defined by the equation b e| E, 
a-b = ab = (ab) = |a| |b | cosy, (3.248) = 
a a 


where y is the angle between a and b considering them 

with a common initial point (Fig. 3.118). The value 

of a scalar product is a scalar. Figure 3.118 Figure 3.119 
2. Vector Product 
or cross product of the two vectors a and b is a vector € such that it is perpendicular to the vectors ἃ and 
b, and in the order a, b, and € the vectors form a right-hand system (Fig. 3.119): If the vectors have 
the same initial point, then looking at the plane of a and b from the endpoint of €, the shortest rotation 


of & in the direction of b is counterclockwise. The vectors a, b, and € have the same arrangement as the 
thumb, the forefinger, and the middle finger of the right hand. Therefore this is called the right-hand 
rule. The vector product (3.249a) has the magnitude (3.249b) 


üxb-[&üb]-€, (3.2492) Ισ] = |a| |b] sing, (3.249b) 


where y is the angle between a and b. Numerically the length of ¢ is equal to the area of the parallelo- 
gram defined by the vectors à and b. 
3. Properties of the Products of Vectors 
a) The Scalar Product is commutative: 

ab=ba. (3.250) 
b) The Vector Product is anti commutative (changes its sign by interchanging the factors): 

ax b=—(b x a). (3.251) 
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c) Multiplication by a Scalar Scalars can be factored out: 


o(&b) = (aa) b, (3.252a) a(a x b) = (aa) x b. (3.252b 
d) Associativity The scalar and vector products are not associative: 

a(bé) z (ab)é, (3.253a) ax (bx @) 4 (axb) x@ (3.253b 
e) Distributivity The scalar and vector products are distributive over addition: 

a(b+@) =ab+aé, (3.254a 

ax (b+é@)=axb+axé and (b+é)xa=bxa+exa (3.254b 
f) Orthogonality of Two Vectors Two vectors are perpendicular to each other (a L b) if the equality 

āb=0 holds, and neither & nor b are null vectors. (3.255 


g) Collinearity of Two Vectors Two vectors are collinear ( || B) if the equality 


àxb-6 holds, and neither à nor b are null vectors. (3.256 
h) Multiplication of the same vectors: 
áü—-a-a, ἄχᾶ--ῦ. (3.257 


i) Linear Combinations of Vectors can be multiplied in the same way as scalar polynomials (be- 
cause of the distributive property), only one must be careful with the vector product. If interchanging 
the factors, also the signs are to be changed. 


Bl A: ( 3a +5b — 2c) (a — 2b — 40) = 38? + 5ba — 268 — GAD — 10b? + 4cb — 1286 — 20be + 8c? 
= 3a? — 10b? + 8c? — ab — 1486 — 16be. 
B B: ( 38 +55 — 2€) x (à—2b — 46) = 3à x ã + 5b x & 26x & — 68 x b — 10b x b 
+ 46 xb— 128 x € — 20b x €+ 86x €= 0 — 58 x b +28 x €— 68 x b -0— 4b x € 
— 128 x€ 20b x € +0 = —118 x b — 10a x € — 24b x € = 11b x A+ 10€ x A+ 248 x b. 


j) Scalar Invariant is a scalar quantity if it does not change its value under a translation or a rotation 
of the coordinate system. The scalar product of two vectors is a scalar invariant. 


W A: The coordinates of a vector ἃ = (a1, a2, az} are not scalar invariants, because in different coor- 
dinate systems they can have different values. 

W B: The length of a vector ais a scalar invariant, because it has the same value in different coordinate 
systems. 

W C: Since the scalar product of two vectors is a scalar invariant, the scalar product of a vector by 
itself is also a scalar invariant, i.e., 2a = |a|? cosy = |a|?, because y = 0. 


3.5.1.6 Combination of Vector Products 
1. Double Vector Product 


The double vector product & x (b x @) results in a vector coplanar to b and c: 
&x (bx @) = b(ac) — &(&b). (3.258) 
2. Mixed Product 


The mixed product (a x b) C, which is also called the triple product, results in a scalar whose absolute 
value is numerically equal to the volume of the parallelepipedon defined by the three vectors; the result 


is positive if a, b, and c form a right-hand system, negative otherwise. Parentheses and crosses can be 
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omitted: 

(üxb)é—a(bxc)-—abc-bca-cab- —acb = —bac— -čb ā. (3.259) 
The interchange of any two terms results in a change of sign; the cyclic permutation of all three terms 
does not affect the result. E 
For coplanar vectors, i.e., if a is parallel to the plane defined by b and 6, holds: 


a(b x ο) =0. (3.260) 
3. Formulas for Multiple Products 
a) Lagrange Identity: (a x b)(é x d) = (àc) (bd) — (bc) (ad), (3.261) 
. ad af ag 
b) abé-éfg=|be bf bgl. (3.262) 
cé cf cg 


4. Formulas for Products in Cartesian Coordinates 
If the vectors a, b, c are given by Cartesian coordinates as 


ï= {asaya}, B= {br byb}, C= (ecc), (3.263) 
the calculation of the products can be made by the following formulas: 
1. Scalar Product: ab= Azby + ab, + azbz. (3.264) 


2. Vector Product: ἄχ b = (ajb, — azby) Ï + (azby — a,b.) j + (ανν — ab.) k 
ijk 

= |a, ay azl- (3.265) 
be by b. 


αν Ay Az 
b, by bz}. 


£x Cy Cz 


3. Mixed Product: abé= (3.266) 


5. Formulas for Products in Affine Coordinates 
1. Metric Coefficients and Reciprocal System of Vectors If the affine coordinates of two vec- 


tors & and b in the system of ej, 65, 63 are given, i.e., 
Āā = a! ë, Lada, D= bē +b? 8 JU 8, (3.267 
and there is to calculate the scalar product 
ab = a'b! δι δι + a? b? δ) δν + a? b? δ) δ, 
+ (a'b? +a? 03) & & + (a? b +a? b?) 6, δι (aU +a’ 0) és (3.368 
or the vector product 
ax b = (a? b — a? b’) & x δι + (αὐ δὶ — at t?) & x δι + (a'b? — a? b) & xe,  (3.269a 


with the equalities 


6, x δι = & x & = & x 6, — 0, (3.269b 
then the pairwise products of the coordinate vectors must be known. For the scalar product these are 
the six metric coefficients (numbers) 

gu — ὄιδι, 922 = 626, 933 = G3 63, 

gi = 6, €) = ὅ,δι, g23 = 6564 = 616, σε = διδι = δι δι (3.270) 
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and for the vector product the three vectors 
E! = N (E X E), E= N (EXE), E? = N (8 x&), 


which are the three reciprocal vectors with respect to δι, 2, δη, where the coefficient 


1 
61 65 &3 
is the reciprocal value of the mixed product of the coordinate vectors. This notation serves only as 
a shorter way of writing in the following discussion. Calculations with the coefficients will be easy to 
perform with the help of the multiplication Tables 3.13 and 3.14 for the basis vectors . 


(3.271a) 


2-2 


Table 3.13 Scalar product Table 3.14 Vector product 
of basis vectors of basis vectors 
Multipliers 
| jalele ΚΚ 
= a3 a2 
| & | gu | στα | gis 3S 6 0 μυς 
| - B|1 Q Q 
2 | G21 | 922 | 923 A EE al 
7 5! & | —-—] 0 — 
e3 | 931 | 932 | gas 5 2 Q 
A n e? el 
(gui = gir) “δ mee 0 
Table 3.15 Scalar product Table 3.16 Vector product 
of reciprocal basis vectors of reciprocal basis vectors 
Multipliers 
| iJj k P[j k 
=> ul > > > 
| ililolo $ ilolk!|-j 
8 
> Q > — > 
| j/o]1]0 =| j |-k| o | i 
| k|olol|1 e| k|ljl-iio 
E 


2. Application to Cartesian Coordinates The Cartesian coordinates are a special case of affine 
coordinates. From Tables 3.15 and 3.16 for the basis vectors holds 

&-i &-j &=k (3.272a) 
with the metric coefficients 


1 
gii =92=93=1, ιο 008 =931=0, Q SE 1, (3.272b) 
1] 


and the reciprocal basis vectors 

ē! =i, ej e-k (3.272c) 
So the basis vectors coincide with the reciprocal basis vectors of the coordinate system, or, in other 
words, in the Cartesian coordinate system the basis vector system is its own reciprocal system. 
3. Scalar Product of Vectors Given by Coordinates 


3 
Ab = Y^ E Ginna b" = gaga® D. (3.273) 
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For Cartesian coordinates (3.273) coincides with (3.264). 

After the second equality in (3.273) a shorter notation for the sum was applied which is often used in 
tensor calculations (see 4.3.1, 2., p. 280): instead of the complete sum one writes only a characteristic 
term so that the sum should be calculated for repeated indices, i.e., for the indices appearing once down 
and once up. Sometimes the summation indices are denoted by Greek letters; here they have the values 
from 1 until 3. Consequently holds 


Japa” b? = guat b! + giza b? + giga! b? + gua b! + gona? b? + go3a? b? 
H ga1a^ b! + gaoa? b? + gasa? D. (3.274) 
4. Vector Product of Vectors Given by Coordinates In accordance with (3.2692) 


δι 66 al a? a? 


bi b? b3 
= é& δν δι |(a? b? — a? b?)é! + (a? b' — a! b?) &? + (αἱ b? — a? 0)e?] (3.275) 


axb 


is valid. For Cartesian coordinates (3.275) coincides with (3.265). 


5. Mixed Product of Vectors Given by Coordinates In accordance with (3.269a) holds 
Abé = δι ὄ) δι np. (3.276) 


For Cartesian coordinates (3.276) coincides with (3.266). 
3.5.1.7 Vector Equations 


Table 3.17 contains a summary of the simplest vector equations. In this table a, b, c are given vectors, 
X is the unknown vector, a, 3, y are given scalars, and x, y, z are the unknown scalars to be calculated. 


3.5.1.8 Covariant and Contravariant Coordinates of a Vector 


1. Definitions The affine coordinates αἱ, a?, a? of a vector a in a system with basis vectors 6ι, &, 
é€;, defined by the formula 


ï = αἱ δι +a? & +a? δι = a^ Ea (3.277) 


are also called contravariant coordinates of this vector. The covariant coordinates are the coefficients 
in the decomposition with the basis vectors e!, €?, €, i.e., with the reciprocal basis vectors of e, e», 
é;. With the covariant coordinates a1, a2, a3 of the vector à 


a = më! a6? + mE’ = a, é*. (3.278) 
In the Cartesian coordinate system the covariant and contravariant coordinates of a vector coincide. 
2. Representation of Coordinates with Scalar Product 
The covariant coordinates of a vector ἃ are equal to the scalar product of this vector with the corre- 
sponding basis vectors of the coordinate system: 


αι - ἄδι. ας 865, ας R6. (3.279) 


The contravariant coordinates of a vector à are equal to the scalar product of this vector with the 
corresponding basis vectors: 
a —ad!, a?—ad?, a? = 86%. (3.280) 
3 


In Cartesian coordinates (3.279) and (3.280) are coincident: 


a,—&j, a,—àk. (3.281) 


Ai, 


ay 
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Table 3.17 Vector equations 


X unknown vector; à, b, C, d given vectors; zr, y, z unknown scalars; a, 8 , y given scalars 


Equation Solution 
1. X-a-b X=b-a 
er , a 
2. aX —a Χ-- 
α 
3. Xa=a Indeterminate equation; considering all vectors X satisfying 


the equation, with the same initial point, then the endpoints 
form a plane perpendicular to the vector à. Equation 3. is 
called the vector equation of this plane. 

4. xà—b (b 1 a) Indeterminate equation; considering all vectors X satisfying 
the equation, with the same initial point, then the endpoints 
form a line parallel to ἃ. Equation 4. is called the vector 
equation of this line. 


5 Xaed , aat+axb bt 
‘\exa=b (bia |*^ @ ΠΡ 
χΧᾶ--α EI AL AM = E - 
6. 1xb-g 4, a(b x SIS USED IMS b) _ "o T3 
Xé-5 | abc 
where ᾱ, b, Gare the reciprocal vectors of a, b, € (see 3.5.1.6, 
1., p. 186). . u 
7 d=ra+yb+zé quu TEL CC E 
z i übc abe abe 
8. d= z(b xc) da db dc 
ο. a ας ----, 2--- 
ty (é x 8) + z (8 x b) abc " abe abé 


3. Representation of the Scalar Product in Coordinates 
The determination of the scalar product of two vectors by their contravariant coordinates yields the 
formula (3.273). The corresponding formula for covariant coordinates is: 

ab = g^? ag b, (3.282) 
where g”” = €™ 6” are the metric coefficients in the system with the reciprocal vectors. Their relation 
with the coefficients gmn is 
(-1)™"" Amn 
Gu 912 913 
921 922 goa 
931 932 933 
where A™” is the subdeterminant of the determinant in the denominator obtained by deleting the row 
and column of the element gmn- 
If the vector a is given by covariant coordinates, and the vector b by contravariant coordinates, then 
their scalar product is 

ab=a! by + a? by + a? bz = a? by (3.284) 
and analogously holds 

Ab = a,b". (3.285) 


mn 
Q = 


(3.283) 
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3.5.1.9 Geometric Applications of Vector Algebra 

Table 3.18 demonstrates some geometric applications of vector algebra. Other applications from ana- 
lytic geometry, such as vector equations of the plane and of the line, are demonstrated in 3.5.1.7, p. 189 
and 3.5.3.10, p. 218ff. and on the subsequent pages. 


Table 3.18 Geometric application of vector algebra 


Vector Formula with coordinates 


Determination : : . 
formula (in Cartesian coordinates) 


Length of the vector à αξ να» 


Area of the parallelo- 


gram determined by 3 i 
> vectors ἃ e b =laxb = Qy Az az Ay Ay Ay 
the vectors a and b 5 la x b| S l by b, + b. bs + b, by 
Volume of the 
parallelepiped 
determined by the P Ag Ay Az 
vectors a, b, c = ἄρε] V — [bs by bs 
ὃς NEA 
Angle vp the ab πο... 
vectors ἃ anc cos p = cos — —— ς a ρα : 7 
Vb? (@t+a +a? b2 +b? +b? 


3.5.2 Analytical Geometry ofthe Plane 
3.5.2.1 Basic Concepts, Coordinate Systems in the Plane 


The position of every point P of a plane can be given by an arbitrary coordinate system. The numbers 
determining the position of the point are called coordinates. Mostly Cartesian coordinates and polar 
coordinates are in use. 


1. Cartesian or Descartes Coordinates 

The Cartesian coordinates of a point P are the signed distances of this point, given in a certain measure, 
from two coordinate axes perpendicular to each other (Fig. 3.120). The intersection point 0 of the 
coordinate axes is called the origin. The horizontal coordinate axis, usually the x-axis, is usually called 
the azis of abscissae, the vertical coordinate axis, usually the y-axis, is the axis of ordinates. 


I II II IV 


0 x 


Figure 3.120 Figure 3.121 
The positive direction is given on these axes: on the z-axis usually to the right, on the y-axis upwards. 
The coordinates of a point P are positive or negative according to which half-axis the projections of 
the point fall (Fig. 3.121). The coordinates x and y are called the abscissa and the ordinate of the 
point P, respectively. The point with abscissa a and ordinate b is denoted by P(a, b). The x, y plane is 
divided into four quadrants I, IT, III, and IV by the coordinate axes (Fig. 3.121,a). 
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2. Polar Coordinates 

The polar coordinates of a point P (Fig. 3. 122) are the radius p, i.e., the distance of the point from a 
given point, the pole 0, and the polar angle y, i.e., the angle between ‘the line OP and a given oriented 
half-line passing through the pole, the polar axis. The pole is also called the origin. The polar angle is 
positive if it is measured counterclockwise from the polar axis, otherwise it is negative. 


sb; u-a, 
Ῥ(ριφ) u=a, 
v=b, 
p 
v-b, u=a, 
0 
Figure 3.122 Figure 3.123 


3. Curvilinear Coordinate System 

This system consists of two one-parameter families of curves in the plane, the family of coordinate curves 
(Fig. 3.123). Exactly one curve of both families passes through every point of the plane. They intersect 
each other at this point. The parameters corresponding to this point are its curvilinear coordinates. In 
Fig. 3.123 the point P has curvilinear coordinates u = αι and v = 63. In the Cartesian coordinate 
system the coordinate curves are straight lines parallel to the coordinate axes; in the polar coordinate 
system the coordinate curves are concentric circles with the center at the pole, and half-lines starting 
at the pole. 


Figure 3.124 Figure 3.125 


3.5.2.2 Coordinate Transformations 

Under transformation of a Cartesian coordinate system into another one, the coordinates change ac- 
cording to certain rules. 

1. Parallel Translation of Coordinate Axes 

The axis of the abscissae is shifted by a, and the axis of the ordinates by b (Fig. 3.124). Suppose a 
point P has coordinates x,y before the translation, and it has the coordinates z^, y' after it. The old 
coordinates of the new origin 0' are a, b. The relations between the old and the new coordinates are the 
following: 


πξα-α, y=y +b, (3.286a) xt -—r:—a, y-y-b. (3.286b) 


2. Rotation of Coordinate Axes 
Rotation by an angle y (Fig. 3.125) yields the following changes in the coordinates: 


z’ =xcosp +ysing, ] v —z' cosy —y'sing, 
y’ = —rsinq +y cos p (3.2872) y —a'sing +y' cosy. (3.2870) 
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The coefficient matrix belonging to (3.287a) 


cosp sing quf x x aft ; 
D= : with =D and =D ; (3.287c) 
—sinp cosy y y y y 


is called the rotation matrix. 

In general, the transformation of a cartesian coordinate system into another can be performed in two 
steps, a translation and a rotation of the coordinate axes. 

Remark: With the so-called coordinate transformation considered here the coordinate system is trans- 
formed, but the represented object rests in its position. In contrast to this with a so-called geometric 
transformation the object is transformed, but the coordinate system remains unchanged in its position. 
In 3.5.4, p. 229 the rotation of an object is described by 


ν x 
( J = R( η, (3.288) 
Up UP 
where R is the rotation matrix. Between D and R there exists the relation 


R=D". (3.289) 


P565 ,y;) P 


1 
d / d 
Py / P, 


Ῥι(Χι γι) 1 


/Qy 0) P2 
0 


Figure 3.126 Figure 3.127 Figure 3.128 


x 


3. Transforming Cartesian Coordinates into Polar Coordinates and Conversely 
Supposing the origin coincides with the pole and the axis of abscissae coincides with the polar axis 
(Fig. 3.126), then 

t=ply)cosp y=p(y)sng (ππ«φσπ, ρ320): (3.290a) 


arctan y +a forz <0, 
£ 


arctan 2 forz > 0, 
ορ γα’ Εφ’, (3.290b) p= ; for x = 0 and y > 0, (3.290c) 
T 


= for x = 0 andy < 0, 
undefined for x = y = 0. 


3.5.2.3 Special Notations and Points in the Plane 


1. Distance Between Two Points 


If the two points given in Cartesian coordinates as Pj (x1, y1) and P, (xs, yo) (Fig. 3.127), then their 
distance is 


d= (za — αι)” + (yo — νι). (3.291) 
If they are given in polar coordinates as P, (p1, %1) and P» (p2, p2) (Fig. 3.128), their distance is 


d = pl + pl — 2p pa cos (p2 — ϕι). (3.292) 
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2. Coordinates of Center of Mass 
The coordinates (x,y) of the center of mass of a system of material points M; (x;,y;) with masses 
mi; (i= 1,2,...,n) are calculated by the following formula: 
mici mit 
go y = Σι (3.293) 
Dmi Dmi 


3. Division ofa Line Segment 


PP m 
1. Division in a Given Ratio The coordinates of the point P with division ratio PF =s” a 
2 n 
(Fig. 3.129a) of the line segment P, P, are calculated by the formulas 
nzi mro πι Àxo nyi mys yi Ms 
qu = 3.2942 y= = : 3.294} 
d n+m 1-A ^" ( Ü d n+m 1+A ( : 
For the midpoint M of the segment P, P», because of A = 1 
eat Ξ m (3.294c) ge 5 = (3.2944 


holds. The sign of the segments P,P and PP; can be defined. Their signs are positive or negative 
depending on whether their directions are coincident with P; P, or not. Then formulas (3.294a,b,c,d 


result in a point outside of the segment P, P» in the case À < 0. This is called an external division. 


If P is inside the segment P, P», i 
is called an internal division. One 
defines: 

a)A=0if P=P, 

b) \ = oo if P = P, and 

ΜΝ c) à = —1 if P is an infinite or im- 
proper point of the line g, i.e., if 
e a NU P is infinitely far from P, P> on g. 
The shape of A is shown in Fig. 


yA A^ 


EE μας et NE 3.129b. 
W For a point P, for which P; is 
b) the midpoint of the segment P,P, 
PP 
Figure 3.129 A- PE — —2 holds. 


2. Harmonic Division Ifthe internal and external division of a line segment have the same absolute 
value |A|, it is called harmonic division. Denote by P; and P, the points of the internal and external 
division respectively, and by A; and A, the internal and external rates. Then 
P,P, P,P, 
1 A = LL A. (3.295a) Or λι +A, = 0. (3.295b) 
ο PP 


If M denotes the midpoint of the segment P, P, at a distance b from Pj (Fig. 3.130), and the distances 
of P; and P, from M are denoted by x; and xq, then 
box mq Ti b 
= or =—, 
b-a, απ θ b Ge 
The name harmonic division is in connection with the harmonic mean (sce 1.2.5.3, p. 20). In Fig. 3.131 
the harmonic division is represented for \ = 5 : 1, analogously to Fig. 3.14. The harmonic mean r of 


ie, Lita = b. (3.296) 
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the segments Pj P; = p and P,P, = q according to (3.295a) equals in accordance with (1.67b), p. 20, to 


2pq 


=2b (see Fig. 3.132). (3.297) 


3. 


and the whole segment a have 


a) 


Figure 3.132 


Golden Section of a segment a is its division into two parts r and a — 2 such that the part x 


Figure 3.133 


A 


NID 


ο 


che same ratio as the parts a — x and x: 


ae τς (3.298a) 


In this case x is the geometric mean of a and a — x (see also golden 
section in 1.1.1.2, p. 2), and it holds: 


x= γαία- x), (3.298b) x = BUS & 0.618. a. (3.298c) 


The part x of the segment can be geometrically constructed as shown 
in Fig. 3.133a. 

Remark 1: The line segment z is also the length of the side of a 
regular decagon with circum radius a (see also 3.1.5.3, p. 139). 
Remark 2: The following geometrical problem produces also the 
equation of the golden section: Given a rectangle with the constant 
side length a and a variable side a — x. To find is the value 2; so, that 
the area a(a — Ὁ) of the rectangle is equal to the area x? of the square 
(see Fig.3.133b). 


3.5.2.4 Areas 
1. Areaofa Convex Polygon 
If the vertices are given by P (αι, y1), P» (£2, Y2), ---; Pn (En, Yn), then the area is 
1 
S= 5 [(£1 — 22) (yi + y2) + (02 — τα) (Y2 ys) t (En — σι) (Yn )]- (3.299) 


The formulas (3.299) and (3.300) result in a positive area if the vertices are enumerated counterclock- 
wise, otherwise the area is negative. 


2. Areaofa Triangle 


If the vertices are given by Pj (x1, y1), P» (£2, Y2), and P; (3, y3) (Fig. 3.134), then the area can be 
calculated by the formula 
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yA 
P565 Y3) 
{tml ji 
S = 3 |72 yo 1| = z [mi (yo — ys) + v2 (ys — νι) + z3 (yı — νο) 
“| x3 yz 1 ^ 
1 
o = g m πο) (yr + Y2) + (£2 — x3) (yo + ys) + Gra — πι) (ys + y1)] - (3.300) 
10% /Y1) P,O y2) 
| Til 
0 X Three points Pj, P», Pj are on the same line if | x» y» 1| = 0 holds. (3.301) 
z3 y3 1 


Figure 3.134 
3.5.2.5 Equation ofa Curve 


An equation F(x,y) = 0 for the coordinates x and y often corresponds to a curve, which has the prop- 
erty that every of its points P satisfies the equation, and conversely, every point whose coordinates 
satisfy the equation is on the curve. The set of these points is also called the geometric locus or simply 
locus. If there is no real point in the plane satisfying the equation F(x,y) = 0, then there is no real 
curve, and one talks about an imaginary curve: 

MA: z? +y? 11-60, 

HB: y—ln (a a? — cosh z) 0. 

The curve corresponding to the equality F(x,y) = 0 is called an algebraic curve if F(x,y) is a poly- 
nomial, and the degree of the polynomial is the order or degree of the curve (see 2.3.4, p. 65). If the 
equation of the curve cannot be transformed into the form F(x,y) = 0 with a polynomial expression 
F(x,y), then the curve is called a transcendental curve. 


The equation of a curve can be defined in the same way in any coordinate system. But from now on, in 
this book only the Cartesian coordinate system is used, except when stated otherwise. 


3.5.2.6 Line 


1. Equation of the Line 


Every equation that is linear in the coordinates is the equation of a line, and conversely, the equation 
of every line is a linear equation of the coordinates. 


1. General Equation of Line 

Ar- By+C=0 (A,B,C const). (3.302) 
For A = 0 (Fig. 3.135) the line is parallel to the x-axis, for B = 0 it is parallel to the y-axis, for C = 0 
it passes through the origin. 
2. Equation of the Line with Slope (or Angular Coefficient) Every line that is not parallel to 
the y-axis can be represented by an equation written in the form 

y=kax+b (k,b const). (3.303) 
The quantity k is called the angular coefficient or slope of the line; it is equal to the tangent of the angle 
between the line and the positive direction of the z-axis (Fig. 3.136). The line cuts out the segment b 
from the y-axis. Both the tangent and the value of b can be negative, depending on the position of the 
line. 
3. Equation ofa Line Passing Through a Given Point The equation of aline which goes through 
a given point P, (σι, y1) in a given direction (Fig. 3.137) is 


y—y i —k(r—z,), with k=tand. (3.304) 
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yA 
Ῥιίκι γι) 
0 x 
Figure 3.135 Figure 3.136 Figure 3.137 
4. Equation of a Line Passing Through Two Given Points If two points of the line Pj (21, y1), 


P» (x2, y2) are given (Fig. 3.138), then the equation of the line is 
—1 T-T 
LAE (3.305) 
Y2 — Yı T2 — Tı 
5. Intercept Equation of a Line If a line cuts out the segments a and b from the coordinate axes, 
considering them with sign, the equation of the line is (Fig. 3.139) 


Bi Ye . 
ο... (3.306) 
y 
y 
Pix γι) 
Pa% Υ2) b 
Pix 1) 
> 0 
0 x 0ἱ a x 
Figure 3.138 Figure 3.139 Figure 3.140 


6. Normal Form ofthe Equation of the Line (Hessian Normal Form) With p as the distance 
of the line from the origin, and with a as the angle between the x-axis and the normal of the line passing 
through the origin (Fig. 3.140), with p > 0, and 0 € a < 27, the Hessian normal form is 

x: cosa + ysina — p — 0. (3.307 
The Hessian normal form can be got from the general equation if multiply (3.302) by the normalizing 
factor 

oie 1 

BS [A+ BE 
The sign of jj must be the opposite to that of C in (3.302). 
7. Equation of a Line in Polar Coordinates (Fig. 3.141) With p as the distance of the line from 
the pole (normal segment from the pole to the line), and with a as the angle between the polar axis anc 
the normal to the line passing through the pole, the equation of the line is 


... P l 
ta T (3.309 


(3.308 


2. Distance ofa Point from a Line 
The distance d of a point P, (x1, 1) from a line (Fig. 3.140) can be got by substituting the coordinates 
of the point into the left-hand side of the Hessian normal form (3.307): 


d = xı cosa + yı sina — p. (3.310) 
If P, and the origin are on different sides of the line, yields d > 0, otherwise d < 0. 
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P(X yo) 


0! x 
Figure 3.141 Figure 3.142 Figure 3.143 


3. Intersection Point of Lines 


1. Intersection Point of Two Lines In order to get the coordinates (xo, yo) of the intersection 
point of two lines the system of equations given by the equations is to be solved. If the lines are given 
by the equations 


Aix d Byy+C,=0, Agr + Boy - C9 —0 (3.311a) 
then the solution is 
Bı Οι Οι Αι 
| By Cə C3 A2 i 
Xo A, Bil’ Yo AL Bal (3.311b) 
Ag By A» B 
B σ 
Ir f: B Ξ 0 holds, the lines are parallel. If ER = E = G holds, the lines are coincident. 
2. Pencil of Lines Ifa third line with equation 
Απ + Day T ο =0 (3.312a) 
passes through the intersection point of the first two lines (Fig. 3.142), then the relation 
Αι Bı Οι 
A B3 C9| —0 (3.312b) 
As Βα C3 


must be satisfied. 
The equation 
(Aya + Biy + ΟἹ) + A (Aga + Boy + C2) = 0 (-οο < A < +00) (3.312c) 
describes all the lines passing through the intersection point P (xo, yo) of the two lines (3.311a). By 
(3.312c) a pencil of lines is defined with center Po(xo, yo). If the equations of the first two lines are given 
in normal form, then λ = +1 yields the equations of the bisectrices of the angles at the intersection 
point (Fig. 3.143). 


0 x ol 
a) b) 
Figure 3.144 Figure 3.145 


4. Angle Between Two Lines 
In Fig. 3.144 there are two intersecting lines. If their equations are given in the general form 
Ard Bjy- C4 —0 and Agr 4- Boy + C — 0, (3.313a) 
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then for the angle ϱ holds 


A, Bə — A,B, 
tan = ——— ——-, 9131 
ang Z,A + B,B; (3.313b) 
A, A, + BB: A, Bz — AB 
cos p -— Tm =,  (3.313c) sing 12 721, — (38184) 
V At + Bi A3 + B3 yA? + BP AB + B3 
With the slopes kı and ζω of the intersecting lines holds 
ο — 
tang = Tthky’ (3.313e) 
1+ kik; kg — k 
ο πε. (3.3138) mys Aai (3.313g) 


Jit KÁA EC Jit key + 3 


Here the angle is to be considered into the counterclockwise direction from the first line to the second 
one. 


A B 
For parallel lines (Fig. 3.145a) the equalities μα. T or kı = kg are valid. 
2 2 
For perpendicular (orthogonal) lines (Fig. 3.145b) holds Αι Ay + B, By = 0 or kə = —1/kı . 
yA yA 
P 
> Υο|---[--------- 
Χ 1 
a) b) 0 X x 
Figure 3.146 Figure 3.147 


3.5.2.7 Circle 


1. Definition of the Circle The locus of points at the same given distance from a given point is 
called a circle. The given distance is called the radius and the given point is called the center of the 
circle. 

2. Equation of the Circle in Cartesian Coordinates The equation of the circle in Cartesian 
coordinates when its center is at the origin (Fig. 3.146a) is 


i ry = R. (3.314a 
If the center is at the point C (xo, yo) (Fig. 3.146b), then the equation is 

(a — xo)? + (y — yo)? = ΒΡ. (3.314b 
The general equation of second degree 

ax? + 2bxy + cy? + 2dx + 2ey + f =0 (3.315a 
is the equation of a circle only if b — 0 and a — c. In this case the equation can always be transformed 
into the form 

x+y? - πια + 2ny 4 q — 0. (3.315b 


For the radius and the coordinates of the center of the circle the following equalities hold 
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R= \/m2?+n?-q, (3.316a) t=—-m, yo--n. (3.316b) 


If q > m? + n? holds, the equation defines an imaginary curve, if q = m? + n? the curve has one single 
point P(29, yo). 
3. Parametric Representation of the Circle 

x= xo + Rcost, y= y + Rsint, (3.317) 
where t is the angle between the moving radius and the positive direction of the x-axis (Fig. 3.147). 


I Ρ Yt Psy) 
[2 
Z ' 2 
pA v 
0 ΑΣ Wo 


Figure 3.148 Figure 3.149 Figure 3.150 
4. Equation of the Circle in Polar Coordinates in the general case corresponding to Fig. 3.148: 
P — 2ppo cos (Y — qo) + pd = ΒΡ. (3.3182) 


If the center is on the polar axis and the circle goes through the origin (Fig. 3.149) the equation has 
the form 


p = 2R cos g. (3.318b) 
5. Tangent of a Circle The equation of the tangent of a circle, given by (3.314a) at the point 
P (xo, yo) (Fig. 3.150) has the form 

zzo + yyo = RP. (3.319) 


3.5.2.8 Ellipse 

1. Elements of the Ellipse In Fig. 3.151, AB = 2a is the major aris, CD = 2b is the minor 
axis, A, B, C, D are the vertices, F1, Fy are the foci at a distance c = Va? — b? on both sides from 
the midpoint, e = c/a < 1 is the numerical eccentricity, and p = b?/a is the semifocal chord, i.e., the 
half-length of the chord which is parallel to the minor axis and goes through a focus. 


xy 

directrice 

directrice 
xY 


d >< d 
Figure 3.151 Figure 3.152 


2. Equation of the Ellipse If the coordinate axes and the axes of the ellipse are coincident, the 
equation of the ellipse has the normal form. This equation and the equations in parametric form are 


= +5 =l, (3.320a) 1 -—acost, y=bsint. (3.320b) 


For the equation of the ellipse in polar coordinates see 3.5.2.11, 6., p. 208. 
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3. Definition of the Ellipse, Focal Properties The ellipse is the locus of points for which the sum 
of the distances from two given points, the foci, is a constant, and equal to 2a. These distances, which 
are also called the focal radii of the points of the ellipse, can be expressed as a function of the coordinate 
x from the equalities 


rı = AP =a—ezr, ra= FoP =a+er, rı+r2= 2a. (3.321) 
Also here, and in the following formulas in Cartesian coordinates, it is supposed that the ellipse is given 
in normal form. 


4. Directrices of an Ellipse are lines parallel to the minor axis at distance d = a/e from it 
(Fig. 3.152). Every point P(x,y) of the ellipse satisfies the equalities 
Ti το 
— = =e, 3.322 
A Tie (3.322) 


and this property can also be taken as a definition of the ellipse. 

5. Diameter of the Ellipse The chords passing through the midpoint of the ellipse are called 
diameters of the ellipse. The midpoint of the ellipse is also the midpoint of the diameter (Fig. 3.153). 
The locus of the midpoints of all chords parallel to the same diameter is also a diameter; it is called the 
conjugate diameter of the first one. For k and k’ as slopes of two conjugate diameters the equality 


E 
kk = — (3.323) 
a2 
holds. If 2a; and 2b, are the lengths of two conjugate diameters and a and 8 are the acute angles 
between the diameters and the major axis, where k = — tana and k’ = tan 8 hold, then the Apollonius 
theorem holds in the form 
a,b; sin (a + 8) = ab, a? 44-2? +h’. (3.324) 


Figure 3.153 Figure 3.154 Figure 3.155 


6. Tangent of the Ellipse at the point P(zro, yo) is given by the equation 
TTo , Uyo 
a P 

The normal and tangent lines at a point P of the ellipse (Fig. 3.154) are bisectors of the interior and 

exterior angles of the radii connecting the point P with the foci. The line Ax + By +C = 0 is a tangen 

line of the ellipse if the equation 
Ag + B’b? -. Ο5 --ῃ (3.326 
is satisfied. 

7. Radius of Curvature of the Ellipse (Fig. 3.154) If u denotes the angle between the tangen 

line and the radius vector connecting the point of contact P(xo, yo) with a focus, then the radius o 


-1. (3.325 
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curvature is 


3 


k ος i)" (nr? — p (3.327) 


at | bt ab sin?u ` 

b2 

At the vertices A and B (Fig. 3.151) and at C and D the radii are Πα = Rg = — = p and Rg = 
a 

R= a 

p = hé 

8. Areas of the Ellipse (Fig. 3.155) 

a) Ellipse: 


S —mab. (3.328a 
b) Sector of the Ellipse BOP: c) Segment of the Ellipse PBN: 
" x 
Spor = T arccos Z (3.328b) Sppn = a b arccos PNEU (3.328c 
a 


9. Arc and Perimeter of the Ellipse The arclength between two points A and B of the ellipse 
cannot be calculated in an elementary way as for the parabola, but with an incomplete elliptic integra 
of the second kind E(k, p) (see 8.2.2.2, 2., p. 502). 

The perimeter of the ellipse (see also 8.2.5, 1., 515) can be calculated by a complete elliptic integral o 


the second kind E(e) = E (e. 5) with the numerical eccentricity e = να” — b?/a and with y = 5 (for 


one quadrant of the perimeter), and it is 
π/2 


L = AaE(k, 5) = 4a / V1—k sine with k= e= va? — λα. (3.329a) 
0 


The calculation of L = 4aE(E, 7/2) = 4aE (e) can be performed by the help of the following methods: 
a) Series expansion 


L = 4aE(e) πα] (e e CE ZI (3.329b) 


2 2.4/ 3 2.4.67 5 
A 0M 25 2545 (a — b) 

L= +b) |14 t t t pes ith A= F 3.32 

may b) | 4 ' 64 256 16384 | i (a+b) (3.3290) 

b) Approximate formulas 
64 — 3X4 

Lem |1,5(a-4- b) — vab 3.329d 55 Ἶ — : e 

π [1, (a -ε 6) — Vab] (3.329d) or Lem(acÜ)c 16ο (3.329e) 


c) Using table 21.9, p. 1103 for the complete elliptic integral of the second kind. 
d) Methods of numeric integration to determine the integral in (3.329a). 


W Fora = 1.5, b = 1 one gets the following approximate values for L: according to (3.329e) L = 7.9327, 
according to (3.329e) 7.9327, by the help of table 21.9, p. 1103 L ~ 7.94 (see 8.1.4.3, BE p. 491) and 
by numeric integration the more exact value L ~ 7.932 721 . 


3.5.2.9 Hyperbola 


1. Elements ofthe Hyperbola In Fig. 3.156 AB = 2a is the real axis; A, B are the vertices; 0 the 
midpoint; F and F; are the foci at a distance c > a from the midpoint on the real axis on both sides; 


CD = 2b = 2 c? — a? is the imaginary axis; p = b/a the semifocal chord of the hyperbola, i.e., the 
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half-length of the chord which is perpendicular to the real axis and goes through a focus; e = c/a > 1 
is the numerical eccentricity. 


2. Equation of the Hyperbola The equation of the hy- 
perbola in normal form, i.e., for coincident x and real axes, 


and the equations in parametric form are ee i P(x,y) 
fr 
2 2 

x y 2 

ee (3.330a) 20 x 

x = kacosht, y = bsinht (Coo < t< oo),  (3.330b) 

a π π 
—: y = btant ( 2 ee 3) (3.330c) 

In polar coordinates see 3.5.2.11, 6., p. 208. Figure 3.156 


3. Definition of the Hyperbola, Focal Properties The hyperbola is the locus of points for which 
the difference of the distances from two given points, the foci, is a constant 2a. The points for which 
rı — rz = 2a belong to one branch of the hyperbola (in Fig. 3.156 on the left), the others with ro— rı = 
2a belong to the other branch (in Fig. 3.156 on the right). These distances, also called the focal radii, 
can be calculated from the formulas 


ry =t(ex—a); το (era); rz— r, = £2a, (3.331) 
where the upper sign is valid for the right branch, the lower one for the left branch. Here and in the 


following formulas for hyperbolas in Cartesian coordinates it is supposed that the hyperbola is given 
in normal form. 


y^ d, yA 


di T 


Ν 
P(xyyo) 


xY 


directrice 


à 


Figure 3.157 Figure 3.158 


4. Directrices of the Hyperbola are the lines perpendicular to the real axis at a distance d — a/c 
from the midpoint (Fig. 3.157). Every point of the hyperbola P(x, y) satisfies the equalities 


ry το 
— =. =, 6: 992 
i d e (3.332) 
5. Tangent of the Hyperbola at the point P(xo, yo) is given by the equation 
Tro YYo 
-------ι. i 
= b; (3.333) 


The normal and tangent lines of the hyperbola at the point P (Fig. 3.158) are bisectors of the interior 
and exterior angles between the radii connecting the point P with the foci. The line Ar + By + C = 0 
is a tangent line if the equation 

AM? — BP? — CG? =0 (3.334) 
is satisfied. 
6. Asymptotes of the Hyperbola are the lines (Fig. 3.159) approached infinitely closely by the 
branches of the hyperbola for x — oo. (For the definition of asymptotes see 3.6.1.4, p. 252.) The slopes 
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y 
> > 
0 x x 
Figure 3.159 Figure 3.160 
of the asymptotes are k = + tanó = +b/a. The equations of the asymptotes are 


geo (2) " (3.335 


A tangent is intersected by the asymptotes, and they form a segment of the tangent of the hyperbola, 
i.e., the segment TT, (Fig. 3.159). The midpoint of the segment of the tangent is the point of contac 
P,so TP = Tj P holds. The area of the triangle TOT; between the tangent and the asymptotes for any 
point of contact P is the same, and is 

STOT, = ab. (3.336 
The area of the parallelogram 0F PG, determined by the asymptotes and two lines parallel to the 
asymptotes and passing through the point P, is for any point of contact P 


b 
Sorpa = : (3.337 
7. Conjugate Hyperbolas (Fig. 3.160) have the equations 
2 2 2 2 
wey y α 
Bm 1 and πα” 1 (3.338 


where the second is represented in Fig. 3.160 by the dotted line. They have the same asymptotes, 
hence the real axis of one of them is the imaginary axis of the other one and conversely. 


Figure 3.161 Figure 3.162 


8. Diameters of the Hyperbola (Fig. 3.161) are the chords between the two branches of the 
hyperbola passing through the midpoint, which is their midpoint too. Two diameters with slopes k and 
k' are called conjugate if one of them belongs to a hyperbola and the other one belongs to its conjugate, 
and kk’ = b?/a? holds. The midpoints of the chords parallel to a diameter are on its conjugate diameter 
(Fig. 3.161). From two conjugate diameters the one with |k| < b/a intersects the hyperbola. If the 
lengths of two conjugate diameters are 2a, and 2b,, and the acute angles between the diameters and 
the real axis are a and β < a, then the equalities 


a?— δ} =a°— δ), ab — ab, sin(a— ϱ) (3.339) 
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are valid. 
9. Radius of Curvature of the Hyperbola At the point P(xo, yo) the radius of curvature of the 
hyperbola is 


Ρ ον 3/2 4/9 
ος | i) ^ nm? p (3.3402) 


at bt ab sin?u’ 
where u is the angle between the tangent and the radius vector connecting the point of contact with a 
focus. At the vertices A and B (Fig. 3.156) the radius of curvature is 
b2 
Ra = Re=p=—. (3.340b) 
a 
10. Areas in the Hyperbola (Fig. 3.162) 
a) Segment APN: 


Sapn = xy — abln (S | 3 = xy — ab Arcosh -. (3.341a) 
α a 
b) Area OAPG: 
ab ab, 24 
SoApPG = qt» In P (3.341b) 


The line segment PG is parallel to the lower asymptote, c is the focal distance and d = 0G. 

11. Arcofthe Hyperbola The arclength between two points A and B of the hyperbola cannot be 
calculated in an elementary way like the parabola, but it can be calculated by an incomplete elliptic 
integral of the second kind E(k, p) (see 8.1.4.3,2., p. 502), analogously to the arclength of the ellipse 
(see 3.5.2.8, W p. 201). 

12. Equilateral Hyperbola has axes with the same length a — b, so its equation is 


a? -- η) =a. (3.342a) 
The asymptotes of the equilateral hyperbola are the lines y = +a; they are perpendicular to each 
other. If the asymptotes coincide with the coordinate axes (Fig. 3.163), then the equation is 
2 
a 
ΙΙ (3.342b) 
bé 


xY 


(Xo Yo) 


Figure 3.163 Figure 3.164 Figure 3.165 
3.5.2.10 Parabola 


1. Elements of the Parabola In Fig. 3.164 the x-axis coincides with the axis of the parabola, 
0 is the vertex of the parabola, F is the focus of the parabola which is on the x-axis at a distance p/2 
from the origin, where p is called the semifocal chord of the parabola. The directrix is denoted by NN’; 
it is the line perpendicular to the axis of the parabola and intersects the axis at a distance p/2 from 
the origin on the opposite side as the focus. So the semifocal chord is equal to half of the length of the 
chord which is perpendicular to the axis and passes through the focus. The numerical eccentricity of 
the parabola is equal to 1 (see 3.5.2.11, 4., p. 207). 
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2. Equation of the Parabola Ifthe origin is the vertex of the parabola and the x-axis is the axis o 
the parabola with the vertex on the left-hand side, then the normal form of the equation of the parabola 
is 

y -2pz. (3.343 
For the equation of the parabola in polar coordinates see 3.5.2.11, 6., p. 208. For a parabola with 
vertical axis (Fig. 3.165) the equation is 


y =ar? 4 bz +e. (3.3442. 


The parameter of a parabola given in this form 15 p= Ja] (3.344b 
a 


If a > 0 holds, the parabola is open up, for a < 0 it is open down. The coordinates of the vertex are 


b 4ac — b? 
To = ; z= : 
0 2a Vo m 


3. Properties of the Parabola (Definition of the Parabola) The parabola is the locus of points 
P(x,y) whose distance from a given point, the focus, is equal to its distance from a given line, the 
directrix (Fig. 3.164). Here and in the following formulas in Cartesian coordinates the normal form 
of the equation of the parabola is supposed. Then holds the equation 


(3.344c 


PF=PK=2+8, (3.345) 


where PF is the radius vector whose initial point is at the focus and endpoint is a point of the parabola. 
4. Diameter of the Parabola is a line which is parallel to the axis of the parabola (Fig. 3.166). 
A diameter of the parabola halves the chords which are parallel to the tangent line belonging to the 
endpoint of the diameter (Fig. 3.166). With slope k of the chords the equation of the diameter is 


p à 
pe 34 
i= (3.346) 


y 


Figure 3.166 Figure 3.167 Figure 3.168 


5. Tangent of the Parabola (Fig.3.167) The equation of the tangent of the parabola at the point 
P(xo, yo) is 

YYo = p (T + 10). (3.347 
Tangent and normal lines are bisectors of the angles between the radius starting at the focus and the 
diameter starting at the point of contact. The tangent at the vertex, i.e., the y-axis, halves the segmen 
ofthe tangent line between the point of contact and its intersection point with the axis of the parabola, 
the x-axis: 

TS=SP, TO0—-0M τα TF-FFP. (3.348 
A line with equation y = ka + bis a tangent line of the parabola if 

p —2bk. (3.349 
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6. Radius of Curvature of the Parabola at the point P(xo, yo) with ln as the length of the norma 
PN (Fig. 3.167) is 


27, 2 B 
po Cta) o P. Um (3.350a 
VP simu p? 
and at the vertex 0 it is 
R=p. (3.350b 


7. Areas in the Parabola (Fig. 3.168) 
a) Parabolic Segment PON: 


2 
Sopn = 3 SMQNP (MQNP is a parallelogram). (3.351a. 
b) Area 0PR (Area under the Parabola Curve): 
25: 
Sopn = E : (3.351b 
8. Length of Parabolic Arc from the vertex 0 to the point P(x, y) 
2 2; 
EHI (142) +m (2 ee) (3.352a 


σε (s + 5) + P Arsinh ΤΙ a . (3.352b 
2 2 p 


2 $ cma 
For small values of — the following approximation can be used: 


cus ΠΡ y - 2() (3.352c) 


| 3ky 
3.5.2.11 Quadratic Curves (Curves of Second Order or Conic Sections) 


1. General Equation of Quadratic Curves (Curves of Second Order or Degree) 
The ellipse, its special case, the circle, the hyperbola, the parabola or two lines as a singular conic 
section are defined by the general equation of a quadratic curve (curve of second order) 

aa? +2beytey>+2dx+2ey+f — 0. (3.353a) 
This equation can be reduced to the normal form with the help of the coordinate transformations given 
in Tables 3.19 and 3.20. 
Remark 1: The coefficients in (3.353a) are not the parameters of the special conic sections. 
Remark 2: If two coefficients (a and b or b and c) are equal to zero, the required coordinate transfor- 
mation is reduced to a translation of the coordinate axes. 
The equation c v + 2da + 2ey + f = 0 can be written in the form (y — yo)” = 2p(a — xo); 


S 


the equation az? + 2da + 2ey + f = 0 can be written in the form (£ — £o)? = 2p (y — yo). 
2. Invariants of Quadratic Curves 
are the three quantities 
abd a b 
A-bce, d=], S=ate. (3.353b) 
ἃ 
def 


They do not change during a rotation of the coordinate system, i.e., if after a coordinate transformation 
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Table 3.19 Equation of curves of second order. Central curves (ὃ 4 0)! 


Quantities 6 and A Shape of the curve 
Ellipse a) for A- S < 0: real, 
AZ0Ü 
b) for A - S > 0: imaginary? 
Central curves 5>0 S 
540 A=0 A pair of imaginary? lines with real common point 
AF0 Hyperbola 
à«0 A=0 A pair of intersecting lines 


Normal form of the equation 


Required coordinate transformations after-the transformation 


1. Translation of the origin to the - 
center of the curve, whose coordinates are ar του + 3 0 
be — cd _ bd — ae 

δ » Yo 5 . 
2. Rotation of the coordinate axes by 


2b «c A (a— ο 2 
the angle a with tan 2a = , act y(a- c) 4b 


: d = 
α--ο 9 
The sign of sin 2a must coincide with 
the sign of 2b. Here the slope 


To = 


a4 c— y (a — ο)” + 402 


of the new z'-axis is c= 9 
TA c—a+y(c— a)” + 4b? (α΄ and ὅ are the roots of the quadratic 
2b i equation αἲ-- Su +ô = 0). 


1 A, δ and S are numbers given in (3.353b). 

? The equation of the curve corresponds to an imaginary curve. 

the equation of the curve has the form 

ala? + Wry + ch? + 2d'a! 4 2e'y + f! — 0, (3.353c) 
then the calculation of these three quantities A, ó, and S with the new constants will yield the same 
values. 

3. Shapeofthe Quadratic Curves (Conic Sections) 

If a right double circular cone is intersected by a plane, the result is a conic section. If the plane does 
not pass through the vertex of the cone, the result is a hyperbola, a parabola, or an ellipse depending 
on whether the plane is parallel to two, one, or none of the generators of the cone. If the plane goes 
through the vertex, the result is a singular conic section with A — 0. As a conic section of a cylinder, 
i.e., a singular cone whose vertex is at infinity, yield parallel lines. The shape of a conic section can be 
determined with the help of Tables 3.19 and 3.20. 


4. General Properties of Curves of Second Degree 

The locus of every point P (Fig. 3.169) with constant ratio e of the distance to 
a fixed point F, the focus, and the distance from a given line, the directrix, is a 
curve of second order with numerical eccentricity e. For e < 1 it is an ellipse, for 
e = Lit isa parabola, for e > 1 it is a hyperbola. 


directrix~ 


Figure 3.169 
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Table 3.20 Equations of curves of second order. Parabolic curves (ὃ = 0) 


Quantity ὃ and A Shape of the curve 
Parabolic curves!, 5 =0 cx glad 
Two lines: Parallel lines for d? — af 50, 
A=0 Double line for d? — af =0, 
Imaginary? lines ford? — af « 0. 


Required coordinate transformation 


Normal form of the equation 
after the transformation 


A#0 


1. Translation of the origin to the vertex of the 
parabola whose coordinates xo and yo are defined 


d 4- be 
mr =0 and 


by the equations axo + byo + 


dc — be > — bd 
(0+ 5) a (e “) w+ f=. 


2. Rotation of the coordinate axes by the angle a 


] α . . ] 
with tana = Fi the sign of sina must differ 


from the sign of a. 


A=0 

Rotation of the coordinate axes by the angle a 
" a : $ : 

with tana = Ty! the sign of sina must differ 


from the sign of a. 


ad+be , +f=0 bét 
—— — 0 can be trans- 
Vere * 
formed into the form 
(y' — yo) ( — wi) = 0. 


Sy? +2 


n the case of ὃ -- 0 it is supposed that none of the coefficients a, b, c is equal to zero. 
The equation of the curve corresponds to an imaginary curve. 


Determination of a Curve Through Five Points 
yere is one and only one curve of second degree passing through five given points of a plane. If three 


of these points are on the same line, then it is a singular or degenerate conic section. 


6. Polar Equation of Curves of Second Degree 
All curves of second degree can be described by the polar equation 


where p is the semifocal chord and e is the eccentricity. Here the pole is at the focus, while the polar 
axis is directed from the focus to the closer vertex. For the hyperbola this equation defines only one 


p 


ΡΞ 1+ ecos’ 


branch. 


(3.354) 
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3.5.3 Analytical Geometry of Space 
3.5.3.1 Basic Concepts 


1. Coordinates and Coordinate Systems 

Every point P in space can be determined by a coordinate system. The directions of the coordinate lines 
are given by the directions of the unit vectors. In Fig. 3.170a the relations of a Cartesian coordinate 
system are represented. There are to distinguish right-angled and oblique coordinate systems where 
the unit vectors are perpendicular or oblique to each other. Another important difference is whether it 
is a right-handed or a left-handed coordinate system. 

The most common spatial coordinate systems are the Cartesian coordinate system, the spherical polar 
coordinate system, and the cylindrical polar coordinate system. 

2. Right- and Left-Handed Systems 

Depending on the successive order of the positive coordinate directions there are to distinguish right 
systems and left systems or right-handed and left-handed coordinate systems. A right system has for 
instance three non-coplanar unit vectors with indices in alphabetical order &;, €j, €;. These vectors 
form a right-handed system when an observer, looking on the 6η, 6; plane and at the same time into 
the direction of ἄμ, can rotate e; into 6; along the smallest angle, i.e., clockwise; looking from the cusp 
of the vector Εμ into the direction of the 6;, €; plane he can rotate 6; into €; counterclockwise. A left 
system consequently requires the opposite rotation in both cases. The alphabetical order of rotation is 
represented symbolically in Fig. 3.34, p. 143 where the notations a, b, c are substituted for the indices 
i,j,k. 


Figure 3.170 


Right- and left-handed systems can be transformed into each other by interchanging two unit vectors. 
The interchange of two unit vectors changes its orientation: A right system becomes a left system, and 
conversely, a left system becomes a right system. 

A very important way to interchange vectors is the cyclic permutation, where the orientation remains 
unchanged. As in Fig. 3.34 the interchange the vectors of a right system by cyclic permutation yields 
a rotation in a counterclockwise direction, i.e., according to the scheme (i > j > k > i,j > k > 
i > j,k => i } > k). Ina left system the interchange of the vectors by cyclic permutation follows a 
clockwise rotation, i.e., according to the scheme (i + k > j ük—j—i—kjoi-ko J). 


A right system is not superposable on a left system. 

The reflection of a right system with respect to the origin is a left system (see 4.3.5.1, p. 288). 

W A: The Cartesian coordinate system with coordinate axes x, y, 2 is a right system (Fig. 3.170a). 
W B: The Cartesian coordinate system with coordinate axes x, z, y is a left system (Fig. 3.170b). 
WB C: From the right system 6;, €, €j one gets the left system €;, €x, €; by interchanging the vectors 
8; and ὄχ. 

W D: By cyclic permutation follows from the right system &;, 6], €; the right system 6]. δε, €; and 
from this one €;, 6, €j, a right system again. 
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Table 3.21 Coordinate signs in the octants 


Octant I I Ul IV V VT VII VIII 
T t | t 
y t 

2 


3. Cartesian Coordinates 
of a point P are its distances from three mutually orthog- 
onal planes in a certain measuring unit, with given signs. 
They represent the projections of the radius vector r of 
the point P (see 3.5.1.1, 6., p. 182) onto three mutually 
perpendicular coordinate axes (Fig. 3.170). The inter- 
section point of the planes O, which is the intersection 
point of the axes too, is called the origin. The coordinates 
x, y, and z are called the abscissa, ordinate, and appli- 
cate. The written form P(a, b, c) means that the point P 
has coordinates x = a, y = b, z = c. The signs of the co- 
ordinates are determined by the octant where the point 
P lies (Fig. 3.171, Table 3.21). 

In a right-handed Cartesian coordinate system (Fig. 
3.170a) for orthogonal unit vectors given in the order 
Figure 3.171 €;, 6j, €x the equalities 


ειχξι-δι., eéjxe,—e;, €, XE, =e), (3.355a) 
hold, i.e., the right-hand law is valid (see 3.5.1.5, p. 184). The three formulas transform into each other 
under cyclic permutations of the unit vectors. 
In a left-handed Cartesian coordinate system (Fig. 3.170b) the equations 

Ειχᾶ;--ᾱ., €;x€,=-€;, & x 6; =ë; (3.355b) 
are valid. The negative sign of the vector product arises from the left-handed order of the unit vectors, 
sce Fig. 3.170b, 1.ο., from their clockwise arrangement. 
Notice that in both cases the equations 


6 x6, = ëj xd;—é,x&-Ü (3.355c) 
are valid. Usually one works with right-handed coordinate systems; the formulas do not depend on this 
choice. In geodesy usually left-handed coordinate systems are in use (see 3.2.2.1, p. 144). 

4. Coordinate Surfaces and Coordinate Curves 

Coordinate Surfaces have one constant coordinate. In a Cartesian coordinate system they are planes 
parallel to the plane of other two coordinate axes. By the three coordinate surfaces x = 0, y = 0, 
and z — 0 three-dimensional space is divided into eight octants (Fig. 3.171). Coordinate lines or 
coordinate curves are curves with one changing coordinate while the others are constants. In Cartesian 
systems they are lines parallel to the coordinate axes. The coordinate surfaces intersect each other in 
a coordinate line. 


3.5.3.2 Spatial Coordinate Systems 


1. Curvilinear Three-Dimensional Coordinate Systems 

arise if three families of surfaces are given such that for any point of the space there is exactly one 
surface from every system passing through it. The position of a point will be given by the parameter 
values of the surfaces passing through it. The most often used curvilinear coordinate systems are the 
cylindrical polar and the spherical polar coordinate systems. 
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2. Cylindrical Polar Coordinates (Fig. 3.172) 

consist of: 

e the polar coordinates p and q of the projection of the point P to the x, y plane and 
e the applicate z of the point P. 

The coordinate surfaces of a cylindrical polar coordinate system are: 

e The cylinder surfaces with radius o = const, and the z-axis as axis of the cylinder, 
e the half-planes starting from the z-axis, p = const and 

e the planes being perpendicular to the z-axis, z — const. 

'The intersection curves of these coordinate surfaces are the coordinate curves. 


The transformation formulas between the Cartesian coordinate system and the cylindrical polar coor- 
dinate system are (see also Table 3.22): α 


x = ocos, y=esiny, 2; (3.356a) 
: : y LU 
e m 2 = ateti arene for cm ] 3561 
o z? +y y = arctan 27 arcsin T or r0 (3.356b) 


For the required distinction of cases with respect to q see (3.290c), p. 192. 


Figure 3.172 Figure 3.173 
3. Spherical Coordinates or Spherical Polar Coordinates (3.173) 


contain: 

e The length r of the radius vector r of the point P, 

e the angle Ὁ between the z-axis and the radius vector r and 

e the angle y between the z-axis and the projection of T on the x, y plane. 

The positive directions (Fig. 3.173) here are for r from the origin to the point P, for Ὁ from the z-axis 
to T, and for o from the z-axis to the projection of r to the x, y plane. With the values 0 < r < 00,0 < 
Ü <a, and —7 < p € m every point of space can be described. 

Coordinate surfaces are: 

e Spheres with the origin 0 as center and with radius r = const, 

e circular cones with J = const, with vertex at the origin, and the z-axis as the axis and 

e closed half-planes starting at the z-axis with y = const. 

The intersection curves of these surfaces are the coordinate curves. 

4. Relations between Cartesian, Cylindrical and Spherical Polar Coordinates 

W The transformation formulas between Cartesian coordinates and spherical polar coordinates (see 
also Table 3.22) are: 


r-—rsinÜcosp, y=rsinvsinyg, z=rcosv, (3.357a) 
[5d a 
τς γα +y? z?, 0 — arctan SES , € = arctan E (3.357b) 
£z x 


For the required distinction of cases with respect to q see (3.290c), p. 192. 


212 3. Geometry 


Table 3.22 Relations between Cartesian, cylindrical, and spherical polar coordinates 


Cartesian coordinates | Cylindrical polar coordinates | Spherical polar coordinates 


t= = ocos p = r sin Ù cos y 
y= = osin = rsin 9 sin p 
a =z =rcosv 

va? ra? Ξρ =rsinv 
arctan = = 

=: =g =rcosv 


TESTET Ξ-γοὶ 22 =r 


arctan ————— = arctan — =v 
4 2 Ζ 
arctan z = =) 


5. Direction in Space 
A direction in the space can be determined by a unit vector t? (see 3.5.1.1, 6., p. 181) whose coordinates 
are the direction cosines, i.e., the cosines of the angles ag, Bo, Yo between the vector and the positive 
coordinate axes (Fig. 3.174) 


E : 2 , 

l=cosay, m-—cosfp n=cosy, Pm +n =l. (3.358a) 
The angle y between two directions given by their direction cosines /4, m4, n; and lo, m», n3 can be 
calculated by the formula 


coso = [119 + Mmm + nana. (3.358b) 
Two directions are perpendicular to each other if 
lila + Mmm + nına = 0. (3.358c) 


Figure 3.174 Figure 3.175 
3.5.3.3 Transformation of Orthogonal Coordinates 
1. Parallel Translation 


If the original coordinates are x, y, z, and a, b, c denote the coordinates of the origin of the new coor- 
dinate system in the old system (Fig. 3.175), then the new coordinates 2’, y, z’ satisfy the relations 
πξα-α, y=y +b, z-—z-c x'—v-a  y-y-b z-z-c. (3.359) 


2. Rotation of the Coordinate System 
The relation between the original coordinates x, y, z and the new coordinates 2’, y’, 2’ after rotation is 
given by 


2 x 
x y D|vy Dx. (3.3602) 
z z 
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D is called the rotation matrix of the coordinate system. The special rotation matrix 


1 0 0 
Ῥ.(α) -- | 0 cosa sina (3.360b) 
0 —sina cosa 
describes rotations of the x, y, z system around the x-axis by the angel a. In analogy rotations of the 
x,y, 2 system around the y-axis by the angel 6 or around the z-axis by the angel y describe the following 
rotation matrices: 
cos B 0 — sin 8 cosy sin 0 


D,(8) = ( 0 1 0 | ; (3.360c) D.(y) = (^m cos y o) s (3.360d) 
sing 0 cos 0 0 1 


Remark:The rotation ofthe x, y, z system around an arbitrary axis through the origin can be described 
with the help of direction cosines (see 3.5.3.4), Cardan angels (3.5.3.5) or Euler angels (3.5.3.6). 

3. Rotation of an Object 

In geometry two types of transformations are distinguished (see also 3.5.4, p. 229): 

a) coordinate transformations (the coordinate system is transformed), and 

b) geometric transformations (the position of a geometric object is changed in a fixed coordinate sys- 
tem). 

Consequently, at the rotation around an arbitrary axis through the origin the following rotations are 
distinguished 

a) rotation of the coordinate system (see (3.360a-d)) and 

b) rotation of an object in a fixed coordinate system. In this case: 


x'p— Rxp (3.361a) R-D!-DT. (3.361b) 


Here the formulas (3.361a,b) describe the relation between the coordinates xp, yp, zp of the initial po- 
sition of the object and its coordinates xp, Yp, Zp after the rotation. R is called the rotation matrix of 
the object. 

Remarks: 

1. Rotations around an arbitrary axis through the origin will have a suitable description with quater- 
nions (see 4.4.2.5, p. 297). 

2. Rotations around an arbitrary axis not running through the origin, will be discussed in the example 
on p. 235. 


3.5.3.4 Rotations with Direction Cosines 


1. Rotation of the Coordinate Axis 
If the direction cosines of the new axes are given as in Table 3.23, see also (Fig. 3.176), then for the 
new and old coordinates 


2' =hetmy+mz, 2 = hr ly +132’, 
y. = lox + may + noz, (3.362a) y = ma’ + may + maz , (3.362b) 
2’ = lar + may + naz ; z τινα’ + nay + ng?’ 


holds. The coefficient matrix of the system (3.362a), which is called the rotation matriz D, and the 
determinant of the transformation A are 


ho ob d l l ls 
D={m m mj, (3.362c) det D = A = mi m» πια]. (3.362d) 
n n n ny n3 ma 
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The following relations are valid: 


(x) =D (") or (") =D" (x) ; (3.362e) 


Table 3.23 Notation for the direction cosines under coordinate transformation 
With respect to the Direction cosine 
the old axes of the new axes 
/ 
r y A 
r lh ly ls 
y ma mg ma 
z nı na ng 


2. Properties of the Transformation Determinant 

a) A = +1, with a positive sign if it remains left- or right-handed, as it was, and with negative sign if 
it changes its orientation. 

b) The sum of the squares of the elements of a row or column is always equal to one. 

c) The sum of the products of the corresponding elements of two different rows or columns is equal to 
zero (see 4.1.4, 9., p. 275). 
d) Every element can be written as the product of A = +1 and its adjoint (see 4.2.1, p. 278). 

3. Scalar Invariant 

This is a scalar which keeps its value during translation and rotation. The scalar product of two vectors 
is a scalar invariant (see 3.5.1.5, 3., p. 185). 


— 


W A: The components of a vector à = {a1, a2, a3} are not scalar invariants, because they change their 
values during translation and rotation. 


E B: The length of a vector & = (a1, a2, ag}, i.e., the quantity 4/a? + αὖ + αὖ, is a scalar invariant. 


W C: The scalar product of a vector with itself is a scalar invariant: 
aa = a? = |a|? cos y = |8?|, because ᾧ = 0. 


3.5.3.5 Cardan Angles 
1. Definition of the Cardan angles 


Every rotation of a coordinate system around an arbitrary axis through the origin can be described by 
three successive rotations around the coordinate axis. The angles of rotation a, 3, ^; are called the 
CARDAN-angles, if the rotations are performed in the following or- 
der (see schematically Fig.3.176): 
1. The first rotation is around the z-axis by an angle y, and results 
in the x, y, z(9 coordinate system with 2/1) = z. 
2. The second rotation is made around the image of the y-axis by 
the first rotation, i.e. around the y® axis by an angle β, and results 
the α0), y®), z@) coordinate system with y) = y®. 
3. The third rotation is made around the image of the a-axis by the 
second rotation, i.e. around the z-axis by an angle a. It results 
the finally required 2’, y', z' coordinate system, x! = 2) = ας), 
y = y) 2! = 28). 


Remark: There are different definitions of the Cardan-angles given 
Figure 3.176 in the literature. 
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2. Calculation of the rotation matrix Dc 
It follows from the given order of the rotations (3.360a-d) that the rotation matrix D --Ώς is 


Dc = D,(a)D,(8 


)D.(7). 


(3.363a) 


According to the Falk’s schema (Fig. 4.1, 4.2, p. 273) holds 


cosy siny 0 
—siny cosy 0 


(3.363b) 


0 0 1 
cos B 0 — sin B. | cos B cos y cos B sin y — sin B 
0 1 0 — sin COS y 0 
sing 0 cosB |cosysinf sin siny cos 
1 0 0 cos B cos y cos β sin y — sin B 
0 cosa sina|—cosasiny+sinacosysinf cosacosy+sinasinfsiny sinacos 8 


0 —sina cosa 


sin a sin y + cosa cos y sin B 


cos f) cos ^ 


cosa sin 8 sin y — sina cosy cosa cos f 


cos B sin y — sin B 


Dc = ( —cosasiny + sin a cos y sin 8 


cosacosy+sinasinfsiny sinacos§ |. (3.363c) 
sin o, sin y + cos a cos y sin 8 


cosa sin f sin y — sina cosy cosa cos f 


3. Direction Cosines as Function of the Cardan angles 
Since the rotation matrices D (in (3.362c), p. 213) and Dc (in (3.363c)) coincide, the direction cosines 
can be expressed as functions of the Cardan-angles: 


li = C263, m4 = C253, nı = 82; 
lə = —6483 + 51C352, Mz = C1C3 + S152583, ΤΠ ο (3.364a) 
l3 = 8183 + 616180, Mg = €18383 — 81C3, N3 = C1C3 
: €; — COSQ, C9-— COSp; ὁξ cos ; 
with απο ὑπ να (3.364b) 
s;=sina, so-—sinf, s3=siny. 


3.5.3.6 Euler's angles 
1. Definition of the Euler angles 


The position of the new coordinate system with respect to the old one can be uniquely determined by 
three angles, introduced by Euler (Fig. 3.177). 


a) The nutation angle 2 is the angle between the positive halves 
of the z- and z'-axis there is 0 € J < 7. 

b) The precession angle w is the angle between the positive di- 
rection of the x-axis and the intersection line K of the x, y plane and 
the z^, y' plane. The positive direction of K is chosen depending on 
whether the z-axis, the z'-axis and K form a direction triplet with 
the same orientation as the coordinate axes (see 3.5.1.3, 2., p. 183). 
The angle v» is measured from the x-axis to the direction of the y- 
axis; there is 0 € Y < r. 

c) The rotation angle ¢ is the angle between the positive halves 
of the z'-axis and the intersection line K; there is 0 €  « 27. 


Remark: In the literature there are also other definitions in use for 
the Euler angles. 


Figure 3.177 
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2. Calculation of the rotation matrix Dg 
The transition from coordinate system x, y, z into the coordinate system z^, y’, 2’ (Abb.3.177) can be 
given by three rotations considering (3.360a-d)) as follows: 


1. The first rotation is around the z-axis by angle v, and results the coordinate system a, (0, 2 


MESS 
where 2“ = z: 


ασ cosy siny 0 x 
yO | = [- sin tj cos ij o) (") ie κ) = D.(i)x. (3.3652) 
z0) 0 0 1 2 


The axis « coincides with the intersection line K. 


2. The second rotation is around the x-axis by angle J, and results the coordinate system α!2), y, 2), 
where 2@ = a; 


α0) 1 0 0 σα) 
y2 | = (ο cos 0 sn yO | , ie. x 2 D,(9) x9. (3.365b) 
0 — sin V cos Ü 


3. The third rotation is around the z-axis by angle y, and results the final coordinate system 2’, y’, 2’, 
where 2’ = 2): 


a! cosy siny 0 x) 
y | = | —siny cosy 0 y? |, ie x =D.(y)x®. (3.365c 
z 0 0 1 29 
All together for D = Dg in (3.360a) holds: 
Dg = ὉὈ.(ϱ)Ὀι(ὐ)Ὀ.(Φ). (3.366a 
Analogous to the calculation of the rotation matrix Dc using Falk’s schema, here in the form 
D.(v) 
Ῥ.(9) D,(9)D.(v) one gets (3.366b 
D.(o»)|D:(2)D.(9)D.(V) = De 
( COS y cos Y — sin y cos Ü sin i cosypsinw + sin y cos cosy singsind 
Deg = 


— sin y cos i — cos p cos Ùsiny — sin ysin + cos pcos cosy cosysind | (3.366c 
sin 1} sin tj — sin Ü cos cos Ü 


3. Direction Cosines as Function of the Euler angles 
Because of the identity of the rotation matrices D (see (3.362c), p. 213) and Dz (see (3.366c) the 
following formulas for the direction cosines as functions of the Euler angles are valid: 


l= T = 8061 + C1€383 , ny = 8483; 
Ig = —C983 — C1 89C3, Ma = —8383-- C1C9C3, Πο = S163 ; (3.367a) 
l3 = $185, m3 = —S1C2, ng = οι 
ap Cp c08Ü, co=cosw, C3 = COSY 
with ο (€ Y, a P, (3.367b) 


$,=sinv, sg=sinwv, s3=sing. 
3.5.3.7 Special Quantities in Space 
1. Coordinates of the Center of Mass 
The coordinates of the center of mass P(Z,¥,2Z) (often called incorrectly the center of gravity) of a 


system of n material points P;(x;, yi, zi) with mass m; are calculated by the following formulas, where 
the sum index i changes from 1 to n: 


D mizi j L miyi z D mizi 
Em ? Emp” Σ πι; ᾿ 


(3.368) 
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2. Division of a Segment 
The coordinates of the point P(x, y, z) dividing a segment between the points P (x1, y1, σι) and P» (x2, 
Y2, 22) in a given ratio 


P,P m 
= =—_ =) .369a 
EB. n (3.369a 
are given by the formulas 
nz; muxo 21 -λ21 (3.369b) ny +My? yı +Ay2 , (3.369 
n+m 1+A n+m 1+A 
nzi tmz 2+Az 
= = .369d 
n+m 1+A τ) 
The midpoint of the segment is given by 
v1 X2 Yi + να zı + 2ο : 
im = > m = E Em = . 3.369e 
E 2 " NES 2 (3:3039 
Ζ Ῥο(Χο 2/22) 3. Distance Between Two Points 


The distance between the points P, (21,41, 21) and P»(xs,ys.23) in 
δ Fig. 3.178 is 


P(x,y,z) d= ψίαν zi) + (yo y)? H (22 a)’. (3.370a) 
P468 y4 2) The a cosines of the segment between the points can be calculated by 
y the formulas 

cosa = 22 F a , (ορ Z = B , cosy = mcm (3.370b) 


Figure 3.178 d d 


4. System of Four Points 

Four points P(x, y, z), Pi (τι. y1, 21) , Pe (£2, yo, 22) and P; (£3, Y3, 23) can form 
a tetrahedron (Fig. 3.179) or they arein a plane. The volume of a tetrahedron 
can be calculated by the formula 


eu gi -—— 
1 oer 1 mE! —CSg1^4 αἱ 
Ves s ΠΕ T2 Yo 2 — 22], (3.371) 
y 6 X2 Y2 Z2 1 TS a aede us doch 
* Figure 3.179 T3 ys 23 1 


— — 
where it has a positive value V > 0 if the orientation of the three vectors PP,, PP: , PP} is the same 
as the coordinate axes (see 3.5.1.3, 2., p. 183). Otherwise it is negative. 

The four points are in the same plane if and only if 


=0 holds. (3.372) 


3.5.3.8 Equation of a Surface 
Often an equation 

F(x,y,z) 20 (3.373) 
corresponds to a surface with the property that the coordinates of every of its points P satisfy this 
equation. Conversely, every point whose coordinates satisfy the equation is a point of this surface. The 
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equation (3.373) is called the equation of this surface. If there is no real point in the space satisfying 
equation (3.373), then there is no real surface. 

1. The Equation of a Cylindrical Surface (see 3.3.4, p. 156) whose generating lines are parallel 
to the x-axis contains no x coordinate: F(y, 2) = 0. Similarly, the equations of the cylindrical surfaces 
with generating lines parallel to the y- or to the z-axes contain no y or z coordinates: F(x,z) = 0 or 
F(x,y) = 0 resp. The equation F(x,y) = 0 describes the intersection curve between the cylinder and 
the x,y plane. If the direction cosines, or the proportional quantities l, m, n of the generating line of a 
cylinder are given, then the equation has the form 


F (nx — lz, ny — mz) = 0. (3.374) 
2. The Equation of a Rotationally Symmetric Surface, i.e., a surface 


which is created by the rotation of a curve z = f(x) given in the x, z plane 
around the z-axis (Fig. 3.180), will have the form 


= (veti) , (3.375) 


The equations of rotationally symmetric surfaces can be obtained similarly 
in the case of other variables as well. 


W The equation of a conical surface, whose vertex is at the origin (see 3.3.4, 
p. 157), has the form F (x,y,z) = 0, where F is a homogeneous function of 
the coordinates (see 2.18.2.5, 4., p. 122), e.g. F(x,y, z) = z— να y? —0 Figure 3.180 
with the degree 1 of homogeneity, i.e. F(Av, Ay, Az) = AF (x, y, 2). 
3.5.3.9 Equation of a Space Curve 
A space curve can be defined by three parametric equations 

r=ilt), y= yrlt), z= s(t). (3.376) 
To every value of the parameter t, which does not necessarily have a geometrical meaning, there corre- 


sponds a point of the curve. 
Another method to define a space curve is the determination by two equations 


Fi(a,y,z)=0, Fo(x,y,z) =0. (3.377) 
Both define a surface. The space curve contains all points whose coordinates satisfy both equations, 
i.e., the space curve is the intersection curve of the given surfaces. In general, every equation in the 
form 

ΕΙ +AF, =0 (3.378) 
for arbitrary A defines a surface which goes through the considered curve, so it can substitute any of 
the equations (3.377). 
3.5.3.10 Line and Plane in Space 


1. Equations of the Plane 

Every equation linear in the coordinates defines a plane, and conversely every plane has an equation of 
first degree. 

1. General Equation of the Plane 

a) with coordinates: Ax + By4- Cz -4 D — 0, (3.3792) 


b) in vector form: FN + D=0," (3.379b) 


*For the scalar product of two vectors see 3.5.1.5, p. 184 and in affine coordinates see 3.5.1.6, 5., p. 186; for the 
vector equation of the plane see 3.5.1.7, p. 189. 
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where the vector N(A, B,C) is perpendicular to the plane. In (Fig. 3.181) the intercepts a, b, and c 


are shown. The vector N is called the normal vector of the plane. Its direction cosines are 


cos A cos B 2 co £ (3.379c) 
OS Q = 3 OS D = 3 OS "y = : s 
JAP + ΒΕ ΟἿ OS (ite 7 JPR eo 


If D = 0 holds, the plane goes through the origin; for A = 0, or B = 0, or C = 0 the plane is parallel 
to the x-axis, the y-axis, or the z-axis, respectively. If A= B = 0, or A= C = 0, or B = C = 0, then 
the plane is parallel to the x, y plane, the x, z plane or the y, z plane, respectively. 

2. Hessian Normal Form of the Equation of the Plane 

a) with coordinates: x cosa + y cos 6 + zcosy — p = 0, (3.380a) 


b) in vector form: ΤΝ. p=0, (3.380b) 


where N° is the unit normal vector of the plane and p is the distance of the plane from the origin. The 
Hessian normal form arises from the general equation (3.379a) by multiplying by the normalizing factor 
1 1 

EM N ΝΑΡ + B2 + C? 
Here the sign of jj must be chosen opposite to that of D. 


3. Intercept Form of the Equation of the Plane With the segments a, b, c, considering them 
with signs depending on where the plane intersects the coordinate axes (Fig. 3.181) holds: 


x Yy 2 — 
ipee (3.381) 


4. Equation of the Plane Through Three Points If the points are P (21, y1, 21) , P» (2, Y2, 22) , 
P3 (23, Y3, 23) , then it holds: 


with N = |Ñ]. (3.380c) 


t-i Yy- z—2z 
a) with coordinates: |» — 2) Y2 — Yı 
T3 — Tı να — Yı 
b) in vector form: (E-r) (F — το) F— Τη) — 0f. (3.382b 
5. Equation of a Plane Through Two Points and Parallel to a Line 

The equation of the plane passing through the two points P, (σι, y1, 21), P» (2, ye, 20) and being paralle 


=0, (3.382a 


to the line with direction vector (l, m, n) is the following 


x—x1y—yW 2—2 
a) with coordinates: § | %2— ri yo— Yı 22 — 41 | = 0, (3.383a 
l m n 
b) in vector form: (£ τι) (f — τυ) R.— 0f. (3.383b 


6. Equation of a Plane Through a Point and Parallel to Two Lines 
If the direction vectors of the lines are Ri (hi, mi, nı) and Ro (lo, m», n2), then it holds: 


T— Ti Y— Yı Z- 21 
a) with coordinates: [η mi nı |=0, (3.384a) 
ly ma n2 
b) in vector form: (5-- τι) R, Rs = 0f. (3.384b) 


7. Equation of a Plane Through a Point and Perpendicular to a Line 
If the point is P, (x1, y1, 21), and the direction vector of the line is N(A, B,C), then it holds: 
a) with coordinates: A(x -— x1) + B(y— yı) + C (z — z1) = 0, (3.385a) 


For the mixed product of three vectors see 3.5.1.6, 2., p. 185 
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b) in vector form: ΓΝ -0. (3.385b) 


8. Distance of a Point from a Plane Substituting the coordinates of the point P(a, b,c) in the 
Hessian normal form of the equation of the plane (3.380a) 


xcosa + y cos B+ z cosy — p = 0, (3.3862) 
results in the distance with sign 

6 = acosa + bcos B + ccos ó — p, (3.386b) 
where ô > 0, if P and the origin are on different sides of the plane; in the opposite case ô < 0 holds. 


9. Equation of a Plane Through the Intersection Line of Two Planes The equation of a plane 
which goes through the intersection line of the planes given by the equations Ax + Biy 4- Cz + Dj — 0 
and Aoz + Boy + Coz + Dz = Q is 


a) with coordinates: Ax + Biy + Ciz + Dı + À (Ax + Boy + Coz + D3) = 0. (3.387a) 


b) in vector form: FÑ +D 4 ACN, + Dz) = 0. (3.387b) 


Here λ is a real parameter, so (3.387a) and (3.387b) define a pencil of planes. Fig. 3.182 shows the 
case with three planes. If A takes all the values between —oo and +00 in (3.387a) and (3.387b), so this 
yields all the planes from the pencil. For \ = +1 it results in the equations of the planes bisecting the 
angle between the given planes if their equations are in normal form. 


2. Two and More Planes in Space 
1. Angle between Two Planes, General Case: The angle y between two planes given by the 
equations Aya + Biy + Ciz + Τι = 0 and Agr + Boy + Coz + Dz = 0 can be calculated by the formula 

A143 + By Bo + Ci C2 


COS (p : : : : = (3.388a) 
V/A + B? + CT) (43 + B3 + C3) 
If the planes are given by vector equations r Νι + D,=Oandr No + Də = 0, then 
ΝΙΝ, 3 - 
cosy = — —? with M =|Nj| and M: = [Ns]. (3.388b) 
ΝΙΝΟ 
z^ 
0 
K , 
x 
Figure 3.181 Figure 3.182 Figure 3.183 


2. Intersection Point of Three Planes: The coordinates of the intersection point of three planes 
given by the three equations Aix + Biy + Ciz + Di = 0, Agr + Boy + Coz + Dy = 0, and A3x+ Bay + 
C3z + Ds = 0, are calculated by the formulas 

SA zN -Δ 

ᾱ- K J= Ko z- a with (3.3892) 


*For the scalar product of two vectors see 3.5.1.5, p. 184 and in affine coordinates see 3.5.1.6, p. 186; for the equation 
of the plane in vector form see 3.5.1.6, p. 189. 
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Αι B Ci D, BC, Αι Ρι Οι A, Bı Dı 
A =| Ap Be C|, Ag =|D2 Bz Co|, Ay =|A2 Dz C2|, Az =| Az Bə Τη]. (83898) 
A3 Bs Cs Ds Bs C3 A3 Ds Cs As Bs Ds 


Three planes intersect each other at one point if A Z 0 holds. If A = 0 holds and at least one subde- 
terminant of second order is non-zero, then the planes are parallel to a line; if every subdeterminant of 
second order is zero, then the planes have a common line. 

3. Conditions for Parallelism and Orthogonality of Planes: 

a) Conditions for Parallelism: Two planes are parallel if 


A B C. > 2 2 

T = B - a or | NQxNa5-0 holds. (3.390) 
b) Conditions for Orthogonality: Two planes are perpendicular to each other if 

A,A + BıB2+C1C2=0 or  ΝΙΝΟ-0 holds. (3.391) 


4. Intersection Point of Four Planes: Four planes given by the equations Ajw+Byy+C)z+D, = 
0, Az + By + C2z + D2 —0, Απ Bay + Caz + D = 0, and A4z + Bay + Caz + Dy = 0 havea 
common point only if for the determinant 

Αι Bı Οι Dı 
EU By C5 Ds 
As Bs C3 Ds 
Ay Bı Ca Dı 
holds. In this case the common point is determined from three equations, the fourth equation is super- 
fluous; it is a consequence of the others. 

5. Distance Between Two Parallel Planes: Iftwo planes are parallel, and they are given by the 
equations 


= =0 (3.392) 


Az+ By+Cz+D,=0 and =Ax+ By+Cz+D2=0, (3.393) 
then their distance is 
__ μοι 9.) ; 
°° TPO (aua 


3.5.3.11 Lines in Space 


1. Equations of a Line 

1. Equation of a Line in Space, General Case Because a line in space can be defined as the 
intersection of two planes, it can be represented by a system of two linear equations. 

a) In component form: 


Az + Byyt+Ciz+D,=0, Αρα + Boy + Coz + Do = 0. (3.395a) 
b) In vector form: 

FN, +D =0, ΤΝ; +D: --0. (3.395b) 
2. Equation of a Line in Two Projecting Planes 
The two equations y= kr--a, z=hxt+b (3.396) 


define a plane each, and these planes go through the line and are perpendicular to the x, y and the z, z 
planes resp. (Fig. 3.183). They are called projecting planes. This representation cannot be used for 
lines parallel to the y, z plane, so in this case other projections to other coordinate planes are considered. 
3. Equation of a Line Through a Point Parallel to a Direction Vector 

The equation (or the system of equations) of a line passing through a point P, (σι, y1, σι) parallel to a 
direction vector R(I, m, n) (Fig. 3.184) has the form 

a) in component representation and in vector form: 
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Figure 3.184 Figure 3.185 


πιο UM ee (3.397a) (f-7,)x R=0, (3.397b) 


l m n 


b) in parametric form and vector form: 


αξαπι εἰ, y=ytmt, z—z-cnt; (33976) r=r,+ Rt, (3.397d) 


where the numbers zi, y1, 21 are chosen such that the equations in (3.395a) are satisfied. The repre- 
sentation (3.397a) follows from (3.395a) with 


μαι οι JOGA] AB, an 
l= BO? ™=|e, A^ "=| Ay Bol? (3.398a) 
or in vector form R = Ñ; x No. (3.398b) 


4. Equation ofa Line Through Two Points The equation ofa line through two points P; (x1, y1, 21) 
and P; (£2, yo, 22) (Fig. 3.185) is 
in component form and in vector form: 


a) SOLE BLA 4 (53990) b) (F—#) x (P-R) = 0". (3.399b) 


T2 — πι Y2 — Yı 42 — 21 


y- y g—A 
= s fxi = x 
Yo- Yı 22 — 21 
and y; = y» are both valid, the equations in component form are £ = £2, y = yo. 


If for instance xı = 20, the equations in component form are 7 = 20, 


5. Equation of a Line Through a Point and Perpendicular to a Plane The equation of a 
line passing through the point Pi(x1, y1, 21) and being perpendicular to a plane given by the equation 
Ar + By 4- Cz 4 D —0 or by £N + D — 0 (Fig. 3.186) is 

in component form and in vector form: 


ατα y-W Z—Z 
B B C 


If for instance A — 0 holds, the equations in component form have a similar form as in the previous 
case. 


(3.4002) b) (g—-£)xN-G. (3.400b) 


2. Distance of a Point from a Line Given in Component Form 
For the distance d of the point M (a, b, c) from a line given in the form (3.3972) holds: 
[ία — αι) m — (b — y) I? + (b νι) — (e — σι) m? + (e — z)1— (a — zi) n? 


&- 
13 4a? n? 


(3.401) 


*For the product of vectors see 3.5.1.5, p. 184 
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3. Smallest Distance Between Two Lines Given in Component Form 
If the lines are given in the form (3.3972), their distance is 


Tı — T2 Yi — Y2 7 — 5 
+ η my ni 
l5 Tm» ng 
5 5 i (3.402) 
hm; mi ni ny ly 
lo ma Tm n ng lə 


If the determinant in the numerator is equal to zero, the lines intersect each other. 
3.5.3.12 Intersection Points and Angles of Lines and Planes in Space 


1. Intersection Points of Lines and Planes 
1. Equation of the Line in Component Form The intersection point of a plane given by the 
equation 


Ax + By + Cz + D = 0, and a line given by = F A πα 
m 


2 = 2i " 
E has the coordinates 
n 


τ--πι-ἰρ, g—y —mp, Z—z-—np with (3.4032) 
Az, + By, +Cz+D 
Al+Bm+Cn 
If Al+ Bm 4- C n = 0 holds, then the line is parallel to the plane. If Az; + By; + Cz, + D = 0 is also 

valid, then the line lies in the plane. 

2. Equation of the Line in Two Projecting Planes The intersection point of a plane given by 

the equation Ax + By - Cz 4- D = 0, and a line given by y = kx +a, and z = hz +b has the coordinates 
Ba+Cb+D 

t= , J=kī+a, Z=hE+). 3.404 

πο As BELOW * ΤΉΝ, πα 

IfA+Bk+Ch = 0 holds, then the line is parallel to the plane. If Ba + Cb+ D = 0 is also valid, then 

the line lies in the plane. 

3. Intersection Point of Two Lines If the lines are given by y = kir + a, z = hx + bı and 

y = kax + ag, z = hax + bz, then the coordinates of the intersection point, if any exists, are: 


p- (3.403b) 


az — αι by — bi _ kia — kay haba — hab, 
= = "T a= 

kika bo D k-k ’ hic f 
The intersection point exists only if 


(αι — a3)(h4 — Πο) = (by — b3) (ki — ka). (3.405b) 
Otherwise the lines do not intersect each other. 


(3.4052) 


T 


2. Angles between Planes and Lines 
1. Angle between Two Lines 


] : ατα Y-y 2-3ι 2 -- 22 
a) General Case: If the lines are given by the equations i and i = 
1 mi ny [ο 


y— Ya z— 29 
mo ng 
intersection between them 


Ll; My + nn; 
cosy = '' or (3.406a) 


VG +m? + n?) (B + m 4 n2) 


or in vector form by (r — ri) x R, = Õ and (r — r2) x Ro = 0, then for the angle of 


2 


3 gm, Yhes [R;| and Ry = [Rs]. (3.406b) 


cosy = 
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b) Conditions of Parallelism: Two lines are parallel if 


l [ "n 
εκ ος a ee, (3.407) 
lh m3 ma 
c) Conditions of Orthogonality: Two lines are perpendicular to each other if 
lilo + mim + nn = 0 or RıRz = 0. (3.408) 


2. Angle Between a Line and a Plane 


α- yy 
a) If the line and the plane are given by the equations i ——— 


= 277 and Ax + By 4- Cz4 
n 


m 
D = 0 or in vector form by (r — τι) x R = 0 and £N + D = 0, one gets the angle y by the formulas 
Al + Bm 1 
sing = μιά, or (3.409a 
"TES +B? + C?) (P. +m? +n?) 
RN 4 S 
sing = RN with R = |R| and N = |N]. (3.409b 


b) Conditions of Parallelism: A line and a plane are parallel if 


Al+Bm+Cn=0 oœ RÑ=0. (3.410 
c) Conditions of Orthogonality: A line and a plane are orthogonal if 
A B C am Κος (3.411 


l m n 


3.5.3.13 Surfaces of Second Order, Equations in Normal Form 


1. Central Surfaces 


The following equations, which are also called the normal form of the equations of surfaces of second 
order, can be derived from the general equations of surfaces of second order (see 3.5.3.14, 1., p. 228) by 
putting the center at the origin. Here the center is the midpoint of the chords passing through it. The 
coordinate axes are the symmetry axes of the surfaces, so the coordinate planes are also the planes of 
symmetry. 
2. Ellipsoid 
With the semi-axes a, b, c (Fig. 3.187) the equation of an ellipsoid is 

soy p 
The following special cases are to be distinguished: 
a) Compressed Ellipsoid of Revolution (Lens Form): a = b > c (Fig. 3.188). 
b) Stretched Ellipsoid of Revolution (Cigar Form): a — b « c (Fig. 3.189). 
c) Sphere: a = b = c so that 2? + y? + 2 = a? is valid. 
The two forms of the ellipsoid of revolution arise by rotating an ellipse in the x, z plane with axes a and 
c around the z-axis, and one gets a sphere if rotating a circle around any axis. If a plane goes through an 
ellipsoid, the intersection figure is an ellipse; in a special case it is a circle. The volume of the ellipsoid 
is 

4παὐς 


Va . 3.413 
3 (3.413) 
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[p 


Figure 3.187 Figure 3.188 


3. Hyperboloid 

a) Hyperboloid of One Sheet (Fig. 3.190): With a and b as real and c as imaginary semi-axes 

the equation is 
oy z 

d A == 1 (for generator lines see p. 226). (3.414) 

b) Hyperboloid of Two Sheets (Fig. 3.191): With cas real and a, b as imaginary semi-axes the 

equation is 


qe y? E 
ud ped (3.415) 


Intersecting it by a plane parallel to the z-axis results in a hyperbola in the case of both types of hy- 
perboloids. In the case of a hyperboloid of one sheet the intersection can also be two lines intersecting 
each other. The intersection figures parallel to the x, y plane are ellipses in both cases. 

For a = b the hyperboloid can be represented by rotation of a hyperbola with semi-axes a and c around 
the axis 2c. This is imaginary in the case of a hyperboloid of one sheet, and real in the case of that of 
two sheets. 


Figure 3.189 Figure 3.190 Figure 3.191 


4. Cone (Fig. 3.192) 
If the vertex is at the origin, the equation is 

a p 22 
P+ GF =o. (3.416) 
As a direction curve can be considerd an ellipse with semi-axes a and b, whose plane is perpendicular 
to the z-axis at a distance c from the origin. The cone in this representation can be considered as the 
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asymptotic cone of the surfaces 
2 2 

zt y^ 2 

τν με] (3.417) 


a2 b2 c nu 


whose generator lines approach infinitely closely both hyperboloids at infinity (Fig. 3.193). Fora = b 
there is a right circular cone (see 3.3.4, 9., p. 157). 


Figure 3.192 Figure 3.193 
5. Paraboloid 


Since a paraboloid has no center, in the following section it is supposed that the vertex is at the origin, 
the z-axis is its symmetry axis, and the x, z plane and the y, z plane are symmetry planes. 
a) Elliptic Paraboloid (Fig. 3.194): 
2 2 
r^ y j 

t= + B (3.418) 
The plane sections parallel to the z-axis result in parabolas as intersection figures; those parallel to the 
x,y plane result in ellipses. 
The volume of a paraboloid which is cut by a plane perpendicular to the z-axis at a distance h from the 
origin is given as (see also Fig.3.194) 


1 - 
V= grabh , (3.419) 
The parameters a = av/h and b = bvh are the half axis of the intersecting ellipse at height h. So, 
(3.419) yields the half of the volume of an elliptic cylinder with the same upper surface and altitude 
(compare with (3.328a), p. 201). 
b) Paraboloid of Revolution: For a = b one gets a paraboloid of revolution. It can be generated by 


rotating the z = x?/a? parabola of the x, z plane around the z-axis. 
c) Hyperbolic Paraboloid (Fig. 3.195): 
ey 
ΠΡΙ (3.420) 


The intersection figures parallel to the y, z plane or to the x, z plane are parabolas; parallel to the x, y 
plane they are hyperbolas or two intersecting lines. 
6. Rectilinear Generators of a Ruled Surface 
These are straight lines lying completely in this surface. Examples are the generators of the surfaces of 
the cone and cylinder. 
a) Hyperboloid of One Sheet (Fig. 3.196): 
2 2 2 
T 2 
Bert ee (3.421) 


a P ὦ 
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Figure 3.194 Figure 3.195 
The hyperboloid of one sheet has two families of rectilinear generators with equations m 
2$ u(i i 3 u(- 3 το (3.422a) 
α è b à € b 


τ 42 aol 3! (ο JE (3.422b) 


a c b α c b’ 
where u and v are arbitrary quantities. 
b) Hyperbolic Paraboloid (Fig. 3.197): 
αἲ y 
z= T (3.423) 
The hyperbolic paraboloid also has two families of rectilinear generators with equations 


ET (Z η) =z; (94944) ορ, o(= 7) z. (84949) 
a b a b a b a b 


The quantities u and v are again arbitrary values. In both cases, there are two straight lines passing 
through every point of the surface, one from each family. Only one family of straight lines is denoted 
in Fig. 3.196 and Fig. 3.197. 


Figure 3.196 Figure 3.197 
7. Cylinder 
a) Elliptic Cylinder (Fig. 3.198): a + - =1. (3.425) 
b) Hyperbolic Cylinder (Fig. 3.199): = = ν =l: (3.426) 


c) Parabolic Cylinder (Fig. 3.200): — y^ = 2px. (3.427) 
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Figure 3.198 Figure 3.199 Figure 3.200 


3.5.3.14 Surfaces of Second Order or Quadratic Surfaces, General Theory 


1. General Equation of a Surface of Second Order 
αμα + azy? + ασ” + 2a191y + 2aogyz + 2a31zx + 2ayyx + 23919 + 2a34z + a44 = 0. (3.428) 


2. Telling the Type of Second-Order Surface from its Equation 

The type of a second-order surface can be determined from its equation by the signs of its invariants 

A, 6, S, and T from Tables 3.24 and 3.25. Here one can find the names with the normal form of 

the equation of the surfaces, and every equation can be transformed into a normal form. From the 

equation of the so-called imaginary surfaces it is impossible to determine the coordinates of any real 

point, except the vertex of the imaginary cone, and the intersection line of two imaginary planes. 
Table 3.24 Type of surfaces of second order with ὃ 4 0 (central surfaces) 


S-d>0, T>0 S- ô and T not both > 0 
Δ«0 Ellipsoid Hyperboloid of two sheets 
2 p 2 2. ,2 : 
= t EDT m 1 
a P e 
A>0 Imaginary ellipsoid 
ΝΣ; r γῇ 
g'gta-d atp at! 
A=0 Imaginary cone (with real vertex) Cone 
Soy 2 ga og 
μας 2 — =0 a cd) 
a? m p? * c a? PP ὦ 


3. Invariants of a Surface of Second Order 
Substituting aj, = api, then holds: 


Q11 012 413 Q14 


αι 99 d23 024 a11 012 013 
A= νων (3.429a) 6 =| a21 a22 Q23 |, (3.429b) 
431 032 033 Q34 
a31 032 433 
441 Q42 G43 Q44 
2 2 2 j 
S = ay + a22 + a33,(3.429c) T = a22033 + 433011 + 041022 — A33 — 43; — Q5. (3.429d) 


During translation or rotation of the coordinate system these invariants do not change. 
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Table 3.25 Type of surfaces of second order with 6 = 0 (paraboloid, cylinder and two planes) 
A«0 (here T » 0)! A0 (here T « 0) 
AZz0 Elliptic paraboloid Hyperbolic paraboloid 
Κο 2 πο; Ἢν 
x y x y 
το Ἔ πο Ξ παρ —— DSH E 
a? P a DP 


A=0) Cylindrical surface with a second-order curve as a directrix whose type 
defines different cylinders: For T > 0 imaginary elliptic, for T < 0 
hyperbolic, and for T = 0 parabolic cylinder, if the surface does not 
split into two real, imaginary, or coincident planes. The condition for 
splitting is: 

611 012 G14 022 023 024 
032 033 034 
Q42 G43 Q44 


Q11 413 Q14 
a31 033 434 
Q41 G43 G44 


E + =0 


21 022 G24 


Q41 G42 Q44 


1 For the quantities A and T see p. 228. 


3.5.4 Geometric Transformations and 
Coordinate Transformations 


1. Transformations 

Transformations describe the changes of the position or of the form of objects in the plane or in the 
space. There are two types of consideration which are in close relationship with each other [3.22]. In 
the first case points or objects are transformed in a fixed coordinate system. It is called geometric 
transformation (3.5.4.1, p. 229, 3.5.4.5,1., p. 234). 

In the second case the object remains unchanged while the coordinate system is transformed with re- 
spect to the object. With this coordinate transformation (3.5.4.3, p. 231, 3.5.4.5,2., p. 234) not the 
object, but its coordinate representations are modified. At problem solving one of them can be more 
reasonable than the other. 

2. Application Fields 

e Building parts which are described in their own object coordinate-system. 

e Description of motions of parts connected to each other (e.g. robots). 

e Reproduction of two-dimensional projections of three-dimensional objects. 

e Description of motions and deformations in computer graphics and computer animation. 


3.5.4.1 Geometric 2D Transformations 


1. Translations 
"Translation of a point P, given by Cartesian coordinates rp, yp, by ἐν in the direction of the positive 
x-axis, and by ty in the direction of the positive y-axis (Fig.3.201) results the new coordinates of the 
transformed point P'(x 5, γρ}: 

Gh—G3pckis Yp=yptty. (3.430) 
If the coordinates are described as column vectors, then the formulas for the new coordinates of the 
transformation and the reversal are 


αρ} _ (xp 2j ; p , A - is . 
P ) p ) + h , (3.431a) d uh i)? (3.431b) 
2. Rotation Around the Origin 


At a rotation the object is rotated around the origin by an angle a. If a > 0, then the rotation is 
counterclockwise (Fig.3.202). The mapping of the coordinates of a point P(ap, yp) is described by 
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[ 
l ROI 
ESSA 
» » 
| x x 
Figure 3.201 Figure 3.202 
| the relations 
Up = Tp > COSA — yp Sina , Up = Tp SiN a + yp: cosa. (3.432) 
The matrix form of (3.432) is 
wil dude bs 
Fi = Ee an M Er (3.4332) 
Up sina cosa \ yp 
The inverse of this transformation corresponds to a rotation by the angle —a: 
tp\ _ (cos(—a) —sin(—a)\ (a>) _ / cosa sina Y (ap 
Pa i Un cos(—a)/ \yp J ~ \ sina cosa) ορ) ^ p.95) 


3. Scaling with respect to the Origin 
At scaling transformations the coordinates are multiplied by s, and s, respectively (Fig.3.203). The 
transformation of a point P(rp, yp) is given by 

Bp = Sp vp, Up = Sy ` YP. (3.434) 
The matrix forms of this transformation and of its inverse are 


X5X e: 2) ur Ἢ d B αν 0 ) [5 - 
yp) ~\0 sy) \ yP)’ (3.435a) vey ZN 0 πο \yo) 7 (3.435b) 


Scaling results in a change of the size of the transformed object. A positive multiplier s, < 1 results 
in contraction of the object in z-direction. Consequently a factor s, > 1 results in an expansion. A 
negative factor s, « 0 results in a reflection with respect to the y-axis. The corresponding statements 
are valid for sy. 


Scaling transformations in special cases: 
e Reflection with respect to the x-axis: s, = 1, s, = —1. 
e Reflection with respect to the y-axis: 5; 
e Reflection with respect to the origin: s; 


4. Shearing 
At shear transformation the value of each coordinate is changed proportionally to the other. The for- 
mulas of this transformation are: 

αρ = TP + ar yp, Yp = YP ay: Tp. (3.436) 
The matrix form of this transformation (with notation m = 1 — a,a,) is seen in the following: 


αη la, x E l/m -—a,/mJ [x 

(τίς κ) ee ea wm GE) = Gem 
Fig.3.204 shows an example for shearing. 
5. Properties of Transformations 
The transformations introduced above are affine transformations, i.e. the (z', y’) coordinates of a trans- 
formed point P’ can be expressed by a system of linear equations of the original coordinates (x, y) of 
P. 
Remark: These transformations keep collinearity and parallelism, i.e. lines are transformed into lines, 
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8,= -2 E 
S- 5 
CEROS 
— | > ᾽ 
Figure 3.203 Figure 3.204 


and the images of parallel lines are parallel lines. Furthermore translation, rotation and reflection are 
distance- and angle-preserving mappings. 


3.5.4.2. Homogeneous Coordinates, Matrix Representation 

While the changes of coordinates in the case of rotation, scaling and shearing can be described by 
multiplication by a 2 x 2 type matrix (3.433a),(3.435a) and (3.437a), translation does not have this 
type of representation. To be able to handle all of these transformations in the same way, homogeneous 
coordinates are introduced. Every point in the plane gets an additional coordinate w 4 0. Point P(x, y) 
will have coordinates (x^, y^, w), where 


s=, α--. (3.438) 


In the followings w is fixed as w = 1. So point P(x, y) has coordinates (x, y, 1). Now the basic trans- 
formations can be given by a 3 x 3 type matrix in the following form: 


m maji m4» M13 TP 
yp | = | mar Mm mas yp |, ie, xp Μαρ holds. (3.439) 
1 0 0 1 1 
The matrices for translation, rotation, scaling and shearing are: 
1 0 f, cosa —sina 0 
τσ ο t, 1; (3.440) R(a) = | sina cosa 0], (3.441) 
001 0 0 1 
Translation matrix Rotation matrix 
s, 0 0 ο Ὁ 
S(sz,5y) = | 0 s0], (3.442) V(az,dy) = | a 10]. (3.443) 
001 001 
Scaling matrix Shearing matrix 


3.5.4.3 Coordinate Transformation 

With geometric transformations an object is transformed with respect to a fixed coordinate system. 
Whereas coordinate transformation gives the relation between the coordinate representations of a fixed 
object with respect to two different coordinate systems. 

The relation between the two types of transformations is represented in Fig.3.205. If the coordinate 


system is shifted by vector t, the coordinates of point P(ap, yp) become 2'p = ap — te, yp = yp ty. 
The translation of the coordinate system by the vector t results the same outcome as the translation 
of point P by ~t. 

The same consequences are valid for rotation and scaling. So, the transformation of the coordinate 
system is equivalent to the reverse transformation of the object. 
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— 
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Figure 3.205 


The coordinate transformations resulted by a translation, rotation or scaling can be given by 3 x 3 
transformation matrices T, R and S. Considering the matrices of geometric transformations (3.440)— 
(3.442) follows: 


(εν, ἐν) = T(-t,,—t)) = T !(t,,tj)) (3.444) R(a) = R(-o) = R! (a) (3.445 


S(s,, ον) = S(1/s,,1/sy) = S^! (s,, Sy) (3.446 


In this way all basic transformations can be described by a 3 x 3 transformation matrix M: 


αρ Mi, M12 Τις ΖΡ m 
Up | = | παι M2 M3 yp |, ie αῤ- Μορ. (3.447 
1 0 0 1 1 


3.5.4.4 Composition of Transformations 
Complex geometric transformations can be realized by combining different basic transformations. Le 
a sequence of transformations be given by matrices M1, Μ»,..., Μ,,. The consecutive execution o 
these transformations converts the point P(x,y) in n steps into P’. The transformation matrix M 
resulted by this sequence of mappings is the product of these matrices: 

ΜΞΜ,:Μ,.ι-...: Mo: ΜΙ. (3.448 
Similarly, for the reverse transformation 

Μ-!-ΜΓ'.ΜΣ'..... Μτὶ, :-ΜΤ'. (5.449 
So, instead of performing n times basic transformations in a sequence on a point, it is possible to give 
the matrix of the composite transformation, and to apply it directly. 
Every affine transformation can be given as a chain (composition) of translation, rotation and scaling. 
Even shearing can be given as a rotation R(o), a scaling S(s,, sy) and a further rotation R(3) applied 
after each other. The parameters a, β, s,, s, can be determined so, that V (a, b) = R(8)-S(sz, sy)-R(a) 
holds. 


HM Calculation of the transformation matrix of a rotation 
around an arbitrary point Q(r,, Y4) by an angle a: The com- 
posite transformation is the result of composition of the follow- 
ing basic transformations: 

1. Shifting of Q into the origin: M; = T(—z,, —yq). 

2. Rotation around the origin: M5 = R(a). 

3. Translation back the origin into Q: M; = M1! = T(z,, y,). 
The sequence of the single steps of these transformations see 
Fig.3.206. Point P is transformed via P, and P into P’. Figure 3.206 


M = Ms Μο M, = T(z;, yj) - R(a) - T(—zs, --ψρ) 
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104. cosa —sina 0 10 —x, cosa —sina z;(1—cosa) + y, sina 
=(0ly sina cosa 0 01—y | = | sina cosa y(l—cosa)-— rasina | . 
001 0 0 1 00 1 0 0 1 
Reflection with respect to a line given by the equation y = ma + n: 
- Translation of the line to pass through the origin: M; = T(0, —n). 
. Rotation of the line clockwise until it coincides with the x-axis: Mo = R(—a), with tana = m. 
Reflection with respect to the x-axis: M; = S(1, - 1). 
. Rotation back by a: My = Μο: = R(a). 
. Translation of the line back into the original position: M; = My! = T(0,n). 
M = T(0,n)- R(a)-S(1, -1)- R(—a) - T(0, =n) 


100 cosa —sina 0 1 00 cosa sino 0 10 0 
5501 ἡ sina cosa 0 0-10 —sina cosa 0 01-n|. 
001 0 0 1 0 01 0 0 1 00 1 


Use of the well known trigonometric relations sin a = m/Vm? + 1 and cosa = 1/v m? + 1 results the 
transformation matrix 


| 


1—m? 2m  —2mm 
πι 1 m?-c-1 m?41 
M= 2m  m?—1 2n 


m2+1m?+1 m?41 
0 0 1 


W Complete centered transformation of a rectangle shape cut 
with side lengths a and b into a similar one in a window with width 
c and height d (Fig.3.207). Sequence of the basic transforma- 
tions: 
1. Shifting of P(xp, yp) into the origin: ΜΙ = T(—rp, —yp). 
2. Clockwise rotation by an angle a: M» = R(—a). 
3. Scaling by factor s = s, = s, = min(c/a, d/b): Μη = S(s, s) 
4. Shifting of the origin into the center of the window: M4 = 
T(c/2, d/2). 

M = T(c/2,d/2) - S(s, s) Ἀ(--α): Τί--α». —yp) 


10c/2\ /s00 cosa sina 0\ /10 —rp 
—[014/2][0s0]|| —sina cosa 0 || 01 —yp 
00 1 001 0 0 1 00 1 


( scosa ssina (agp = ansima) | 


Figure 3.207 


—ssina scosa d/2—s(ypcosa — xpsina) 


0 0 


3.5.4.5 3D- Transformations 


'The mathematical description of geometric transformations and coordinate transformations in the 
three-dimensional space is based on the same ideas which have been discussed for the two-dimensional 
case in sections 3.5.4.1—3.5.4.4. Affine transformations of the three-dimensional space are the compo- 
sitions of basic transformations as: translation, rotation around one of the coordinate axes and scaling 
with respect to the origin. Using homogeneous coordinates these transformations can be given by 4x 4 
transformation matrices. As in the two-dimensional case, the composed transformations can be real- 
ized by matrix multiplications. 
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1. Geometric Transformations 
The transformation of a point P(xp, yp, zp) is performed according to the following rule: 


Y^p mii mij» mia ΤΙ αρ 
/ 4 
Yp | _ | Mar Map Mag Mma YP . tae Mrs. (3.450) 
Zp Thai M32 M33 M34 Zp 
1 0. ο Ὁ q 1 
The transformation matrices of the basic transformations are: 
1001, s, 000 
0190 τε 0 s, 00 
T (testy; te) = 001 i , (3.451) Slez Sy, 82) = 0 0 ME (3.452) 
0001 0 0 1 
Translation Scaling with respect to the origin 
Y. Ὁ 0 0 cosa 0 sina 0 
0 cosa —sina 0 0 Ἱ Ὁ Ὁ 
Βα(α) = 0 sina cosa 0 (3.453) Ἁγία) = —sina 0 cosa 0 | ^ (9.454) 
0 0 0 1 0 0 0 1 
Rotation around the z-axis Rotation around the y-axis 
cosa —sina 0 0 10 a, 0 
sina cosa 0 0 01a, 0 
R.(a) = 0 0 10 ,(3.455) Viy(Gz, ay) = 0 0 i 0 (3.456) 
0 0 01 0001 
Rotation around the z-axis Shearing parallel to the x, y-plane 


For a positive α the rotation is counterclockwise looking from the positive half of the coordinate axis 
to the origin. For reverse transformations the following relations are valid: 


Tl (te, ty, tz) = T(—t,,—t,—5), S τας, Sy, 82) = S(1/82,1/8y, 1/82), (3.457) 


5 


R;(a)-R,(-a) Εν (α) = R,(-a), R;'(a)- Ri(-a). (3.458) 


2. Coordinate Transformations 

Analogously to the two-dimensional case, the transformations of the coordinate systems have the same 
effects to the coordinate representations of a point as the reverse geometric transformation (see 3.5.4.3, 
p. 231). Hence, the matrices of the transformations are: 


T (t, ty, tz) = T(—te, —ty, -tz) = T^ (tty te), (3.459 
R,(a,) = R,(—o,) = R;!(o,), (3.460 
R,(a,) = R,(-a,) = Ry‘ (ay) ; (3.461 
R.(az) = R;(-o;) = Β. (αι), (3.462 
S(s,,5,,5.) = S(1/s,, 1/5, 1/6.) — S^! (s,, Sy, 82) - (3.463 
In practical applications it occurs quite often, that a particular transformation is replaced from a right- 
handed Cartesian coordinate system into another Cartesian coordinate system. The original one is 
often called as world coordinate system, the other one is called the local or object coordinate system . T 


the origin U (£u, yu, Zu) and the unit vectors 6ι = (lh, m4, nı}, 6] = (Io, ma, n3) and & = (I3, ma. na} 
of the local coordinate system are given in the world coordinate system, then the transformation from 
the world system into the local system and its reverse are given by the matrices 
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l m m —lz, — MYu — nz, hd lg ty 
lg Mmo n3 —ly£u — may, — 22, xrl mi; m» M3 Ὁ 
ατα η BAA M DEM 
l3 ma ng —lar, — may, — N3Žu ny Ny N3 Zu 


0 0 0 1 0 0 0 1 


zl 


If a point P has the coordinates (ap, yp, zp) in the world coordinate system and it has the coordinates 
(αρ. yp. Zp) in the local system, then the following equalities are valid: 


zp = Μι», (3.466) zp-M p. (3.467) 
If M, and M; denote the matrices of transformations from the world coordinate system into two local ΚΕ 


coordinate systems, then the transformation between the two local systems are given by the matrices: 


M-M;,.M; and M'‘=M,-M,’. (3.468) 
W Determination of the matrix of a rotation by an angle 0 around the line passing through the points 
P(xp,yp,zp) and Q(£4, Yq; Zq) with direction vector v = 
{ως vy, v; Lp — Tq, YP — Yq ZP — δα]. It is easy to choose 
P and Q in a distance of one unit, so V is a unit vector. First the 
line is transformed into the z-axis of the coordinate system. Then 
the rotation follows, by an angle 0 around the z-axis. Finally it 
follows the transformation of the line back into the original one. 
It is represented in Fig.3.208 how the transformation of the spa- 
tial line into the z-axis is performed. It consists of the following 
steps: 
1. Q is translated into the origin: Mı = T(—z,, —Yq, —24). 


2. Rotation around the z-axis so that the rotation-axis is mapped 


into the y, z-plane: My = R.(a.) with cosa, = vy/ /v2 + v? and 


x = [oy 932 
sina, = vef vz + Uy: 


Point P, has the coordinates (0, \/u2 + v2, vz). Pietre o 206 


3. Rotation around the z-axis by the angle o, until the image of the direction vector v is on the z-axis: 
M; = R,(a,), where cosa, = v./|V| and sina, = ,/v? + v3 / |ν]. The point P; has the coordinates 
(0,0, |v]). 

The matrix of transforming the direction vector into the z-axis is with m = ,/v2 + v2 and |ν] = 1: 


Ma = RE; (az) : Re (az) + T(—2q, Yq, —20) 


V, =, 
Z “00 
10 0 0 mom 100 =i; 
ο Vy U, acid 
[0v πι 0 ve Uy go 010 —y, 
0m v, 0 m m 001 —z 
00 ο 1/10 0 10]4000 1 
0 0 01 
Vy —Us 0 υχθα — UyXq Uy υρυ, 
S ~~ a στ == = y T. 
m m m m m di 
Ury VyVz UpVzLq + VyVzYq —VU, VyVz 
M,= : m mz, M = | — -Ξυ Y 
A m m m ? A m m y 79 
Us Uy Uz —UgXq — VyYq — UzZq 0 =M Uz 2q 


ο 0 0 1 0 001 
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Comparing M4 and M! with matrices (3.465) and (3.464) shows, that the transformation of the 
spatial line into the z-axis is identical to a coordinate transformation from the world coordinate system 
into a local coordinate system where the origin is Q and the z-axis-direction is V. In the local system 
the rotation is made around the z-axis. The matrix of transformation of the complete rotation is given 
by matrix (3.455) 


M-M;!-R.(0): M4 


Uy UzUz "E Uy —Us; 0 UxYq — VyXq 
m m 7? cosÜ —sin0 0 0 m m m 

E P A p b i ΒΕ 2), 2, bet sb. J. 1 
Ur UyUz ᾿ sinÜ cos0 00 LRL inh - VrVzTq + UyUs Ug 

= Uy Yq DUUM 

m mb c 0 0 10 m m m 

0 =m v 2 0 0 01/| % ον Vs TET y — VA 
Us UPS 0 0 0 1 


In section 4.4, p. 289 an alternative method is given to describe rotation transformations by using the 
properties of quaternions. 


Figure 3.209 


3.5.4.6 Deformation Transformations 
'The affine transformations discussed in section 3.5.4.5, p. 233 change the position of objects and result 
in stretching or compressing in given directions. If the elements of the transformation matrix M — 
(mij) are not constants, as they have been until now, but they are functions of position, then this will 
lead to a generalized class of structure-changing transformations. The elements of the matrix now have 
the form: 

mij = mi(m, y, 2). (3.469) 
1. Contraction 
This transformation is a generalization of scaling. At a contraction in the direction of the z-axis the 
scaling parameters Sz, s, are functions of z. The matrix of transformation is: 


s,(z 0 00 
S(s(2),5,(2),)=| à 9999]. (3.470) 
0 0 01 


The functions s,(z) and s,(z) define the contraction-profile. If s,(z) > 1, then the transformed object 
is stretched in the direction of the z-axis, if ,(z) < 1, then the object is compressed. Fig.3.209b shows 
the result of the transformation of the unit-cube in Fig.3.209a by functions s,(z) = s,(z) = 1/(1— 2°). 
2. Torsion around the z-axis 
This transformation is the generalization of the rotation around the z-axis. The angle of rotation is 
changing along the z-axis. The matrix of transformation is: 
cosa(z) —sina(z) 0 0 
sina(z) cosa(z) 0 0 

0 0 10 

0 0 01 


(3.471) 
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The function a(z) defines the angle of rotation along the z-axis. Fig.3.209c shows the rotation of the 
π 

=% 
4 

3. Bending Around the x-axis 

At bending the angle of rotation changes in a direction perpendicular to the rotation-axis. The matrix 

of transformation has the following form: 


1 0 0 0 
0 cosa(z) —sina(z) 0 
0 
1 


unit-cube with a(z) 


Rage) = 0 sina(z) cosa(z) (3.472) 
0 0 0 


Bending of the unit-cube by a(z) = =z is shown in Fig.3.209d. 


Co 


These deformations can be applied in a sequence like affine transformations. The object in Fig.3.209e 
is the result of a contraction then a torsion of the unit-cube. 


3.5.5 Planar Projections 


There are several methods to visualize three-dimensional objects in a two-dimensional media [3.22]. 
Among them planar projections are very important. A planar projection is a mapping, where points of 
a three-dimensional space are assigned to points of a plane. An image point is given as the intersection 
point of this plane and a ray connecting an observer with the space-point. The plane is called a pro- 
jection plane or a picture plane, the ray is called projecting ray, and its direction is the direction of the 
projection . 


Figure 3.210 Figure 3.211 Figure 3.212 


3.5.5.1 Classification of the projections 


1. Central Projection 

The central projection which is called also perspective projection the projecting rays start from a common 
center point (Fig.3.210). Objects being farther from the center of perspectivity C are represented 
smaller than the ones being closer to the center. Parallel lines, which are not parallel to the projection 
plane, become not parallel and they intersect in a so called vanishing point. Perspective projections 
give a realistic impression of the object for the viewer. But relations of lengths and angles are lost at 
this mapping. 

2. Parallel Projection 

At the parallel projection the projecting rays are parallel to each other (Fig.3.211). Line segments, 
which are not parallel to the projection plane are shortened, and angles are usually distorted. 

1. Orthogonal Parallel Projections A parallel projection is an orthogonal projection if the di- 
rection of the projecting ray is perpendicular to the picture plane. If furthermore the picture plane is 
perpendicular to one of the coordinate-axes, then it is an orthographic projection or principal projec- 
tion, which is well known from industrial designs. 
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If the direction of projection is not perpendicular to any coordinate-axis, then the orthogonal projection 
is called an axonometric projection. 
2. Oblique Parallel Projections A parallel projection is an oblique projection if the direction of 
the projecting ray is not parallel to the normal vector of the picture plane (Fig.3.212). Special cases 
of oblique projections are Cavalier projection and cabinet projection . 
Sometimes parallel projections preserve the ratio of sizes, but they seem less realistic than perspective 
representations. 
3.5.5.2 Local or Projection Coordinate System 
Defining the direction of the projection plane and the coordinates of the result of a projection in the 
world coordinate system is not reasonable. It seems to be useful to apply a picture coordinate system 
whose z, y-plane is identical to the projection plane. This picture system represents the projection from 
the viewpoint of an observer who is looking perpendicularly to the projection plane. 
Let the projection plane be given by a reference point A(r,,y,,z,) and a unit normalvector i = 
{nx,Ny,nz}. Then the picture coordinate system is defined in the following way. The origin is placed 
into R(z,, Yr, zr). Among the coordinate unit vectors 65, €; and 6j first &j, = ñ is fixed. An additiona 
information is needed to fix the coordinate vectors & and & in the picture plane. An ,,upward” vector 
üis chosen in the world coordinate system, and its projection into the picture plane defines the vertica 
direction i.e. the y'-direction of the picture system. The normed vector product of ii and ñ defines the 
vector &;. Summary: 
üx 
& =ñ, & = r, ὅ--σιχσ,. (3.473 
[1 x xi] 

The transformation matrices of the mapping from the world coordinate system into the picture system 
and their inverses are given as in (3.464) and (3.465) by substituting the corresponding coordinates o 
point R(z,, Yr, zr) and vectors &;, 6; and 6j. 


3.5.5.3 Principal Projections 


The projection is perpendicular to a plane which is perpendicular to one of the coordinate-axes. De- 
pending on the direction of the projection and the viewing direction to the picture plane, the ground 
plan, the top view or one of the side views is formed on the projection plane. 

The matrix form of the orthogonal projection of a point P(xp, yp, zp) to the projection plane being 
parallel to the x, y-plane with equation z = zp is 


Tp 1000 Tp 
yp | [0100 yp 
Zp a 000 20 Zp ` (3.474) 
1 000 1 1 


In general, the projection plane is chosen zy = 0. The matrices of projections to the coordinate planes 
are 


P, = (3.475) 


Ge acm 


1000 
0100 
0000 
0001 


3.5.5.4 Axonometric Projection 

In contrary to the orthographic projection, now the normal-vector ii = (n;, ny, nz} of the projection 
plane and the direction of the projection are not parallel to any of the coordinate axes. There are three 
different cases to consider: 

ο Isometric: The angles of Π are the same with every coordinate axis. So, for the coordinates of ñ, 
μια] = |ny| = |n;|. The angle between the projected coordinate axes is 120°. Line segments being 
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parallel to the coordinate axes have the same distortion factor (Abb.3.213a). 
ο Dimetric: ñ has the same angle with two of the coordinate axes. In these directions equal distances 
remain equal. Two of the coordinates of Π have the same absolute value (Fig.3.213b). 


ο Trimetric: ni has different angles with the coordinate axes. Consequently the coordinate axes have 
different distortion factors (Fig.3.213c). 


Ζ 


a) X y 


Figure 3.213 


3.5.5.5 Isometric Projection 
Consider the case when the projection plane contains the origin of the world coordinate system, and 

hen it Fedori 1 1 1 
when its normal vector is i = 4 —=, =, —— p. 

V3 V3 V3 

In order to determine the matrix of projection, a coordinate transformation into the projection co- 
ordinate system is combined with a subsequent projection along the z'-axis to the z^, y-plane. The 
definition of the picture coordinate system corresponds to (3.473). The upward vector is chosen as 
ü = {0,0,1}. In this way the z-axis is mapped to the y'-axis. The unit basis-vectors of the picture 
coordinate system are: 


g..Hxm fJ 3. d πα ου 
- qux ail νο yat J’ pos V6 V6 ν6/᾽ 


g-ü- 1 1 1 
: /$ vs να] 
The transformation matrices of mapping from the picture system into the world coordinate system and 
reverses with (3.464) and (3.465) are: 


1 1 1 0 1 1 0 0 
V2. v6 νᾶ V2 v2 
1 1 i 0 1 1 2 0 
Ματ] v2 v6 V3 |, My=Mi=| vé vé vo |. (3.476) 
2 1 1 1 1 
0 => == 0 => == c 
v6 v3 V3 νᾶ νᾶ 
0 0 0 1 0 0 0 1 
Now, in the picture system the orthogonal projection along the z'-axis is: 
1 1 
NULLE M, 1 1 
2 2 == — Ὁ 0 
1000 1 i 2 a : > 
x 0100 = cu 
=P, = zuo Sr Ie 
P,=P.M, 0000 ys y» vb Je "vé VE (3.477) 
0001 = — — 0 0 0 0 0 
Va νη v3 0 0 0 1 
0 0 0 1 
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The projection matrix P4 maps the points from the world coordinate system into the z^, y'-plane of 
the picture system. By multiplication with matrix M 4 one obtains the points in the world coordinate 
system. The projection-matrix of the complete projection is: 


2-1-10 
1|l-1 2-10 
P =M,P.M,' = 311-1 20 (3.478) 
0 0 03 


3.5.5.6 Oblique Parallel Projection 
At an oblique projection the projecting rays intersect the projection plane in an angle β. In Fig.3.214 
the oblique projection of point P(xp, yp, zp) is P'(xp, yp. Zp) and its orthogonal projection is Pj. L 


is the length of the projected line segment Pj P'. The projection is characterized by two quantities. 
d= — = — 

2 tan 8 
scaled. a is the angle between the x-axis and the projected image of the perpendicular line segment. 
Then the rule for the coordinates of the whole projection is: 


gives the factor by which the line segment perpendicular to the projection plane is 


Tp = £p — zpdcosa, yp=yp—2zpdsina, 2=0 or (3.479) 
Lp 1 0 —dcosa 0 xp 
yp | [01 —dsina 0 Up 
*~\=lo00 o ο] |: (3.480) 
1 00 0 1 1 


If the projection plane is different from the x, y-plane, then a coordinate transformation into the picture 
system must be done before applying the projection-matrix (see 3.5.5.2, p. 238). 


- £4 


L730" y 


Figure 3.214 Figure 3.215 Figure 3.216 


W Cavalier projection of the unit cube with a = 45° and d = 1, i.e. 8 = 45°. Segments being perpen- 
dicular to the projection plane do not become shorter in this case (Fig.3.215). 
The four vertices not lying in the «x, y-plane are put as columns into a matrix. This matrix multiplied 
by the matrix of the projection will give the coordinates of the projected points: 


1 1 1 1 1 

1 1 10 -—0 
v2 v2 y? v2 νὰ 0101 
1 1 0011 

1 1 -|01 0 
V2 ν y2 νὰ V2 1111 
0 0 0 0 00 0 ο rir 

1 1 1 1 00 0 1 


Β Cabinet projection of the unit cube with a = 30°, d = 1/2, i.e. 8 = 63.4°. The lengths of segments 
perpendicular to the projection plane are halved (Fig.3.216). 
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The coordinates of the projections of the four vertices not lying in the x, y-plane are calculated by 


ΝΕ νὸ νᾶ V3 v3 

"S 4 r: 4 10-7" 0101 
1 1 3 3 1 0011 
a ur e = = T..-— 

4 4 4 4 0 i! 1111 
0 0 0 0 00 0 0 L3 d 
1 1 1 1 00 0 1 


3.5.5.7 Perspective Projection 
1. Mapping Formulation i 


The formulation of the mapping of a perspective projection can be given reasonably in the picture 
coordinate system. The origin is chosen so that the center of projection lies on the z-axis. 

As it can be seen in Fig.3.217, the relations between the coordinates of a point P(xp, yp, zp) and its 
projected image P’(’p, yp, zp) can be calculated 
using equations of the intercept theorems (see 
3.1.3.2,5., p. 135): 


Up 20 1 Yp 1 

x = = = = zs 
T ZP? q eet 
x 20 — 2 EB Up 1-2 

P(x 2 Ῥ 0 Ῥ 1 E 
re ( p/Ypr ») Zo Zo 
- zæ =0. 
The relations between the original and the im- 
> age coordinates are not linear. However using 


the properties of homogeneous coordinates (see 
3.5.4.2, p. 231) the projection rule can be given 


Figure 3.217 in the following matrix form: 
Up Up r i y ; Up 
yt yp 
πι ο [5109 9 Bl] 5 (3.481) 
1 1-2 oo-+1}\1 


2. Vanishing points 

Perspective projections have the property, that parallel lines, which are not parallel to the picture 
plane, seem to intersect each other in a point. This point is called vanishing point. The vanishing 
points of lines being parallel to the coordinate axes are called principal points or principal vanishing 
points. The number of principal points is equal to the number of coordinate axes intersecting the picture 
plane. In Fig.3.218 there are perspective representations with one, two and three principal points. The 


Figure 3.218 
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principal points coincide with the intersection points of the picture plane and the rays being parallel to 
the coordinate axes. If the center of projection is C (£e, Ye, Zc), a point of the picture plane is R(x,, Yr, 2r) 
and the normal vector is i = {n,,n,,n-}, then the coordinates of the principal points are: 


d d d 
Fylte + —, θε, ἕο), Fylte, Ye + —, Ze), Fr (Xe, Yer ze + —) where (3.482a) 
fig Ny nz 
d — (€— i): ii — (x, — z,)ns + (Ye = Yny + (Ze — 2r)Nz (3.482b) 


in the case they exist. If a coordinate of the normal vector ñ is zero, then there is no principal point in 
that direction. 
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3.6 Differential Geometry 


In differential geometry planar curves and curves and surfaces in space are discussed by the methods 
of differential calculus. Therefore it is to be supposed that the functions describing the curves and 
surfaces are continuous and continuously differentiable as many times as necessary for discussion of 
the corresponding properties. The absence of these assumptions is allowed only at a few points of the 
curves and surfaces. These points are called singular points. 

During the discussion of geometric configurations with their equations there are to be distinguished 
properties depending on the choice of the coordinate system, such as intersection points with the co- 
ordinate axes, the slope or direction of tangent lines, maxima, minima; and invariant properties inde- 
pendent of coordinate transformations, such as inflection points, curvature, and cyclic points. There 
are also local properties, which are valid only for a small part of the curves and surfaces as the curva- ns 
ture and differential of arc or area of surfaces, and there are properties belonging to the whole curve or 
surface, such as number of vertices, and arc length of a closed curve. 


3.6.1 Plane Curves 
3.6.1.1 Ways to Define a Plane Curve 


1. Coordinate Equations 
A plane curve can be analytically defined in the following ways. 
1. In Cartesian coordinates: 
a) Implicit: F(x,y) — 0, (3.483 
b) Explicit: y- (3.484 
2. In Parametric Form: x= v(t), y=y(t). (3.485 
3. In Polar Coordinates: p= f(y). (3.486 


2. Positive Direction on a Curve 

If a curve is given in the form (3.485), the positive direction is defined on it in which a point P (x(t), y(t)) 
of the curve moves for increasing values of the parameter t. If the curve is given in the form (3.484), 
then the abscissa can be considered as a parameter (x = x, y = f(x)), so the positive direction holds for 
increasing abscissa. For the form (3.486) the angle can be considered as a parameter o (x = f(y) cosy, 
y = f(y) sin p), so the positive direction holds for increasing y, i.e., counterclockwise. 

E Fig.3.219a, b,c: A:z—0,y—0b, B:y-sinz, C:p=ay. 
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a) b) 


Figure 3.219 


3.6.1.2 Local Elements of a Curve 

Depending on whether a changing point P on the curve is given in the form (3.484), (3.485) or (3.486), 
its position is defined by x, t or y. A point arbitrarily close to P with parameter values x + da, t + dt 
or y + dy is denoted here by N. 

1. Differential of Arc 

If s denotes the length of the curve from a fixed point A to the point P, the infinitesimal increment 


As —PN can be expressed approximately by the differential ds of the arc length, the differential of 
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are: 


2 
1+ (2) dx for the form (3.484), (3.487) 
m 


As πό ds = = - 
y xz"? + y"? dt for the form (3.485), (3.488) 
Ve? + p? ἂρ for the form (3.486). (3.489) 


E A: y=sinz, ds = V1-ccos?rdr.W, B: xz — C, y =t, ds = ἐν 4 + 9t dt. 
MC: p=ay (a>0), ds = αν] + p? dy. 


Figure 3.220 Figure 3.221 Figure 3.222 


2. Tangent and Normal 

1. Tangent at a Point P toa Curve isa line in the limiting position of the secants PN for N > P; 
the normal is a line through P which is perpendicular to the tangent here (Fig. 3.220). 

2. The Equations of the Tangent and the Normal are given in Table 3.26 for the three cases 
(3.483), (3.484), and (3.485). Here x, y are the coordinates of P, and X, Y are the coordinates of the 
points of the tangent and normal. The values of the derivatives should be calculated at the point P. 


Table 3.26 Tangent and normal equations 


Type of equation | Equation of the tangent Equation of the normal 
| OP. 10F 7 να Y-y 
(3.483) zm (X — x) 4 By (Y -y)=0 OF OF 
Ox Oy 
3.484 yoga ο Y-y== {X 
(3.484) πι =z) -ν--ῳΐ — c) 
απ 
Y- X-r 
(3.485) κ οι a'(X a2) y(Y - y) 20 
y x 
Examples for equations of the tangent and normal for the following curves: 
E A: Circle αὖ + y? = 25 at the point P(3, 4): 


a) Equation of the tangent: 2r(X — x) + 2y(Y — y) = 0 or Xa + Y y = 25 considering that the point 
P lies on the circle: 3X + 4Y = 25. 

: Χ-α Y-y 
b) Equation of the normal: = : 
21 2y 


I 


4 
orY — yx. at the point P: Y = g^ 
x 


W B: Sine curve y = sin x at the point 0(0, 0): 
a) Equation of the tangent: Y — sin z = cosa(X — x) or Y = X cosx + sina — rcosz; at the point 
(0,0): Y = X. 
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b) Equation of the normal: Y — sin x (X—z)orY X sec x 4- sin x + z sec x; at the point 
cos x 
(0,0): Y = -X. 
Bl C: Curve with x = t°, y = t? at the point P(4, —8),t = —2: 
y-606 x-BP 3 ds 
a) Equation of the tangent: 38 22 orY — ix "E at the point P: Y = —3X 54. 


b) Equation of the normal: 2t (X — 13) + 32 (Y — 9) = 0 or 2X + 3tY = t? (2 + 309); at the point 
P(4,—8): X —3Y = 28. 

3. Positive Direction of the Tangent and Normal of the Curve If the curve is given in one of 
the forms (3.484), (3.485), (3.486), the positive directions on the tangent and normal are defined in the 
following way: The positive direction of the tangent is the same as on the curve at the point of contact, 
and one gets the positive direction on the normal from the positive direction of the tangent by rotating 
it counterclockwise around P by an angle of 90° (Fig. 3.221). The tangent and the normal are divided 
into a positive and a negative half-line by the point P. 

4. The Slope of the Tangent can be determined 

a) by the angle of slope of the tangent a , between the positive directions of the axis of abscissae and 
the tangent, or 

b) if the curve is given in polar coordinates, by the angle jz, between the radius vector OP (OP = p) 
and the positive direction of the tangent (Fig. 3.222). For the angles a and ju the following formulas 
are valid, where ds is calculated according to (3.487)- (3.489): 


l; dx d 
ana = E. , cosa = = , sina = i (3.490a) 
τ LE B (3.490b) 
an μ = , cosH=—, sinp= ; .490b 
d c c mc 
dy 
WA . t 1 , COS X 
:y=sing ana = cos x, sa = ——, sina = —————; 
:y=sing, Wa = COS T, COS a IET WIES 
BB: c=, y=? te 2 cosa = ER sina — em 
Mon 2r LE μμ 43 98" i να τος 
1 : 
B C: p= ap, tanu = p, cos JL = sin 4 = 2 


Vite’ Vite 
5. Segments of the Tangent and Normal, Subtangent and Subnormal (Fig. 3.223) 
a) In Cartesian Coordinates for the definitions in form (3.484), (3.485): 


- |y i A91; 

PT =| 4/1 ΄ segment of the tangent), 3.491a 
| x yl+y (seg f gent) ( ) 

PN = yy 124 y? (segment of the normal), (3.491b) 

PTS z| (subtangent), (5.4916) P'N = |yy'| (subnormal). (3.491d) 

b) In Polar Coordinates for the definitions in form (3.486): 
PT = La V p? 4- p? (segment of the polar tangent), (3.492a) 
p 
PN = a +p? (segment of the polar normal), (3.492b) 
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ΟΤΙ = E (polar subtangent), (3.492c) ON' = |p'| (polar subnormal). (3.492d) 


/ 


E A: y = coshz, y' = sinha, /14- y? = coshz; PT = |coshzcothz|, PN = |cosh?z|, PT = 


|coth z|, P’N = |sinh x cosh z|. 
B B: p=ay (a>0), p =a, yp? +p? = αγ] +p; PT! = lap 1 4 q? αγ 1 +g? 


, ON! =a. 


,PN'- 


, OF = 


ag? 


Figure 3.223 Figure 3.224 Figure 3.225 


6. Angle Between Two Curves The angle B between two curves Γι and I at their intersection 
point P is defined as the angle between their tangents at the point P (Fig. 3.224). By this definition 
the calculation of the angle 8 has been reduced to the calculation of the angle between two lines with 
slopes 


kı = tana, = ($) ; (3.493a) k = tanas = (2) , (3.493b) 
dz } p dz} p 


Here y = f(x) is the equation of Γι and y = fo(a) is the equation of I>, and the derivatives are 
calculated at the point P. One gets 6 with the help of the formula 


tan αο — tana, 


tan — tan(o, — a3) = : 3.494 
(αι 2) 1 + tana, tanag ( ) 
Bl Determine the angle between the parabolas y = yx and y = x” at the point P(1, 1): 
ανα 1 d (a? tano» — tana 3 
tana, = ve Ξ-, tanag= ur) —2, tanp a IË, 
dz}, 2 dr {νι 1 +tanaı tanay 4 
3. Convex and Concave Part of a Curve 
If a curve is given in the explicit form y = f(x), then a small part containing the point P can be 


examined if the curve is concave up or down here, except of course if P is an inflection point or a 
singular point (see 3.6.1.3, p. 249). If the second derivative f”(x) > 0 (if it exists), then the curve is 
concave up, i.e., in the direction of positive y (point P» in Fig. 3.225). If f”(x) < 0 holds (point P), 
then the curve is concave down. In the case if f”(x) = 0 holds, it should be checked if it is an inflection 
point. 


B y = 2? (Fig. 2.15b); y" = 6x, for x > 0 the curve is concave up, for x < 0 concave down. 
4. Curvature and Radius of Curvature 


1. Curvature of a Curve The curvature K of a curve at the point P is the limit of the ratio of the 
angle 6 between the positive tangent directions at the points P and N (Fig. 3.226) and the arc length 
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PN for ΡΝ- 0: 


K= lim E : (3.495) 
PN=>0 PN 


The sign of the curvature K depends on whether the curve bends toward the positive half of the normal 
(K > 0) or toward the negative half of it (K < 0) (see 3.6.1.1, 2., p. 245). In other words the center of 
curvature for K > 0 is on the positive side of the normal, for K < 0 it ison the negative side. Sometimes 
the curvature K is considered only as a positive quantity. Then it is to take the absolute value of the 
limit above. 
2. Radius of Curvature of a Curve The radius of curvature R of a curve at the point P is the 
reciprocal value of the absolute value of the curvature: 
1 
Π---- 3.496 
K (3.496) 
The larger the curvature K is at a point P the smaller the radius of curvature R is. 
ΒΑ: Fora circle with radius a the curvature K = 1/a and the radius of curvature R = a are constant 
for every point. 
W B: For a line with K = 0 holds R = oo. 


3. Formulas for Curvature and Radius of Curvature 
With the notation ὃ = da and PN= ds (Fig. 3.226) in gen- 


eral 
da ds 
Κ---, Π-|--. 3.497 
, ΠΕΝ da ( ) 
For the different defining formulas of curves in 3.6.1.1, p. 243 
Figure 3.226 the different expressions for K and R are: 
273/2 
, ὯΝ, 
Py 1+ (2) | 
E] x 
Definition as in (3.484): K = > zx R- d (3.498) 
ay : 
at ait 
y Ca xy?) / 
Definition as in (3.485); K = ————.,, R= TERT (3.499) 
(x? + Aye voy 
y n y" 
Pis Foy P; 
Εν Ἐν ἵν (E+ EA 
Definition as in (3.483): .K 2 —————"——— R= : (3.500) 
(F, 2 ΕΕ 2 Frs Fry Fr 
: d Fy Fyy Fy 
Ες Fy 0 
3 9 m n 2 1253/2 
Definition as in (3.486): K = ΞΡ — PP (o + p”) (3.501) 
(p? + p2y? p? + 2p? — pp" 
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1 2 8 - 
E A: y=coshe, K= ——5 EB: tsr y—-t, K= v3 
τ ` cosh? x ' d t(4+ 9t2)°/? 
2 2 
—— 2 a , TM . 1 e^ +2 
MC: yo —x =a’, CETA E D: p=ap, κ--' (gh 19m 


5. Circle of Curvature and Center of Curvature 


1. Circle of Curvature at the point P is the limiting position of the circles passing through P and 
two points N and M of the curve from its neighborhood, for N + P and M — P (Fig. 3.227). It 
goes through the point of the curve and here it has the same first and the same second derivative as the 
curve. Therefore it fits the curve at the point of contact especially well. It is also called the osculating 
circle. Its radius is the radius of curvature. It is obvious that it is the reciprocal value of the absolute 


value of the curvature. 


2. Center of Curvature The center C of the circle of curvature is the center of curvature of the 


point P. It is on the concave side of the curve, and on the normal of the curve. 


3. Coordinates of the Center of Curvature The coordinates (xc, yc) of the center of curvature 
for curves with defining equations in 3.6.1.1, p. 243 can be determined from the following formulas. 


dy dy\* 
22 E Te 
dx | T (2) | 


Definition as in (3.484): rc = diy ; c diy 
da? da? 
dhl? 2 affa me 
Definition as in (3.485): zc —r— ο... Ὁ ; yc=y+ ΤΙ 
x" y" qz" y" 


(p? + p°) (pcos p + p'sing) 
pP EN 2p? = pp" ? 
(p? + p?)(psin e — p' cos p) 


yo = psing 3X7 PT 


Definition as in (3.486): vc = pcos y 


Definition as in (3.483): zc — r4 yc=y+ 


Fus "m Fi Pus Ταν Fy ` 
ye Fyy Fy νε Fyy Fy 
These formulas can be transformed into the form 
ro —r— Rsina, yo =y + Rcosa or 
roi ne yo= yt RE 


(Fig. 3.228), where R should be calculated as in (3.498)—(3.501). 


Figure 3.227 Figure 3.228 Figure 3.229 


F, (Ε5 + F?) 


(3.502) 


(3.503) 


(3.504) 


(3.505) 


(3.506) 
(3.507) 
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3.6.1.3 Special Points of a Curve 

In the following are to be discussed only the points remaining invariant during coordinate transforma- 
tions. In order to determine maxima and minima see 6.1.5.3, p. 444. 

1. Inflection Points and the Rules to Determine Them 

Inflection points are the points of the curve where the curvature changes its sign (Fig. 3.229) while 
a tangent exists. The tangent line at the inflection point intersects the curve, so the curve is on both 
sides of the line in this neighborhood. At the inflection point K = 0 and R = œ hold. 


1. Explicit Form (3.484) of the Curve y = f(x) 


a) A Necessary Condition for the existence of an inflection point is the zero value of the second 
derivative 

f" (x) =0 (3.508) 
if it exists at the inflection point (for the case of non-existent second derivative see b)). In order to 
determine the inflection points for existing second derivative all the roots of the equation f"(x) = 0 
with values z1, £2,..., Ti, ..., Xn are to be considered, and to be substituted into the further deriva- 
tives. If for a value x; the first non-zero derivative has odd order, then there is an inflection point here. 
If the considered point is not an inflection point, because for the first non-disappearing derivative of 
kth order, k is an even number, then for f(x) < 0 the curve is concave down; for f (a) > 0 it is 
concave up. If the higher-order derivatives are not be checked, for instance in the case they do not exist, 
see point b). 
b) A Sufficient Condition for the existence of an inflection point is the change of the sign of the 
second derivative f"(r) while traversing from the left neighborhood of this point to the right, if also 
a tangent exists here, of course. So the question, of whether the curve has an inflection point at the 
point with abscissa x;, can be answered by checking the sign of the second derivative traversing the 
considered point: If the sign changes during the traverse, there is an inflection point. (Since v; is a root 
of the second derivative, the function has a first derivative, and consequently the curve has a tangent.) 
This method can also be used in the case if y" = oo, together with the checking of the existence of a 
tangent line, e.g. in the case of a vertical tangent line. 

1 n d= 3a? 1 UV de a τη 

1-2) f (2) ga + 229 he V3 if (2) 7 a + x?)4 if (ma) P 


Inflection points: A (5 ' i) . B [-- ; i) : 


BB: y — x^, f'(x) = 121’, αι — 0, f"(x) = 24x, f(a) — 0, f'V(v) = 24; there is no inflection 
point. 


W A: y 


= 


“a n 10 


1 
sy 9 x 3; for x = 0 we have y" = oo. 


5 
MC: y=23,y 


As the value of x changes from negative to positive, the second derivative changes its sign from “—” to 
“+”, so the curve has an inflection point at x = 0. 

Remark: In practice, if from the shape of the curve the existence of inflection points follows, for in- 
stance between a minimum and a maximum with continuous derivatives, then there are to be deter- 
mined only the points x; without checking the further derivatives. 

2. Other Defining Forms of the Curve The necessary condition (3.508) for the existence of an 
inflection point in the case of the defining form of the curve (3.484) will have the analytic form for the 
other defining formulas as follows: 


a) Definition in parametric form as in (3.485): — 0. (3.509) 


b) Definition in polar coordinates as in (3.486): p 4- 2p? — pp" =0. (3.510) 
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Pex Fj F, 
c) Definition in implicit form as in (3.483): F(a,y)=0 and [Ες Fy Fj|—0. (3.511) 
pH ϐ 
a fy 


In these cases the system of solutions gives the possible coordinates of inflection points. 


1 1 
MA: «=a ( E sin t) > y=a (1 τ 5 60s t) (curtated cycloid (Fig. 2.68b), p. 102); 


x yl 
x" y" m 


2 * 
a^|2 — cost sint 


sinf cost 


A 1 
T = T (2cost -- 1); cos ἐκ PL tk = F2km (k =0,+1,+2,...). 


The curve has an infinite number of inflection points for the parameter values ἐμ. 


1 ‘ : 1 1 3 1 , 
HB: p= N ; ph 2p? — pp" = 5 + a5 a = ποια 1). The inflection point is at the angle 


p=1/2. 
Fac 2 0 2x 
B C: 2? —y? =a? (hyperbola). - τσ 0 —2 -2y — 8x? — 8y?. The equations x? — y? = a? 
2r —2y 0 


and 8(a? — y?) = 0 contradict each other, so the hyperbola has no inflection point. 


Figure 3.230 


2. Vertices 
Vertices are the points of the curve where the curvature has a maximum or a minimum. The el- 
lipse has for instance four vertices A, B, C, D, the curve of the logarithm function has one vertex 
at E (1/2, — 1n 2/2) (Fig. 3.230). The determination of vertices is reduced to the determination of 
the extreme values of K or, if it is simpler, the extreme values of R. The formulas (3.498)-(3.501) can 
be used for the calculations. 

3. Singular Points 

Singular point is a general notion for different special points of a curve. 

1. Types of Singular Points The points a), b), etc. to j) correspond to the representation in 
Fig. 3.231. 

a) Double Point: At a double point the curve intersects itself (Fig. 3.2312). 


b) Isolated Point: An isolated point satisfies the equation of the curve; but it is separated from the 
curve (Fig. 3.231b). 


c), d) Cuspidal point: At a cuspidal point or briefly a cusp the orientation of the curve changes; ac- 
cording to the position of the tangent a cusp of the first kind and a cusp ofthe second kind (Fig.3.231c,d) 
can be distinguished. 


e) Tacnode or point of osculation: At the tacnode the curve contacts itself (Fig. 3.231e). 


f) Corner point: Ata corner point the curve suddenly changes its direction but in contrast to a cusp 
there are two different tangents for the two different branches of the curve here (Fig.3.231f). 


g) Terminal point: At a terminal point the curve terminates (Fig.3.231g). 


h) Asymptotic point: In the neighborhood of an asymptotic point the curve usually winds in and out 
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or around infinitely many times, while it approaches itself and the point arbitrarily close (Fig.3.231h). 


i), j) More Singularities: It is possible that the curve has two or more such singularities at the same 
point (Fig. 3.231i,j). 


LSDPE 
KE ο M 3 τ 


Figure 3.231 


2. Determination of the Tacnode, Corner, Terminal, and Asymptotic Points Singularities 
of these types occur only on the curves of transcendental functions (see 3.5.2.5, p. 195). 

The corner point corresponds to a finite jump of the derivative dy/dx. 

Points where the function terminates correspond to the points of discontinuity of the function y = f(x) 
with a finite jump or to a direct termination. 

Asymptotic points can be determined in the easiest way in the case of curves given in polar coordinates 
asp = f(y). If for p — oo or φ — —oo the limit is equal to zero (limp = 0), i.e., the pole is an 
asymptotic point. 


). 


W A: The origin is a corner point for the curve y = 
l+ez 


1 
E B: The points (1,0) and (1, 1) are points of discontinuity of the function y = ———34- (Fig. 2.8) 


(see 2.1.4.5, p. 106). eet 
HC: The logarithmic spiral p = ae’? (Fig. 2.75) (see 2.14.3, p. 106) has an asymptotic point at the 
origin. 
3. Determination of Multiple Points (Cases from a) to e), and i), and j)) Double points, 
triple points, etc. are denoted by the general term multiple points. To determine them, one starts with 
the equation of the curve of the form F(x,y) = 0. A point A with coordinates (σι, y1) satisfying the 
three equations F = 0, F, = 0, and F, = 0 is a double point if at least one of the three derivatives of 
second order Fy, Fry, and Fy, does not vanish. Otherwise A is a triple point or a point with higher 
multiplicity. 
The properties of a double point depend on the sign of the Jacobian determinant 
A- [Es τν Fay (3.512) 


e ἝΞ 


ipe Vi 
Case A « 0: For A < 0 the curve intersects itself at the point A; the slopes of the tangents at A are 
the roots of the equation 
Fyyk? + 2Fiyk + Foe = 0. (3.513) 
Case A > 0: For A > 0 A is an isolated point. 
Case A = 0: For A = 0 A is either a cusp or a tacnode; the slope of the tangent is 


Py 
tana = -—. (3.514) 


yy 
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For more precise investigation about multiple points the origin can be translated to the point A, and 
rotate so that the x-axis becomes a tangent at A. Then from the form of the equation one can tell if it 
is a cusp of first or second kind, or if it is a tacnode. 

ΒΑ: FG y) = (z? +y?) 2a? (2? — νυ») 20 (Lemniscate, Fig. 2.66, 2.12.6, p. 101); Εν = 4x(a? + 
y? — a’), Fy = Ay(a? + y? + a”); the equation system F, = 0, Fy = = 0 results in the three solutions 


(0,0), M 0), from which only. the vus one satisfies the condition T= = 0. Substituting (0, 0) into the 
second derivatives yields Fp» = —4a?, = 0, Fy = 1447; A = —16a* < 0, i.e., at the origin the 
curve intersects ieot, the slopes of thet ni far are tana = +1, their equations are y = +7. 

ΒΒ: F(x,y) = Ey! —a? -- μὲ = 0; Εν = «(3x — 2), Fy = y(3y — 2); among the L4 ic 0), 
(0, 2/3), (2/3, 0), - (2/3, 2/3) only the first one belongs to "the curve; there is F;, = —2, e, 
> = —2, A = 4 > 0, i.e., the origin is an isolated point. 


MC: F(x,y) = (y — 32) — à? = 0. The equations F, = 0, F, = 0 result in only one solution (0, 0), 
it also satisfies the equation F = 0. Furthermore there is A = 0 and tana = 0, so the origin is a 
cusp of the second kind. This can be seen from the explicit form of the equation y = a?(1 € vz). y 
is not defined for x < 0, while for 0 < x < 1 both values of y are positive; at the origin the tangent is 
horizontal. 


4. Algebraic Curves of Type F(z, y) = 0, F(x, y) Polynomialin α and y 

If the equation does not contain constant and first-degree terms, then the origin is a double point. The 
corresponding tangents can be determined by making the sum of the second-degree terms equal. 

Β For the lemniscate (Fig. 2.66, p. 101) at the origin (0, 0) the equations of the tangent y = + follow 
from x? — y? = 0. 

If the equation does not contain second-degree terms as well but contains cubic terms, then the origin 
is a triple point. 


3.6.1.4. Asymptotes of Curves 
1. Definition 


An asymptote is a straight line to which the curve 
approaches while it moves away from the origin — 
(Fig. 3.232). 


The curve can approach the line from one side a) b) 
(Fig. 3.232a), or it can intersect it again and again : 
(Fig. 3.232b). Figure 3.232 


Not every curve which goes infinitely far from the origin (infinite branch of the curve) has an asymptote. 
For instance the entire part of an improper rational expression is called an asymptotic approximation 
(see 1.1.7.2, p. 15). 


2. Functions Given in Parametric Form x = z(t), y = y(t) 


To determine the equation of the asymptote first those values are determined for which t — t; yields 
either x(t) — Ἔσο or y(t) — oo (or both). 
There are the following cases: 


a) x(t) — oo for t — t; but y(t;) =a #00: y=a. The asymptote is a horizontal line. — (3.515a) 
b) y(t) — oo for t — t; but z(t;) =a z oo: xv —a. The asymptote is a vertical line. (3.515b) 


(t) 


c) If both y(t) and x(t) tend to -Ἔσο, then the following limits are to be calculated: k = lim mE and 


i x(t) 


= lim ly(t) — kx(t)]. If both exist, the equation of the asymptote is 
Sti 


y — kx 4 b. (3.515c) 
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m π π 
E= , y = n(tant t), ti = c, t= , etc. Determine the asymptote at t4: 
cost 2 2 ` 
n n 
z(t) = y(t) =co, k= lim (sint — tcost) 2 ---, 
tor /2 m m 
n m sint — tcost — 1 nT 
b= lim Jj[n(tant—1) - — — n lm = . For the asymptote 
t— 7/2 m cost t> 7/2 cost 2 
: n nT S n nT 
(3.515c) gives y = — x — EX For the second asymptote, etc. one gets similarly y = -----2 + rd 
m m 


3. Functions Given in Explicit Form y = f (æ) 
The vertical asymptotes are at the points of discontinuity where the function f(x) has an infinite jump 
(see 2.1.5.3, p. 59); the horizontal and oblique asymptotes have the equation 


f(a) 


y = ke +b with k = lim b= lim [f(z) — ka]. (3.516) 
4. Functions Given in Implicit Polynomial Form F(z, y) = 0 
1. To determine the horizontal and vertical asymptotes one chooses the highest-degree terms with 
degree m from the polynomial expression in x and y, then one separates them as a function (x, y) and 
solves, if it is possible, the equation d(x, y) = 0 for x and y: 

$(r,y)—O0 yields r-—q(y) υ--ψί(α). (3.517) 
The values yı = a for x — oo give the horizontal asymptotes y = a; the values x; = b for y — oo the 
vertical ones x — b, if the limits exist. 
2. To determine the oblique asymptotes the equation of the line y = kx + b is to be substituted into 
the equation F(x, y), then the resulting polynomial is to be ordered according to the powers of x: 


F(x, ka +b) = fi(k,b)a™ + falk, bda" +--+. (3.518) 
The parameters k and b can be yield, if they exist, from the equations 
fi(k,b) =0, Πε, b) = 0. (3.519) 


Β x’ +y’ — 3ary = 0 (Cartesian folium Fig. 2.59, 2.11.3, p. 96). Based on equation F(x, kx + b) = 
(1+ Kk?)a? -- 3(K?b — ka)a? +--+ according to (3.519), fi(k,b) =1+k® = 0 and fo(k,b) = k?b — ka = 0 


with the solutions k = —1, b = —a the equation of the asymptote is y = —r — a. 


3.6.1.5 General Discussion of a Curve Given by an Equation 

Curves given by their equations (3.483)-(3.486) are investigated in order to know their properties and 
shapes. 

1. Construction of a Curve Given by an Explicit Function y = f(x) 

a) Determination of the domain (sce 2.1.1, p. 48). 


b) Determination of the symmetry of the curve with respect to the origin or to the y-axis checking 
if the function is odd or even (see 2.1.3.4, p. 51). 


c) Determination of the behavior of the function at +00 by calculating the limits lim f(a) and 
α-»--οο 


ΜΝ f(x) (see 2.1.4.7, p. 55). 


d) Determination of the points of discontinuity (see 2.1.5.3, p. 59). 


e) Determination of the intersection points with the y-axis and with the x-axis calculating 
f (0) and solving the equation f(x) = 0. 

f) Determination of maxima and minima and finding the intervals of monotonicity where the 
function is increasing or decreasing. 


g) Determination of inflection points and the equations of tangents at these points (see 3.6.1.3, 
p. 249). 
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With these data the graph of the function can be sketched, and where it is necessary, some values can 
be calculated to make it more precise. 
W Sketch the graph of the function y = pog 

a) The function is defined for all x except x = 0. 

b) There is no symmetry. 

c) For x — —oo follows y — 2, and obviously y = 2 — 0, i.e., approach from below, while x — oo also 
holds y + 2, but y = 2+ 0, an approach from above. 

d) x = 0 is a point of discontinuity such that the function from left and also from right tends to —oo, 
because y is negative for small values of x. 

e) Because f(0) = —oo holds, there is no intersection point with the y-axis, and from f(x) = 207 + 
3r — 4 = 0 the intersection points with the z-axis are at x, £z 0.85 and x & —2.35. 

f) A maximum is at x = 8/3 ~ 2.66 and here y ~ 2.56. 

g) An inflection point is at x = 4, y = 2.5 with the slope of the tangent line tana = —1/16. 

h) After sketching the graph of the function based on these data (Fig. 3.233) one can calculate the 
intersection point of the curve and the asymptote, which is at x = 4/3 ~ 1.33 and y = 2. 

2. Construction of a Curve Given by an Implicit Function F(z, y) = 0 

There are no general rules for this case, because depending of the actual form of the function different 
steps can be or cannot be performed. If it is possible, the following steps are recommended: 

a) Determination of all the intersection points with the coordinate axes. 

b) Determination of the symmetry of the curve, replacing x by —x and y by —y. 

c) Determination of maxima and minima with respect to the x-axis and then interchanging x and 
y also with respect to the y-axis (see 6.1.5.3, p. 444). 

d) Determination ofthe inflection points and the slopes of tangents there (see 3.6.1.3, p. 249). 
e) Determination of singular points (sce 3.6.1.3, 3., p. 250). 

f) Determination of vertices (see 3.6.1.3, 2., p. 250) and the corresponding circles of curvature (see 
3.6.1.2, 4., p. 246). It often happens that the curve's arc can hardly be distinguish from the circular 
segment of the circle of curvature on a relatively large segment. 


g) Determination of the equations of asymptotes (see 3.6.1.4, p. 252) and the position of the 
curve branches related to the asymptotes. 


evolvent 


Figure 3.233 Figure 3.234 Figure 3.235 


3.6.1.6 Evolutes and Evolvents 

1. Evolute 

The evolute is a second curve which is the locus of the centers of circles of curvature of the first curve 
(see 3.6.1.3, 5., p. 248); at the same time it is the envelope of the normals of the first curve (see also 
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3.6.1.7, p. 255). The parametric form of the evolute can be derived from (3.503) or (3.504) for the 
center of curvature if xc and yc are considered as running coordinates. If it is possible to eliminate 
the parameter (x, t or p) from (3.502), (3.503), (3.504), then the equation of the evolute is obtained in 
Cartesian coordinates. 
2x(1 + 437) 

2 


Bl Determine the evolute of the parabola y = x? (Fig. 3.234). From X = x 
14 42? 14 θα” 


4r, Y = αἱ 4 2 2 and considering X and Y as running coordinates the evolute 
1 x25 
Yea 49 (5) 
2 4 


2. Evolvent or Involute 
The evolvent, also called involute, of a curve I> is a curve 7}, whose evolute is >. Here every normal 


PC of the evolvent is a tangent of the evolute (Fig. 3.234), and the length of arc CC of the evolute 
is equal to the increment of the radius of curvature of the evolvent: 


CC;- BC, - PC. (3.520) 
These properties show that the evolvent / can be regarded as the curve traced by the end of a stretched 
thread unspooling from 7/5. A given evolute corresponds to a family of curves, where every curve is de- 
termined by the initial length of the thread (Fig. 3.235). 

The equation of the evolute is obtained by the integration of a system of differential equations corre- 
sponding to its evolute. For the equation of the evolvent of the circle see 2.14.4, p. 106. 
W The catenoid is the evolute of the tractrix; the tractrix is the evolvent of the catenoid (see 2.15.1, 


p. 108). 
P Qe 
a) 
Κ/άῑλα d) 
b) 
a) c) ro 
Figure 3.236 Figure 3.237 


3.6.1.7 Envelope of a Family of Curves 


1. Characteristic Point 
Consider the one-parameter family of curves with equation 

F(x,y,a) = 0. (3.521) 
Every two close curves of this family corresponding to the values of parameter a and a + Aa have points 
K of nearest approach. Such a point is either a point of intersection of the curves (o) and (a + Δα) or 
a point of the curve (a) whose distance from the curve (a + Δα) along the normal is an infinitesimal 
quantity of higher order than Aa (Fig. 3.236a,b). For Aa — 0 the curve (a + Aa) tends to the curve 
(a), where in some cases the point K approaches a limit position, the characteristic point. 
2. Geometric Locus of the Characteristic Points of a Family of Curves 
Equation (3.521) may represent one or more curves. They are formed by the approaching points or by 
the boundary points of the family (Fig. 3.2372), or they form an envelope of the family, i.e., a curve 
which contacts tangentially every curve of the family (Fig. 3.237b). A combination of these two cases 
is also possible (Fig. 3.237c,d). 
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3. Equation of the Envelope 
The equation of the envelope can be calculated from (3.521), where a can be eliminated from the fol- 
lowing equation system: 


OF 
F=0, m =0. (3.522) 
Oa 
E Determine the equation of the family of ΧΙ 


straight lines arising when the ends of a line seg- 
ment AB with |AB| = l are sliding along the 
coordinate axes (Fig. 3.238a). The equation 


1 
of the family of curves is 2 + ANE il y > 
lsina  Ilcosa x 
or F = rcosa + ysina — lsinacosa = 0, A 
Ρ l 
OF " 2 IND 
Aa i sina--gycosa-lcos* a+lsin* a = 0. 
a 
Eliminating a gives z?/5 + y?’ 2? asanen- a) ῃ Bx b) 
velope, which is an astroid (Fig. 3.238b, see 
also p. 103). Figure 3.238 
3.6.2 Space Curves 
3.6.2.1 Ways to Define a Space Curve 
1. Coordinate Equations 
Το define a space curve there are the following possibilities: 
a) Intersection of Two Surfaces: F(r,y,z) 20, d(r,y,z) — 0. (3.523 
b) Parametric Form: «= x(t), y=y(t), z= z(t), (3.524 
with t as an arbitrary parameter; mostly t = x, y or z are used. 
c) Parametric Form: r-—z(s) y=y(s), z= z(s), (3.525a 


with the arc length s between a fixed point A and the running point P as the parameter: 


t 2 2 2 
τ dzxV | (dy dz = 
E H (5) | (2) + (5) dt. (3.525b 
0 


2. Vector Equations 
With r as radius vector of an arbitrary point of the curve (see 3.5.1.1, 6., p. 182) the equation (3.524 
can be written in the form 


F=F(t), where F(t) = z(t)i 4 y(0)j + z(t)k, (3.526 
and (3.525a) in the form 
F=F(s), where rF(s)— c(s)i + y(s)] + 2(s)k. (3.527 


3. Positive Direction 

This is the direction of increasing parameter t for a curve given in the form (3.524) and (3.526); for 
(3.525a) and (3.527) it is the direction in which the arc length increases. 

3.6.2.2. Moving Trihedral 

1. Definitions 


At every point P of aspace curve can be defined three lines and three planes, apart from singular points. 
They intersect each other at the point P, and they are perpendicular to each other (Fig.3.239). 
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1. Tangent is the limiting position of the secants PN 
for N — P (Fig.3.220), p. 244. 

2. Normal Plane is a plane perpendicular to the tan- 
gent (3.239). Every line passing through the point P and 
contained by this plane is called a normal of the curve at the 
point P. 

μοι... 3. Osculating Plane is the limiting position of the 
planes passing through three neighboring points M, P and 
N, for N + P and M — P (Fig.3.240). The tangent line 


binormal rectifying plane 


os¢ulating plane 


tangent f principal normal is contained by the osculating plane. 
} 4. Principal Normal is the intersection line of the nor- 
a normal plane mal and the osculating plane, i.e., it is the normal contained 
Val by the osculating plane. 
5. Binormal is the line perpendicular to the osculating 
Figure 3.239 plane through the point P. 


6. Rectifying Plane is the plane spanned by the tangent and binormal 
lines. 

The positive directions on the lines (1.), (4.) and (5.) are defined as follows: 
7. Moving trihedral The positive directions on the lines tangent, prin- 
cipal normal and binormal are defined as follows: 


a) On the tangent line it is given by the positive direction of the curve; the 
Figure 3.240 unit tangent vector t has this direction. 

b) On the principal normal it is given by the sign of the curvature of the curve, and given by the unit 
normal vector ri. 
c) On the binormal it is defined by the unit vector 
b=tx ai, (3.528) 
where the three vectors t, ii, and b forma right-handed rectangular coordinate system, which is called 
the moving trihedral. 
2. Position of the Curve Related to the Moving Trihedral 


For the usual points of the curve the space curve is on one side of the rectifying plane at the point P, and 
intersects both the normal and osculating planes (Fig.3.241a). The projections of a small segment of 
the curve at the point P on the three planes have approximately the following shapes: 


1. On the osculating plane it is similar to a quadratic parabola (Fig.3.241b). 

2. On the rectifying plane it is similar to a cubic parabola (Fig.3.241c). 

3. On the normal plane it is similar to a semi cubical parabola (Fig.3.241d). 

If the curvature or the torsion of the curve are equal to zero at P or if P is a singular point, i.e., if 
a" (t) = y(t) = z'(t) = 0 hold, then the curve may have a different shape. 


5 Ἡ E 
» < > 
t t "n 
b) c) d) 


Figure 3.241 
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3. Equations of the Elements of the Moving Trihedral 
1. The Curve is Defined in the Form (3.523) For the tangent see (3.529), for the normal plane 
see (3.530): 


X-Z νι 4-2 mm X-zY-yZ-z 

OF OF|~ [oF oF|~ jar or) (3539) OF OF OF 

Oy Oz Oz Ox Ox Oy Ox ἂν Oz |=0. (3.530) 
09 00 09 0o 09 09 op op 00 

ὃν Oz Oz Or Ox Oy Or Oy Oz 


Here x, y, z are the coordinates of the point P of the curve and X, Y, Z are the running coordinates of 
the tangent or the normal plane; the partial derivatives belong to the point P. 

2. 'The Curve is Defined in the Form (3.524), (3.526) In Table 3.27 the coordinate and vector 
equations belonging to the point P are given with z, y, z and also with r. The running coordinates and 


the radius vector of the running point are denoted by X, Y, Z and R. The derivatives with respect to 
the parameter t refer to the point P. 

3. The Curve is Defined in the Form (3.5252, 3.527) If the parameter is the arc length s, for 
the tangent and binormal, and for the normal and osculating plane the same equations are valid as in 
case 2, just t must be replaced by s. The equations of the principal normal and the rectifying plane will 
be simpler (Table 3.28). 


3.6.2.3 Curvature and Torsion 
1. Curvature of a Curve and Radius of Curvature 


The curvature of a curve at the point P is a measure which describes the 
deviation of the curve from a straight line in the very close neighborhood of 


2 dr 
this point. The exact definition is by the help of the tangent vector t — ds 
5 


(see Fig.3.242): 


At dt -- 
K= lim ------ (3.531) Figure 3.242 


~ 


A~ > ds’ 
PN=>0 PN 


1. Radius of Curvature The radius of curvature is the reciprocal value of the absolute value of the 
curvature: 
1 

ΤΝ (3.532) 
2. Formulas to calculate K and p 
a) If the curve is defined in the form (3.525a): 
Pr 
ds? 
where the derivatives are with respect to s. 
b) If the curve is defined in the form (3.524): 


|| = 


= xn 4 y? | 21, (3.533) 


dr (duy (dery 
5 dt dt? dt dt? (x? $ y? $ 27?) (a? } y? } pi? (a! ar!" | yy" ab atate 
ΚΞ 3 = (x + y? + ΤΗ 


(3.534) 


The derivatives are calculated here with respect to t. 
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Table 3.27 Vector and coordinate equations of accompanying configurations of a space curve 


Vector equation Coordinate equation 
Tangent: 
ἥ-πι dr X-r Y-y Z-z 
=f μας = 
dt α΄ y at 
Normal plane: 
»] dr g a afl F. 
(Β- m z'(X-—z)Ty(Y -y))-z(Z-2z)-0 
Osculating plane: 
m dr dy X-zY-yZ-z 
τ 1 / / / 
ΞΕ πε a = g Y 2 =0 
( dt dt? gl! M alt. 
Binormal: 
> dr d Χ-α Υ-! —z 
ο 7 ΠΡΙΝ T 7 = er 
dt | dt? y z 2.2 x 
y" 2 2 gl! ο, y" 
Rectifying plane: 
> „dd od a ο ον 
( r dd x iz) = 0 x y 2 0, 
tdt dE l m n 
with l = yz" ντ’ 
m = σα — z"a! 
"ne aly!" ET aly! 
Principal normal: 
= dr Ir dr X-r Υ-] Ζ 
R r+ Ps x : x : p 7 ss = T 4 = 7 5 
dt dt d? yz PE vy 
mn nd lm 
T position vector of the space curve, R position vector of the accomp. configuration 
! For the mixed product of three vectors see 3.5.1.6, p. 185 


Tabelle 3.28 Vector and coordinate equations of accompanying configurations 


as functions of 


;he arc length 


Coordinate equation 


| Element of trihedral | Vector equation 
d >= 4 dr 
Principal normal R=r+A— 
ds? 
E à pf 
Rectifying plane (R—r)—-20 
i ds? 


X=g 


ΦΧ — £) +y" (Y -y)-z"(Z-z)-20 


F position vector of the space curve, R. posit 


ion vector of the accomp. configuration 
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3. Helix The equations 

x -—acost, y=asint, z=bt (a>0,b>0) (3.535) 
describe a so-called helix (Fig.3.243) as a right screw. If the observer is 
looking into the positive direction of the z-axis, which is at the same time 
the axis of the screw, then the screw climbs in a counter-clockwise direction. 
A helix winding itself in the opposite orientation is called a left screw. 
Β Determine the curvature of the helix (3.535). The parameter { is to 


be replaced by s = ἐνα --ῦ. Then holds r = acos 


5 
Va? +’ d 


E bs a 
asin M , and according to (3.533), K = ——-, p= 
va? +b? va? +b? gio ) a? + b2’ P 
a? +b? ER ; 
. Both quantities K and p are constants. Figure 3.243 


Another method, without the parameter transformation in (3.534), produces the same result. 


2. Torsion of a Curve and Radius of the Torsion Circe 
The torsion of a curve at the point P is a measure which describes the deviation of the curve from 
a plane curve in the very close neighborhood of this point. The exact definition by the help of the 


binormal vector b (3.528), p. 257 (see also Fig.3.244) is: 


Ab db 
T= lim ---- ds (3.536) 
PNPN “S 
The radius of torsion is 
1 
T= =. (3.537) 
IT] Figure 3.244 
1. Formulas for Calculating T and τ 
a) If the curve is defined in the form (3.525a): 
of y z 
x x" y" gilt 
o ας Cr dir gi! y" g!" 
T=% = . 2 
P (s ds? a) (al? + y? + 272)" η 
where the derivatives are taken with respect to s. 
b) If the curve is defined in the form (3.524): 
i vy 2 
dr d?r dr qam gl 
Τμ αρα _ 2 |0" y" z" (3.539) 


πε ΡΤ 24 2)" 
dé 213 (x? + y? + 2729 
dt 
where p should be calculated by (3.532) and (3.533). 
The torsion calculated by (3.538), (3.539) can be positive or negative. In the case T > 0 an observer 
standing on the principal normal parallel to the binormal sees that the curve has a right turn; in the 
case T < 0 it has a left turn. 
Β The torsion of a helix is constant. With the notation R for the right screw and L for the left screw 


*For the mixed product of three vectors see 3.5.1.6, 2., p. 185. 
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the torsion is 
—asint acost b 
—acost —asint 0 


a2 4 22.7 asint —acost 0 b a? + b? b 
= ge των. ΡΞ ; =- zs 
x a [(—asint)? + (acost)? +b]? a? +b? jo tomm a? + D? 
3. Frenet Formulas » 4 
The derivatives of the vectors t, n, and b can be expressed by the Frenet formulas: 
dé απ di € | b db tt (3.540) 


ds p° ds p E ds T 
Here p is the radius of curvature, and 7 is the radius of torsion. 
4. Darboux Vector 
The Frenet formulas (3.540) can be represented in the clearly arranged form (Darboux formulas) 


d 4 » di 4 _ db 4 | 
qe ον qo OB, το (3.541) 
Here d is the Darboux vector, which has the form 
; 1. 1, 
d= =t + -b. (3.542) 
T p 
Remarks: 


1. With the help ofthe Darboux vector the Frenet formulas can be interpreted in the sense of kinematics 
(see [3.4]). 
2. The modulus of the Darboux vector equals the so-called total curvator A of a space curve: 


j= 3*3-l |. (3.543) 
3.6.3 Surfaces 
3.6.3.1 Ways to Define a Surface 
1. Equation of a Surface 
Surfaces can be defined in different ways: 
a) Implicit Form : F(x,y,2z) — 0. (3.544) 
b) Explicit Form : z — f(z,y). (3.545) 
c) Parametric Form: x = z(u,v), y-y(uv) z= z(u,v). (3.546) 
d) Vector Form : F=F(u,v) with F=2(u,v)i+y(u,v)j + z(u, v)k. (3.547) 


Running the parameters u and v over all allowed values gives the coordinates and the radius vectors 

of all points of the surface from (3.546) and (3.547). The elimination of the parameters u and v from 

the parametric form (3.546) (if possible) yields the implicit form (3.544). The explicit form (3.545) is 

a special case of the parametric form with u = r and v = y. 

WB The equation of the sphere in Cartesian coordinates, parametric form, and vector form (Fig. 3.246): 
e+yt+22—a=0; (3.548a) X —acosusinv, y — asinusinv, z=acosv; (3.548b) 
F = a(cos u sin vi + sin usin vj + cos vk). (3.548c) 

2. Curvilinear Coordinates on a Surface 

If a surface is given in the form (3.546) or (3.547), and the values of the parameter u can be changed 


while the other parameter v = vo is fixed, then the points r(x, y, ο) describe a curve F = r(u, vo) on 
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the surface. Substituting for v different but fixed values v = vi, v = v2,...,U = Un one after the other, 
gives a family of curves on the surface. When moving along a curve with v — const only u is changing, 
this curve is called the u-line (Fig. 3.245). Analogously one gets another family of curves, the v-lines, 
by varying v and keeping u = const fixed with u1, ug,..., Un. In this way a net of coordinate lines is 
defined on the surface (3.546), where the two fixed numbers u = u; and v = vy; are the curvilinear or 
Gauss coordinates of the point P on the surface. 

If a surface is given in the form (3.545), the coordinate lines are the intersection curves of the surface 
with the planes r — const and y — const. With equations in implicit form F(u,v) — 0 or with the 
parametric equations u = u(t) and v = v(t) of these coordinates, curves on the surfaces can be defined. 
W In the parametric equations of the sphere (3.548b,c) u means the geographical longitude of a point 
P, and v means its polar distance. The v lines are here the meridians APB; the u lines are the parallel 
circles CPD (Fig. 3.246). 


3.6.3.2 Tangent Plane and Surface Normal 

1. Definitions 

1. Tangent Plane The precise general mathematical definition of the tangent plane is rather com- 
plicated, so here the investigation is to be restricted to the case, when the surface is defined by two 
parameters. Suppose, for a neighborhood of the point P(x, y, z), the mapping (u, v) — r(u, v) is in- 


. : — Or - Or . 
vertible, the partial derivatives r, = Bu andr, = δν are continuous, and not parallel to each other. 
} v 


u υ 
Then P(x, y, z) is called a regular point of the surface. If P is regular, then the tangents of all curves 
passing through P, and having a tangent here, are in the same plane, and this plane is called the tangent 
plane of the surface at P. If this happens, the partial derivatives ru, Ἐν are parallel (or zero) only for 
certain parametrization of the surface. If they are parallel for every parametrization, the point is called 
a singular point (see 3., p. 263). 


surface normal 


Figure 3.245 Figure 3.246 Figure 3.247 

2. Surface Normal The line perpendicular to the tangent plane going through the point P is called 
surface normal at the point P (Fig. 3.247). 
3. Normal Vector The tangent plane is spanned by two vectors, by the tangent vectors 

, Or . δε 

τρ P= By 
of the u- and v-lines. The vector product of the tangent vectors r, x r, is a vector in the direction of 
the surface normal. Its unit vector 
Pu X ry 


(3.549a) 


No = (3.549b) 


μεις Py 


is called the normal vector. Its direction to one or other side of the surface depends on which variable 
is the first and which one is the second coordinate among u and v. 
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2. Equations of the Tangent Plane and the Surface Normal (see Table 3.29) 
W A: For the sphere with equation (3.548a) is valid: 


a) as tangent plane: 2z(X — x) + 2y(Y — y) + 2z(Z —z) =0 or aX +yY +2Z-a? =0, (3.550a 
e$ Ye Z= X Y Z 
b) as surface normal: e E ρε ενα” (3.550b 
2x 2y 2z 2 y zZ 
W B: For the sphere with equation (3.548b) is valid: 
a) as tangent plane: X cosusinv + Y sinusinv + Z cosu = a, (3.550c 
X Y Z 
b) as surface normal: (3.550d 


cosusinv  sinusinv σοβυ 


3. Singular Points of the Surface 
If for a point with coordinates x = zi, y = yi, Z = τι ofthe surface with equation (3.544) all the 
equalities 

OF OF OF 

= 0, — = 0, — = 0, F (x,y,z) =0 3.551 

Fe το ay =O C (5551) 
are fulfilled, i.e., if every first-order derivative is zero at the point P(xi, y1, σι). then this point is called 
a singular point. All tangents going through here do not form a plane, but a cone of second order with 
the equation 


OF a OF ο, OF OF 
dod ( 11) 4 3y? ( y) 3 ER (Z-a)- gn (X —23))(Y -- νι) 
OF OF 
+ une yı)(Z -- Σι) 4 23:0: 4 z)(X — zı) =0, (3.552) 


where the derivatives belong to the point P(x, yi, 21). If also the second derivatives are equal to zero 
at this point, then there is a more complicated type of singularity. Then the tangents form a cone of 
third or even higher order. 


3.6.3.3 Line Elements of a Surface 


1. Differential of Arc 
Consider a surface given in the form (3.546) or (3.547). Let P(u, v) an arbitrary point and N(u+du, v+ 


dv) another one close to P, both on the surface. The arclength of the arc segment PN on the surface 
can be approximately calculated by the differential of an arc or the line element of the surface with the 
formula 


ds? = Edu? + 2F du dv + Gdv’, (3.553a) 
where the three coefficients 


E- Oz V? ὃν 2 az\’ p Ox Ox _ Oy ὃν , ὃς dz 
Hem (5) i (25) " Ou) ' Tou Ou dv | dudo | Ou Be? 


2 2 2 
+ (3) (5) (3.553b) 


are calculated at the point P. The right-hand side (3.553a) is called the first quadratic fundamental 
form of the surface. 
ΒΑ: For the sphere given in the form (3.548c) clearly 


E-a^smvw F=0, G=a’, ds =a?(sin?vdu? + dv’). (3.554) 
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W D: For a surface given in the form (3.545) clearly 


E=1+4p’, 


F=pq, 


G=14+¢@ with p=—, q 
Ox 


Oz _ Oz 
^ 


Table 3.29 Equations of the tangent plane and surface normal 


Type of equation 


Tangent plane 


Surface normal 


(3.544 


mm 
p 
σι 

τὰ 
en 


OF OF X-v Y-y Z-z 
e (X -z)-L—-(Y- B = 
go ta OF - OF - BF 
"AN WE w^ Ὃν 
3; 4 —2)20 
X-v Y-y Z-z 
Z-z X-£ Y 
P( )+a(¥ —y) " d zz 
X-zY-yZ-z X-c | Y-y _ Z-z 
Ox ὃν Oz ὃν Oz ὃς Ox Ox Oy 
Ou ðu Ou |-0 Ou Ou Ou Ou Ou Ou 
Or Oy ðz Oy Oz Oz Ox Ox Oy 
ðv ðv Ov Ov Ov Ov Ov Ov Ov 
(8 -rrin = 03) R = F4+X(r x r3) 
or (Β-.2Ν =0 or R=F+AN 


. a OF 
normal vector. Besides ri = 8 
u 


In this table z, y, z and r are the coordinates and radius vector of the 


plane and surface normal in the point P; furthermore p = 


points P of the curve; 


X,Y,Z and R are the running coordinates and radius vectors of the points of the tangent 


Oz Oz > 
E» ge ONN ee 
οἱ andr OE. ο 02 . 06 
279v "8p 1 ὃν 


1 For the mixed product of three vectors see 3.5.1.6, 2., p. 185 


2. Measurement on the Surface 


1. 


The Arclength of a surface curve u = u(t), v = v(t) for tg € t € t4 is calculated by the formula 


du dv 


2F 


dv 


hel 


Figure 3.248 


du\?* l 
dt | 


dt dt 


οί 


dt 


9 
) dt. 


(3.556) 


2. The Angle Between Two Curves τι = r(ui(t),vi(t)) and τὸ = 
r(us(t), v2(t)) on the surface F = r(u, v) is the angle between their tangents 


with the direction vectors fy and Fy (Fig. 3.248). It is given by the formula 


2222 wa 22 > > 
Eù ἃ; + Ε(ἄιῦ» + Vü) + αἴινο 


= 22 T 22 22 77 οι 
V Eà; -2Füjvi-- Gv, / Eà; + 2F tv. + (νο 


Here the coefficients E, F and G are calculated at the point P and ü b ib, y 1 and Vo represent the first 
derivatives of u; (f), u»(t), vi(t) and ου (8). calculated for the value of the parameter t at the point P. 


3.6 Differential Geometry 265 


If the numerator of (3.557) vanishes, the two curves are perpendicular to each other. The condition of 
orthogonality for the coordinate lines v = const and u = const is F = 0. 

3. The Area of a Surface Patch S bounded by an arbitrary curve which is on the surface can be 
calculated by the double integral 


s= / ds (3.558a) with dS = VEG — F?du dv. (3.558b) 
(5) 

dS is called the surface element or element of area. 
The calculation of length, angle, and area on a surface is possible with (3.556), (3.557), (3.558a,b) if the 
coefficients E, F, and G of the first fundamental form are known. So the first quadratic fundamental 
form defines a metric on the surface. 
3. Applicability of Surfaces by Bending 
If a surface is deformed by bending, without stretching, compression or tearing, then its metric remains 
unchanged. In other words, the first quadratic fundamental form is invariant under bendings. Two 
surfaces having the same first quadratic fundamental form can be rolled onto each other. 


Figure 3.249 


3.6.3.4 Curvature of a Surface 


1. Curvatures of Curves on a Surface 

If different curves I’ are drawn through a point P of the surface (Fig. 3.249), their radii of curvature 
p at the point P are related as follows: 

1. The Radius of Curvature p ofa curve I’ at the point P is equal to the radius of curvature of a 
curve C, which is the intersection of the surface and the osculating plane of the curve I’ at the point P 
(Fig. 3.2492). 

2. Meusnier's Theorem For every plane section curve C of a surface (Fig. 3.249b) the radius of 
curvature can be calculated by the formula 


p = Rcos(ii, Ñ). (3.559) 
Here R is the radius of curvature of the normal section Cnorm, which goes through the same tangent 
NQ as C and also contains the unit vector N of the surface normal; X(n, N) is the angle between 
the unit vector ri of the principal normal section of the curve C and the unit vector N of the surface 
normal. The sign of p in (3.559) is positive, if Ñ is on the concave side of the curve Cnorm and negative 
otherwise. 
3. Euler’s Formula The curvature of a surface for every normal section Cnorm at the point P can 
be calculated by the Euler formula 

1 cos?a sin? a 


uetus der (3.560) 
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where Rı and Rə are the radii of principal curvatures (see (3.562a)) and a is the angle between the 
planes of the section C and Οι (Fig. 3.249c). 
2. Radii of Principal Curvature 
The radii of principal curvatures of a surface are the radii with maximum and minimum values. They 
can be calculated by the principal normal sections Οι and C3 (Fig. 3.249c). The planes of ΟἹ and C5 
dy 
are perpendicular to each other, their directions are defined by the value E which can be calculated 
a 
from the quadratic equation 
2 
, dy : oy, dy 
[tpg — s(1 + q°)] ( ) +(e ερ τα] 
x 
where the parameters p, q, r, s, t are defined in (3.562b). If the surface is given in the explicit form 
(3.545), Ry and Rz are the roots of the quadratic equation 


(rt — S?) R? + h[2pqs — (1 + p?)t — (A-- g)r]R-- hà 20 with (3.5622) 


Oz Oz Oz Oz Pz 
j =, 85 o t= = nd h=\/1 2 2. 562k 
Pan’ 1 Oy ᾽ ir 7 OrOy' Oy? me "POM pem 


The signs of R, Rı and Ry can be determined by the same rule as in (3.559). 
If the surface is given in vector form (3.547), then instead of (3.561) and (3.562a) the corresponding 
equations hold 


+ [s(1 + p°) — rpq] = 0, (3.561) 


dx 


dv 
du 
(LN — M?)R? - (EN - 2FM + GL)R + (EG — F’) = 0, (3.563b) 


with the coefficients L, M, N of the second quadratic fundamental form. They are given by the equal- 
ities 


(GM FN) ( ) + (GL ΕΝ F(FL— EM) - 0, (3.5632) 


= d E d! E d" 
L= ΓΒ =, M= ùR -, N=7,R F 
VEG — F? VEG - ΕΣ VEG - F? 
Here the vectors Fuu, Fy», and r,, are the second-order partial derivatives of the radius vector r with 
respect to the parameters u and v. In the numerators there are the determinants 


(3.563c) 


Pu Py Oz Ox Oy Oz Px Py Oz 
Ou? Ou? Qu? OuOv Oudv OuOv Ov? Ov? Ov? 

ἀ-|9α ὃν Oz) gal Oe Oy z| p| Oy Oz) (3.5634) 
Ou Ou Ou Ou ðu Ou Ou Ou Ou 
ὃ dy 0: ὃν Oy δε x ὃν ὃς 
Ov Ov Ov Ov Ov Ov Ov Ov Ov 

The expression 
Ldu? + 2Mdudv + Nd? (3.563e) 


is called the second quadratic fundamental form. It contains the curvature properties of the surface. 
Lines of curvature are the curves on the surface which have the direction of the principal normal section 
at every point. Their equations can be got by integrating (3.561) or (3.563a). 

3. Classification of the Points of Surfaces 


1. Elliptic and Umbilical Points If at a point P the radii ΠῚ and Rə of the principal curvatures 
have the same sign, then every point of the surface is on the same side of the tangent plane in a close 
neighborhood of this point, and P is called an elliptic point (Fig. 3.250a) on the surface. This fact 
can be expressed analytically by the relation 


LN- M? » 0. (3.5642) 
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a) 


Figure 3.250 


2. Circular or Umbilical Point This is the point P of the surface where the radii of the principal 
curvatures are equal: 

Rı = Ro. (3.564b) 
Then for the normal sections R = const. 
3. Hyperbolic Point In the case of different signs of the radii R; and Rə of the principal curvatures 
the concave sides of the principal normal sections are in opposite directions. The tangent plane inter- 
sects the surface, so the surface has a saddle form in the neighborhood of P. P is called a hyperbolic 
point (Fig. 3.250b); the analytic mark of this point is the relation 


LN — M? <0. (3.564c) 
4. Parabolic Point If one of the two radii of principal curvatures R, or Rə is equal to oo, then either 
one of the principal normal sections has an inflection point here, or it is a straight line. At P thereisa 
parabolic point (Fig. 3.250c) of the surface with the analytic mark 

LN- M? --0. (3.564d) 
W Al] the points of an ellipsoid are elliptic, those of a hyperboloid of one sheet are hyperbolic, and those 
of a cylinder are parabolic. 
4. Curvature of a Surface 
Two quantities are used mostly to characterize the curvature of a surface: 


1/1 1 
1. Mean Curvature of a surface at the point P: H = 2 = + x) : (3.565a) 


2. Gauss Curvature of a surface at the point P: K (3.565b) 


RR 
W A: For the circular cylinder with the radius a these are H = η and K = 0. 
a 


W B: For elliptic points K > 0, for hyperbolic points K < 0, and for parabolic points K = 0 hold. 
3. Calculation of H and K, ifthe surface is given in the form z = f(x,y): 


_ r(L+@) - 2pgs +t(1 +p) . s t=? 
2(1 + p? + Ps (3.566a) K= (3.566b) 


H ] 
(+p? + q°)? 


For the meaning of p, q, r, s, t see (3.562b). 

4. Classification of the Surfaces According to their Curvature 

1. Minimal Surfaces are surfaces with a zero mean curvature H at every point, i.e., with Rı = — Rə. 
2. Surfaces of Constant Curvature have a constant Gauss curvature K = const. 

W A: ΚΣ 0, for instance the sphere. 


268 9. Geometry 


ΒΒ: K < 0, for instance the pseudosphere (Fig. 3.251), i.e., the 
surface obtained by rotating the tractrix around the symmetry axis 
(Fig. 2.79). 


3.6.3.5 Ruled Surfaces and Developable Surfaces 
1. Ruled Surface i 


A surface is called a ruled surface if it can be represented by moving a 
line in space. 

2. Developable Surface 
If a ruled surface can be developed upon a plane, i.e., rolled out without Figure 3.251 
stretching or contracting any part of it, it is called a developable surface. 

Not every ruled surface is developable. 

Developable surfaces have the following characteristic properties: 


a) For all points the Gauss curvature is equal to zero, and 


b) if the surface is given in the explicit form z = f(a, y) the conditions of developability are fulfilled: 
a) K=0, b) rt—s?=0. (3.567) 

For the meaning of r, t, and s see (3.562b). 

W A: The cone (Fig. 3.192) and cylinder (Fig. 3.198) are developable surfaces. 

E B: The hyperboloid of one sheet (Fig. 3.196) and hyperbolic paraboloid (Fig. 3.197) are ruled 

surfaces but they cannot be developed upon a plane. 


3.6.3.6 Geodesic Lines on a Surface 


1. Concept of Geodesic Line 
(see also 3.4.1.1, 3., p. 160). A theoretic curve of the surface can go through every point P(u, v) of the 


surface in every direction determined by the differential quotient P. and it is called a geodesic line. It 
has the same role on the surface as the straight line on the plane, and it has the following properties: 
1. Geodesic lines are the shortest curves between two points on the surface. 

2. If a material point is moving on a surface drawn by another material point on the same surface, and 
no other forces have influence on it, then it is moving along a geodesic line. 

3. If an elastic thread is stretched on a given surface, then it has the shape of a geodesic line. 

2. Definition 


A geodesic line on a surface is a curve such that its principal normal at every point has the same direction 
as the surface normal. 


W On a circular cylinder the geodesic lines are circular helices. 
3. Equation of the Geodesic Line 


If the surface is given in the explicit form z = f(x, y), the differential equation of the geodesic lines is 
ond ly V dy\* di 
(14 p? 4-4) d = pt ( 3 + (2ps — qt) (2) + (pr 298) qr. (3.568) 
T a ur 


If the surface is given in parametric form (3.546), then the differential equation of the geodesic lines is 
fairly complicated. For the meaning of p, q, r, s, and t see (3.562b). 


4 Linear Algebra 


4.1 Matrices 
4.1.1 Notion of Matrix 
1. Matrices A of Size (m, n) or Briefly A(m,n) 


are systems of m times n elements, e.g., real or complex numbers, or functions, derivatives, vectors, 
arranged in m rows and n columns: 


απ 042 *** Ain «+ Ist row 
A = (aij) = Yd E l D id iii: (4.1) 
απ "ue 2 don = m-th row 
toT T 
Ist 2nd n-th column. 


With the notion size of a matrix matrices are classified according to their number of rows m and number 
of columns n: A of size (m, n). A matrix is called a square matriz if the number of rows and columns is 
equal, otherwise it is a rectangular matrix. 

2. Realand Complex Matrices 

Real matrices have real elements, complex matrices have complex elements. If a matrix has complex 
elements 


Quy, + ibu (4.2a) 
it can be decomposed into the form 
A+iB (4.2b) 


where A and B have real elements only (arithmetical operations see 4.1.4, p. 272). 
If a matrix A has complex elements, then its conjugate complex matrix A* has the elements 


αἴ, = Re(a,,) — i Im (a,,). 4.26 
μν μ p 


3. Transposed Matrices AT 
Changing the rows and columns of a matrix A of size (m, n) gives the transposed matriz AT. This has 
the size (n, m) and 


(arp) = (αμν) (4.3) 
is valid. 
4. Adjoint Matrices 


The adjoint matrix AV of a complex matrix A is the transpose of its conjugate complex matrix A* 
(which can not be confused with the adjoint matrix Aaqj, see 4.2.2, p. 278): 


AH — (A*)T. (4.4) 
5. Zero Matrix 


A matrix 0 is called a zero matrix if it has only zero elements: 


00:::0 
00:::0 

0-|.: i4 (4.5) 
00-0 
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4.1.2 Square Matrices 

1. Definition 

Square matrices have the same number of rows and columns, i.e., m = n: 

Q11 t7 Gin 

A=Aanm=[ ico: (4.6) 
[TREE 

The elements a,, of a matrix A in the diagonal from the left upper corner to the right lower one are 

the elements of the main diagonal. They are denoted by a11, 425, . .. , Ann, i-e., they are all the elements 

aw With p= v. 

2. Diagonal Matrices 


A square matrix D is called a diagonal matrix if all of the non-diagonal elements are equal to zero: 


ay, 0 ::: 0 ai O 
0 αρ... 0 a22 

aw —0 for pAv: D= E E te g = " : (4.7) 
0 0 ::: ünn ο Ann 


3. Scalar Matrix 
A diagonal matrix S is called a scalar matriz if all the diagonal elements are the same real or complex 
number c: 


οεθ-..0 
Ος:::0 

S=|...,; (4.8) 
00-:-c¢ 


4. Trace or Spur of a Matrix 


For a square matrix, the trace or spur of the matrix is defined as the sum of the main diagonal elements: 


Tr (A) = ay, + a22 +... + ann = b» (4.9) 
μξι 


5. Symmetric Matrices 
A square matrix A is symmetric if it is equal to its own transpose: 


A — AT, (4.10) 
For the elements lying in symmetric positions with respect to the main diagonal 

αμν = Ap (4.11) 
is valid. 


6. Normal Matrices 
satisfy the equality 
AHA = AAF. (4.12) 
(For the product of matrices see 4.1.4, p. 272.) 
7. Antisymmetric or Skew-Symmetric Matrices 
are the square matrices A with the property: 


A — —AT, (4.13a) 
For the elements a,, of an antisymmetric matrix the equalities 
apv = --ᾱνμ, Ay, =0 (4.13b) 


are valid, so the trace of an antisymmetric matrix vanishes: 


TEA) = Ve (4.130) 
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The elements lying in symmetric positions with respect to the main diagonal differ from each other only 
in sign. 

Every square matrix A can be decomposed into the sum of asymmetric matrix A, and an antisymmetric 
matrix Ας: 


A=A,+A,, with A = sa +A"); Aas = sa LATY: (4.13d) 


8. Hermitian Matrices or Self-Adjoint Matrices 
are square matrices A equal to their own adjoints : 
A — (A*)T ΑΒ, (4.14) 


Over the real numbers the concepts of symmetric and Hermitian matrices are the same. The determi- 


nant of a Hermitian matrix is real. 
9. Anti-Hermitian or Skew-Hermitian Matrices a 
are the square matrices equal to their negative adjoints: 

A = —(A*)T = —AF. (4.15a) 
For the elements a,,, and the trace of an anti-Hermitian matrix the equalities 

* . η = 

Aw = --ᾱμνι Ap = 0; Tr(A)=0 (4.15b) 
are valid. Every square matrix A can be decomposed into a sum of a Hermitian matrix Ap and an 
anti-Hermitian matrix Aah: 

A=A),+A,n with A, = 


(A+ AB), Am = Z(A — AP). (4.15c) 


NI = 
Nle 


10. Identity Matrix I 
is a diagonal matrix such that every diagonal element is equal to 1 and all of the non-diagonal elements 
are equal to zero: 


10-0 
sec B ; 0 for y £ v, 

= TEE = (Suv) with ὃμν = Ud μ-ν. (4.16) 
00-1 


The symbol ὃμν is called the Kronecker symbol. 
11. Triangular Matrix 
1. Upper Triangular Matrix, U, is a square matrix such that all the elements under the main 
diagonal are equal to zero: 

R= (rw) with r,,—0 for uv. (4.17 
2. Lower Triangular Matrix, L, is a square matrix such that all the elements above the main 
diagonal are equal to zero: 

L-(l,) with lw =0 for μεν. (4.18 


4.1.3 Vectors 
Matrices of size (n, 1) are one-column matrices or column vectors of dimension n. Matrices of size (1, n 
are one-row matrices or row vectors of dimension n: 

ay 


a 
Column Vector: a = ° , (4.19a) Row Vector: a” = (αι,α»,... απ). (4.190 


απ 


By transposing, a column vector is changed into a row vector and conversely. A row or column vector 
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of dimension n can determine a point in the n dimensional Euclidean space IR". 
The zero vector is denoted by 0 or O° respectively. 


4.1.4 Arithmetical Operations with Matrices 


1. Equality of Matrices 
Two matrices A = (αμν) and B = (b,,) are equal if they have the same size and the corresponding 
elements are equal: 

A=B, when a, —b, for p=1,...,m;v=1,...,n. (4.20) 
2. Addition and Subtraction 
Matrices can be added or subtracted only if they have the same size. The sum/difference of two matrices 
is done by adding/subtracting the corresponding elements: 


A E B = (au) + (buv) = (du + byv). (4.21a) 


1-37 3-50 4-27 
m ica «t iM x 
For the addition of matrices the commutative law and the associative law are valid: 


a) Commutative Law: A 4- B - ΒΑ. (4.21b 
b) Associative Law: (ΑΒ) - C — A 4 (B 4 C). (4.21c 


3. Multiplication of a Matrix by a Number 
Α matrix A of size (m, n) is multiplied by a real or complex number a by multiplying every element o 
A by a: 

aA = a (ay,) = (Ady). (4.22a 


1 37 3 921 
" 3(0 -1 i) Ἓ (à -9 oe 
From (4.22a) it is obvious that one can factor out a constant multiplier contained by every element o 


a matrix. For the multiplication of a matrix by a scalar the commutative, associative and distributive 
laws for multiplication are valid: 


a) Commutative Law: aA = Αα; (4.22b 
b) Associative Law: α(βΑ) = (αβ)Α: (4.22c 
c) Distributive Law: (a + 8)A = aA + 8A; a(A+B)=aA+aB. (4.22d 


4. Division of a Matrix by a Number 

The division of a matrix by a scalar y # 0 is the same as multiplication by a = 1/1. 

5. Multiplication of Two Matrices 

1. The Product A B of two matrices Α and B can be calculated only if the number of columns of 
the factor A on the left-hand side is equal to the number of rows of the factor B on the right-hand side. 
If A is a matrix of size (m, n), then the matrix B must have size (n, p), and the product A B is a matrix 
C = (cp) of size (m, p). The element c, is equal to the scalar product of the j-th row of the factor A 
on the left with the A-th column of the factor B on the right: 


AB - (M amba) = (ca) =C (GSA 2h ον up). (4.23) 
ν--1 
L Ἡ Ὁ 3 9 
E A= 2—1 4/)],B=] —5 1||. Theelement co» of the product matrix C in accordance with 
Επ 0 T 0 9 


(4.23) is c2 =2-2—-1-144-3=15. 
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2. Inequality of Matrix Products Even if both products A B and B A exist, usually AB ABA, 
i.e., in general the commutative law for multiplication is not valid. If the equality AB = B A holds, 
then one says that the matrices A and B are commutable or commute with each other. 

3. Falk Scheme Multiplication of matrices A B = C can be performed using the Falk scheme (Fig. 
4.1). The element c, of the product matrix C appears exactly at the intersection point of the p-th 
row of A with the A-th column of B. 
WB Multiplication of the matrices Α 09) and Big.) is shown in Fig. 4.2 using the Falk scheme. 


p 3 |2 
5 |1|B 
at | B 0 |3 
σε 137 | -12 26 
m A [AB A 2434 11 [15] AB 
-101 | 1 
Figure 4.1 Figure 4.2 


4. Multiplication of the Matrices K; and Kz with Complex Elements For multiplication of 
two matrices with complex elements can be used their decompositions into real and imaginary parts 
according to (4.2b): Κι = Αι -ΓΙΒι, Ky = As + iB. Here Ay, Az, Bi, Ba are real matrices. After 
this decomposition, the multiplication results in a sum of matrices whose terms are products of real 
matrices. 

B (A+ iB)(A— iB) = A? + B? + i(BA — AB) (Powers of Matrices see 4.1.5,8., p. 276). Of course 
when multiplying these matrices it must be considered that the commutative law for multiplication is 
not valid in general, i.e., the matrices A and B do not usually commute with each other. 


6. Scalar and Dyadic Product of Two Vectors 

If the vectors a and b are considered as one-row and one-column matrices, respectively, then there are 
two possibilities to multiply them according to the rules of matrix multiplication: 

If a has size (1, n) and b has size (n, 1) then their product has size (1, 1), i.e. it is a number. It is called 
the scalar product of two vectors. If conversely, a has size (n, 1) and b has size (1, m), then the product 
has size (n,m), i.e., it is a matrix. This matrix is called the dyadic product of the two vectors. 


1. Scalar Product of Two Vectors The scalar product of a row vector aT = (a1, a2, .. . , αμ) with 
a column vector b = (51, bz, . . . ,bn)T — both having n elements — is defined as the number 
n 
alb = bla = abı + azb - + anbn = 5 αμὂμ. (4.24) 
m 


The commutative law for multiplication is not valid for a product of vectors in general, so one must 
keep the exact order of aT and b. If the order of multiplication is reversed, then the product ba? is a 
dyadic product. 

2. Dyadic Product or Tensor Product of Two Vectors The dyadic product of a column vector 
a = (a4, 05, ..., an)" of dimension n with a row vector bT = (bi, bo,..., bm) of dimension m is defined 
as the following matrix: 


a101 aiba +++ aibm 
Ab, Agby +++ aab, 

abs | 552? 7 (4.25) 
anbı nbs Mibi [Um 


of size (n, m). Also here the commutative law for multiplication is not valid in general. 
3. Hints on the Notion of Vector Products of Two Vectors In the domain of multi-vectors 
or alternating tensors there is a so-called outer product whose three-dimensional version is the well- 
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known vector product or cross product (see 3.5.1.5, 2., p. 184 ff). In this book the outer product of 
multi-vectors of higher rank is not discussed. 


7. Rank of a Matrix 


1. Definition Ina matrix A the maximum number r of linearly independent column vectors is equal 
to the maximum number of linearly independent row vectors. This number r is called the rank of the 
matriz and it is denoted by rank (A) = r. 

2. Statements about the Rank of a Matrix 

a) Because in a vector space of dimension m there exist no more than m linearly independent m- 
dimensional row or column vectors (see 5.3.8.2, p. 366), the rank r of a matrix A of size (m, n) cannot 
be greater, than the smaller of m and n: 


rank (A(5,)) =r € min (m, n). (4.26a) 
b) A square matrix A(n,n) is called a regular matrix if 
rank(A(,,)) =r =n. (4.26b) 


A square matrix of size (n, n) is regular if and only if its determinant differs from zero, i.e., det A 4 0 
(see 4.2.2, 3., p. 279). Otherwise it is a singular matrix. 
c) Consequently for the rank of a singular square matrix A(n,n), i.e., det A = 0 

rank (Aq ny) —-r«n (4.26c 
is valid. 
d) The rank of the zero matrix 0 is equal to zero: 


rank (0) = r = 0. (4.26d 
e) The rank of the sum and product of matrices satisfies the relations 

jrank(A) — rank(B)| € rank(A + B) € rank(A) + rank(B), (4.266 

rank(AB) < min(rank(A), rank(B)). (4.26f 


3. Rulesto Determine the Rank Elementary transformations do not change the rank of matrices. 
Elementary transformations in this relation are: 

a) Interchanging two columns or two rows. 

b) Multiplication of a row or column by a number. 

c) Addition of a row to another row or a column to an other column. 

In order to determine their ranks every matrix can be transformed by appropriate linear combinations 
of rows into a form such that in the j-th row (p = 2,3,..., m), at least the first j| — 1 elements are equal 
to zero (the principle of Gauss algorithm, see 4.5.2.4, p. 312). The number of row vectors different from 
the zero vector in the transformed matrix is equal to the rank r of the matrix. 

8. Inverse Matrix 
For a regular matrix A = (αμν) there is always an inverse matrix A! (with respect to multiplication), 
i.e., the multiplication of a matrix by its inverse yields the identity matrix: 


AAT AA =L (4.272) 
The elements of A^* = (8,,) are 
Ay, 


(4.27b) 


Bu, = : 
UP detA’ 
where A,,, is the cofactor belonging to the a,,, element of the matrix A (see 4.2.1, 1., p. 278). Fora 


practical calculation of A^! the method given in 4.2.2, 2., p. 278 should be used. In the case of a matrix 
of size (2, 2) holds: 


| fab -- 1 d —b 
A- ος). = Tuo 2) (4.28) 
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Remark: Why not define division among matrices but instead use the inverse for calculations? This 
is connected to the fact that division cannot be defined uniquely. The solutions of the equations 

BX, =A (B T Xı = BA 

regular), 

XB = A ΚΑΒ 
are in general different. 
9. Orthogonal Matrices 
If the relation 

ΑΤΞΑ or  AAT-ATA-I (4.30) 
holds for a square matrix A, then it is called an orthogonal matrix, i.e., the scalar product of a row and 
the transpose of another one, or the scalar product of the transpose of a column and another one are 
zero, while the scalar product of a row with its own transpose or of the transpose of a column with itself 
are equal to one. 
Orthogonal matrices have the following properties: 


(4.29) 


a) The transpose and the inverse of an orthogonal matrix A are also orthogonal; furthermore, the 
determinant is 


det A = +1. (4.31) 
b) Products of orthogonal matrices are also orthogonal. 


W The rotation matrix D, which is used to describe the rotation of a coordinate system, and whose 
elements are the direction cosines of the new direction of axes (see 3.5.3.3,2. p. 212), is also an orthogonal 
matrix. 

10. Unitary Matrix 

If for a matrix A with complex elements 


(A*T2A^ ο  A(A*)-(A*)TA-I (4.32) 


holds it is called a unitary matriz. In the real case unitary and orthogonal matrices are the same. 


4.1.5 Rules of Calculation for Matrices 

'The following rules are valid of course only in the case when the operations can be performed, for in- 
stance the identity matrix I always has a size corresponding to the requirements of the given operation. 
1. Multiplication of a Matrix by the Identity Matrix 

is also called the identical transformation: 

AI-IA — A. (4.33) 
(This does not mean that the commutative law is valid in general, because the sizes of the matrix I on 
the left- and on the right-hand side may be different.) 

2. Multiplication of a Square Matrix A by a Scalar Matrix S 
or by the identity matrix I is commutative 


AS=SA=cA with S given in (4.8), (4.34a) AI-IA — A. (4.34b) 
3. Multiplication of a Matrix A by the Zero Matrix 0 


results in the zero matrix: 
A0=0,  0A-0. (4.35) 

(The zero matrices above may have different sizes.) The converse statement is not true in general, i.e., 

from AB = 0 it does not follow that A = 0 or B = 0. 

4. Vanishing Product of Two Matrices 

The product of two matrices A and B can be the zero matrix even if neither of 

them is a zero matrix: 


AB=0 or ΒΑ =0 orboth, although A 40, B #0. (4.36) 


id 
0 0 
| v9 
0 1/0 0 
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5. Multiplication of Three Matrices 

(AB)C = A (B C) (4.37 
i.e., the associative law of multiplication is valid. 
6. Transposing of a Sum or a Product of Two Matrices 


(A+B)! = AT 4- ΒΤ, (ΑΡ): = B" AT, (AT)T — A. (4.38a 
For square invertible matrices Αν): 
(AT) = (A T)T (4.38b 
holds. 
7. Inverse of a Product of Two Matrices 
(AB)!-B!A'. (4.39 
8. Powers of Matrices 
AP=AA...A with p> 0, integer, (4.40a 
κακώς 
p factors 
A°=I (det A 4 0), (4.40b 
A?’=(A7') (30, integer; det A Z 0), (4.40c 
AP*— APA (p,q integer). (4.40d 


9. Kronecker Product 
The Kronecker product of two matrices A = (apn) of the type (m, n) and B = (b,,) of the type (p, r) is 
defined as the rule 

A ® B= (αμ B). (4.41) 
The result is a new matrix of type (m - p, n» r), arising from the multiplication of every element of A 
by the matrix B. 


3-5 0 13 
E A= E rs J of type (2,3), B = i A) of type (2, 2). 


s (1 ὃ) -ο-(} A) eG a 3 9 -5 -150 0 
2-1 2-1 2-1 6 -3 -10 5 0 0 
MOBS i Sy ft λα ο σσ a ee So 

"2-1 2 —1 2. 4.9. 3 13553 


gives a matrix of type (4.6). 
For the transpose and the trace are valid the equalities: 
(A & B)! — AT & ΒΤ, (4.42) 
Tr(A & B) = Tr (A) - Tr(B). (4.43) 
10. Differentiation of a Matrix 
If a matrix A = A(t) = (aj, (t)) has differentiable elements αμν (1) of a parameter t then its derivative 
with respect to { is given as 


dA _ (5-9) η. (4.44) 


di dt 


4.1.6 Vector and Matrix Norms 


The norm of a vector or of a matrix can be considered as a generalization of the absolute value (mag- 
nitude) of numbers. Therefore a real number is assigned as ||v|| (Norm x) to the vector x or as ||A || 


4.1 Matrices 277 


(Norm A) to a matrix A. These numbers must satisfy the norm axioms (see 12.3.1.1, p. 669). For 
vectors x € R” they are: 


1. ||x|| > 0 for every x; ||x||=0 if and only if x = 0. (4.45 


2. ||Ax|| = |A| ||x|| for every x and every real number A. (4.46 


3. |x - yl| < ||xI| + |ly|| for every x and y (triangle inequality) (see also 3.5.1.1, 1., p. 182). (4.47 
There are many different ways to define norms for vectors and matrices. But for practical reasons it is 
better to define a matrix norm ||A|| and a vector norm ||x|| so that they might satisfy the inequality 

|| Ax|| < IAI [Ix]. (4.48 
This inequality is very useful for error estimations. If the matrix and vector norms satisfy this inequality, 
then one says that they are consistent with each other. If there is a non-zero vector x for every A such 


that the equality holds in (4.48), then one says the matrix norm || A|| is the subordinate to the vector 
norm ||x| 


4.1.6.1 Vector Norms 


If x = (£1, 22,..., η). is a real vector of n dimensions, i.e., x € R” , then the most often used vector 
norms are: 


1. Euclidean Norm 


n 


sel] = Isla: (4.49) 
2. Supremum or Uniform Norm 
x|| = ||X||oo := max|z;|. (4.50) 
1<i<n 
3. Sum Norm 
n 
x|| = |κ||1 := * κι. (4.51) 
i=1 
Bl In R’, in elementary vector calculus ||x||; is considered as the magnitude of the vector x. The 
magnitude |x| = ||x||» gives the length of the vector x. 


4.1.6.2. Matrix Norms 
1. Spectral Norm for Real Matrices 


ΙΑ = ΙΑ! := V Amax (AT A). (4.52) 


Here Amax( ATA) denotes the greatest eigenvalue (see 4.6.1, p. 314) of the matrix ATA. 


2. Row-Sum Norm 


n 


ΑΙ = ΙΑ] := mas ijl- 4.53 
|| Al] = || All max 2.losl (4.53) 


3. Column-Sum Norm 


πα max jas]. (4.54) 
Y i=1 


1<j<n 
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It can be proved that the matrix norm (4.52) is the subordinate norm to the vector norm (4.49). The 
same is true for (4.53) and (4.50), and for (4.54) and (4.51). 


4.2 Determinants 
4.2.1 Definitions 


1. Determinants 
Determinants are real or complex numbers uniquely associated with square matrices. The determinant 
of order n associated with the (n, n) matrix A = (aj). 


Q11 412 *** Gin 
Q21 422 *** Gan 

D = det A = det (aw) 2| . . . . |, (4.55) 
απι Qn2 *** Ann 


is calculated in a recursive way using the Laplace expansion rule: 


det A = Y aj, Ay, (u fixed, expansion along the i-th row), (4.56a) 
v=1 

det A = SO ayy Αμ, (v fixed, expansion along the v-th column). (4.56b) 
μξι 


Here A,, is the subdeterminant belonging to the element apy multiplied by the sign factor (—1)^*". 


Aw is called the cofactor or algebraic complement. 

2. Subdeterminants 

The subdeterminant of order (n — 1) belonging to the element apy of a determinant of order n is the 
determinant obtained by deleting the j-th row and the v-th column. 

W Expansion of a determinant of order four along the third row: 


Q11 012 413 Q14 dj Q13 44 


21 423 Q24 
Q41 G43 Q44 


Q11 012 G14 
a21 022 Q24 
G41 A42 Q44 


Q11 012 Q13 
021 022 Q23 


a12 Q13 Q14 
= Q31 | 192 Q23 Q24 
a42 Q43 Q44 


021 Q22 023 124 


— 432 + 33 αμα . 


031 032 033 Q34 
Q41 Q42 143 141 


Q41 G42 G43 


4.2.2 Rules of Calculation for Determinants 

Because of the Laplace expansion the following statements about rows are valid also for columns. 
1. Independence of the Value of a Determinant 

The value of a determinant does not depend on which row was chosen. 

2. Substitution of Cofactors 


If during the expansion of a determinant the cofactors of a row are replaced by the cofactors of another 
one, then one gets zero: 


Σαμ Αλν = 0 (u, A fixed; À Z u). (4.57) 
v=1 


This relation and the Laplace expansion result in 

Aaa A = A Aga = (det A)T. (4.58) 
The adjoint matrix of A, which is the transpose of the matrix made from the cofactors of A, is denoted 
by Αι. There must not be a confusion of this adjoint matrix with the transposed conjugate of a 
complex matrix AP (see (4.4), p. 269). From the previous equality one gets the inverse matrix 


Απ. = (4.59) 


1 
WE MT 
det A^ ^9" 
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3. Zero Value of a Determinant 
A determinant is equal to zero if 
a) a row contains zero elements only, or 
b) two rows are equal to each other, or 
C) a row is a linear combination of the others. 
4. Changes and Additions 
The value of the determinant does not change if 
a) its rows are exchanged for its columns, i.e., reflection in the main diagonal does not affect the value 
of it: 
det A — det AT, (4.60) 
b) any row is added to or subtracted from another one, or 
c) a multiple of any row is added to or subtracted from another one, or | 
d) a linear combination of other rows is added to any row. 
5. Sign on Changing Rows 
If two rows are interchanged in a determinant, then the sign of the determinant changes. 
6. Multiplication of a Determinant by a Number 
The value of a determinant will be multiplied by a if the elements of a row are multiplied by this number. 


The next formula shows the difference between this and the multiplication of a matrix A of size (n,n 
by a number a 


det (aA) = a"det A. (4.61 


7. Multiplication of Two Determinants 
The multiplication of two determinants can be reduced to the multiplication of their matrices: 


(det A)(det B) = det (AB). (4.62 
Since det A = det AT (see (4.60)), we have the equalities 
(det A)(det B) = det (AB) = det (ABT) = det (ATB) = det (ATBT), (4.63 


i.e., it is permissible to take the scalar product of rows with columns, rows with rows, columns with 
rows or columns with columns. 

8. Differentiation of a Determinant 

Suppose the elements of a determinant of order n are differentiable functions of a parameter t, i.e., 
auv = αμν({). In order to differentiate the determinant with respect to t, one differentiates one row at 
one time and finally one adds the n determinants. 


W For a determinant of size (3, 3) follows: 


1 Uu u 
411 012 413 Q11 019 043 11 012 Q13 411 012 013 
/ 1 / 
ag | 021 922 023 | = | Q21 Q22 Q23 + |a 055 A23 | + | Q21 402 Q23 | . 
À $ / 1 
a31 032 433 a31 432 433 a31 032 433 431 035 033 


4.2.3 Evaluation of Determinants 


1. Value of a Determinant of Second Order 


11 012 
= 441022 — 2112. (4.64) 


21 022 


2. Value of a Determinant of Third Order 
The Sarrus rule gives a convenient scheme for the calculations, but it is valid only for determinants of 
order three. It is the following: 


280 4. Linear Algebra 


ü21 ..022....02 a22 = 041023033 + 012023031 + 013021032 
3j. 32 Q33| d31 432 


aus κακο 012 
f 


— (a31a22013 + 33023011 + 033021032). (4.65) 
The first two columns are copied after the determinant, then the sum of the products of the elements 
along the redrawn declining segments is calculated, then the sum of the products of the elements along 
the dotted inclining segments is subtracted. 
3. Value of a Determinant of n-th Order 
By the expansion rule the calculation of the value of a determinant of order n is reduced to the evalu- 
ation of n determinants of order (n — 1). But for practical reasons (to reduce the number of required 
operations), first one transforms the determinant with the help of the rules discussed above into a form 
such that it contains as many zeros as possible. 


299 4) 12 59 4 2 53 4 2 

4 8 ὃ-δι |403-5| "^4 01-5 ΠΣ ἡ 13 4 
126 4| |1 06 4 1 02 4 

(rule 4) (rule 6) — 0 (rule 3) 

11 0 - 

--21|41-—5 =-2 (371 ΗΕ... 
19 4 
(rule 4) 


Remark: An especially efficient method to determine the value of a determinant of order n can be 
obtained by transforming it in the same way as it is done in order to determine the rank of a matrix 
(see 4.1.4, 7., p. 274), i.e., all the elements under the diagonal a11, a22,..., @nn are equal to zero. Then 
the value of the determinant is the product of the diagonal elements of the transformed determinant. 


4.3 Tensors 


4.3.1 Transformation of Coordinate Systems 


1. Linear Transformation 
By the linear transformation 


d, = aj, + απο + d1333 
= AX or 2» = 3144 + ἄοο Το + 0233133 (4.66) 
T4 = Agi LX, + A32%q + 43323 


[1 


a coordinate transformation is defined in the three-dimensional space. Here x, and 1, (μ.-- 1, 2, 3) are 
the coordinates of the same point but in different coordinate systems K and K. 

2. Einstein's Summation Convention 

Instead of (4.66) one can write 


3 
n= lant, (μ-1,3.3) (4.67a) 
vel 


or briefly by Einstein 

Ly = Gy, (4.67b) 
i.e., it is to calculate the sum with respect to the repeated index v and put down the result for u = 1, 2,3. 
In general, the summation convention means that if an index appears twice in an expression, then the 
expression is added for all values of this index. If an index appears only once in the expressions of an 
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equation, for instance y in (4.67b), then it means that the equality is valid for all possible values of this 
index. 


3. Rotation of a Coordinate System 
If the Cartesian coordinate system K is given by rotation of the system K, then for the transformation 
matrix in (4.66) A = D is valid. Here D = (ἆμν) is the orthogonal rotation matrix. The orthogona 
rotation matrix D has the property 

D” =D". (4.68a 
The elements d, of D are the direction cosines of the angles between the old and new coordinate axes. 
From the orthogonality of D, i.e., from 

DD'=I ad D'D=I, (4.68b 
it follows that 


3 3 
Y ἁμιάνι = ὃμν, SP diudiv = ὃμν (u,v = 1,2,3). (4.68c 
i=1 k=1 
The equalities in (4.68c) show that the row and column vectors of the matrix D are orthonormalized, 
because ὃμν is the Kronecker symbol (see 4.1.3, 10., p. 271). 

The elements ἆμν of the rotation matrix can be determined by the Cardan angles (see 3.5.3.5, p. 214 
or Euler angles (see 3.5.3.6, p. 215). For rotation in the plane see 3.5.2.2,2., p. 191; in space see 3.5.3.3, 
p. 213. 


4.3.2 Tensors in Cartesian Coordinates 


1. Definition 

A mathematical or a physical quantity T can be described in a Cartesian coordinate system K by 3” 
elements t;j...m, the so-called translation invariants. Here the number of indices i, j,...,m is exactly 
equal to n (n > 0). The indices are ordered, and every of them takes the values 1,2 and 3. 


If under a coordinate transformation from K to K for the elements lij... according to (4.66) 


3 
Laon — Σ 
i=l 


is valid, then T is called a tensor of rank n, and the elements t;;...m (mostly numbers) with ordered 
indices are the components of the tensor T. 

2. Tensor of Rank 0 

A tensor of rank zero has only one component, i.e., it is a scalar. Because its value is the same in every 
coordinate system, one talks about the invariance of scalars or about an invariant scalar. 

3. Tensor of Rank 1 

A tensor of rank 1 has three components t4, t2 and t3. The transformation law (4.69) is now 


Me 


3 
e 5 QyiQyj * * * Gomtij--m , (4.69) 
m=1 


1 


2 


3 
t= X tae (uw = 1,2,3). (4.70) 
i=1 


It is the transformation law for vectors, i.e., a vector is a tensor of rank 1. 
4. Tensor of Rank 2 


If n = 2, then the tensor T has nine components £;;, which can be arranged in a matrix 


ἔτι tig tis 
T =T= | ta t» tz |. (4.71a) 


t31 t32 t33 
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The transformation law (4.70) is now: 


3 3 
lu = X 5 Quidyjlij (u,v = 1,2,3). (4.71b) 
i=l j=l 


So, a tensor of rank 2 can be represented as a matrix. 
ΒΑ: The moment of inertia O, of a solid with respect to the line g, which goes through the origin 
and has direction vector à = aT, can be represented in the form 


Or TN Θα A Ox: 
Θ, =a'@a 4.724 with © -(8)-|-864 9, —O |, (4.72b) 
g 2 y: 

-O,. -Oy. O, 


the so-called inertia tensor. Here O,, Oy and O, are the moments of inertia with respect to the coordi- 
nate axes, and Ory, O,. and Θμ. are the deviation moments with respect to the coordinate axes. 


W B: The load-up conditions of an elastically deformed body can be given by the tension tensor 
011 712 σι» 
g = | 021 922 023 |. (4.73) 
031 032 933 
The elements oj, (i, k = 1,2, 3) are determined in the following way: At a point P of the elastic body 
a small plane surface element is chosen whose normal vector points to the direction of the x-axis of 
a right-angle Cartesian coordinate system. The power per surface unit on this element, depending on 
the material, is a vector with coordinates στι, σιο and σα. The other components can be explained 
similarly. 
5. Rules of Calculation 
1. Elementary Algebraic Operations The multiplication of a tensor by a number, and addition 
and subtraction of tensors of the same rank are defined componentwise, similarly to the corresponding 
operations for vectors and matrices. 
2. Tensor Product Suppose there are given a tensor A of rank m and a tensor B of rank n with 
components αμ... and brs... respectively. Then the 3™+” scalars 
Diosc E pre Dco (4.74a 
give the components of a tensor C of rank m+n. It is denoted by C = AB and it is called the tensor 
product of A and B. The associative and distributive laws are valid: 
(AB)C=A(BC), A(B+C)=AB+AC. (4.74b 


3. Dyadic Product The product of two tensors of rank 1 A = (a1,a5,a3) and B = (by, be, bs 
gives a tensor of rank 2 with the elements 


Cij = aid; (49 —1,2,3); (4.75a 
i.e., the tensor product results in the matrix 
a,b, aibo aybs 
azbı azby azbz |. (4.75b 
azbı azbz azb 


This will be denoted as the dyadic product of the two vectors A and B. 


4. Contraction Setting two indices equal to each other in a tensor of rank m (m > 2), and summing 
with respect to them, then one gets a tensor of rank m — 2, which is called the contraction of the tensor. 
Β The tensor C of rank 2 of (4.75a) with c;; = a;b; , which is the tensor product of the vectors A = 
(αι, a2, a3) and B = (by, b2, b3), can be contracted by the indices i and J, 


aibi = a,b, + a303 + a3b3 (4.76) 


4.8 Tensors 283 


giving a scalar, which is a tensor of rank 0. This gives the scalar product of vectors A and B. 


4.3.3 Tensors with Special Properties 
4.3.3.1 Tensors of Rank 2 


1. Rules of Calculation 
For tensors of rank 2 the same rules are valid as for matrices. In particular, every tensor Τ can be 
decomposed into the sum of a symmetric and a skew-symmetric tensor: 


T= 5 (T 4 T7) 4 E (T -17). (4.77a) 
A tensor T = (tij) is called symmetric if 

lj = ty for all i and j (4.77b) 
holds. In the case 

tij 2 —tji for all ? and j (4.77c) 


it is called skew- or antisymmetric. Obviously the elements t11 , t22 and t33 of a skew-symmetric tensor 
are equal to zero. The notion of symmetry and antisymmetry can be extended for tensors of higher 
rank if refering to certain pairs of elements. 

2. Transformation of Principal Axes 

For a symmetric tensor T, i.e., if tu = ty, holds, there is always an orthogonal transformation D such 
that after the transformation the tensor has a diagonal form: 


. ia 0 0 
T =| 0 ty 0 |. (4.782) 
0 0 fas 


The elements f;; , t22 and fsz are called the eigenvalues of the tensor T. They are equal to the roots λι, 
Ag and λα of the algebraic equation of third degree in A: 
Πλ ἐν tis 

ta t-A tes | = 0. (4.78b) 

t31 t32 t33 — A 
The column vectors d; , dy and d of the transformation matrix D are called the eigenvectors corre- 
sponding to the eigenvalues, and they satisfy the equations 

Τά, = Xd, (v = 1,2,3). (4.78c) 

Their directions are called the directions of the principal axes, and the transformation T to diagonal 
form is called the transformation of the principal axes. 


4.3.3.2 Invariant Tensors 
1. Definition 


A Cartesian tensor is called invariant if its components are the same in all Cartesian coordinate sys- 
tems. Physical quantities such as scalars and vectors, which are special tensors, do not depend on the 
coordinate system in which they are determined; they must not change their value either under trans- 
lation of the origin or rotation of a coordinate system K. One talks about translation invariance and 
about rotation invariance or in general about transformation invariance. 


2. Generalized Kronecker Delta or Delta Tensor 
If the elements t;; of a tensor of rank 2 are the Kronecker symbols, i.e., 
1 fori—j, 


pci { 0 otherwise, (4-79a) 
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then from the transformation law (4.71b) in the case of a rotation of the coordinate system considering 
(4.68c) follows 

tay = ἁμιάν; = Ow » (4.79b) 
i.e., the elements are rotation invariant. Putting them into a coordinate system so that they are inde- 
pendent of the choice of the origin, i.e., they will be translation invariant, then the numbers δρ; form a 
tensor of rank 2, the so-called generalized Kronecker delta or delta tensor. 


3. Alternating Tensor 
If &j, €; and ἔμ are unit vectors in the directions of the axes of a right-angle coordinate system, then for 
the mixed product (see 3.5.1.6, 2., p. 185) holds 
1, if i,j,k cyclic (right-hand rule), 
Eijk = & (€; X Ek) = 4 —1, ifi, j,k anticyclic, (4.80a) 
0, otherwise. 
Altogether there are 3? = 27 elements, which are the elements of a tensor of rank 3. In the case of a 
rotation of the coordinate system from the transformation law (4.69) it follows that 
dj di dor 


ἆμο ἀνα doa | = Εμνρ, (4.505) 


ἐμνρ = dpidvjdpkEijk = 


ἆμα dya da 
i.e., the elements are rotation invariant. Putting them into a coordinate system so that they are in- 
dependent of the choice of the origin, i.e., they are translation invariant, then the numbers ejj; form a 
tensor of rank 3, the so-called alternating tensor. 
4. 'Tensor Invariants 
There must not be a confusion between tensor invariants and invariant tensors. Tensor invariants are 
functions of the components of tensors whose forms and values do not change during the rotation of 
the coordinate system. 
Bb A: If for instance the tensor T = (tj) is transformed in T = (ἕν) by a rotation, then the trace 
(spur) of it does not change: 

Tr(T) = t + t22 + t33 = tu doa das. (4.81) 
The trace of the tensor T' is equal to the sum of the eigenvalues (see 4.1.2, 4., p. 270). 
E B: For the determinant of the tensor T = (t;;) 


ti tie {18 tu te tis 
t21 022 123 | = | tor t22 t23 (4.82) 
t31 032 t33 t31 t32 t33 


is valid. The determinant of the tensor is equal to the product of the eigenvalues. 


4.3.4 Tensors in Curvilinear Coordinate Systems 

4.3.4.1 Covariant and Contravariant Basis Vectors 

1. Covariant Basis 

By the help of the variable position vector are introduced the general curvilinear coordinates u, v, w: 
T= F (u,v, w) = z(u, v, w)& + y(u, v, w)éy + z(u, v, w)e;. (4.83a) 

The coordinate surfaces corresponding to this system can be got by fixing the independent variables 

u,v, w in F (u,v, w), one at a time. There are three coordinate surfaces passing through every point of 

the considered region of space, and any two of them intersect each other in a coordinate line, and of 

course these curves pass through the considered point, too. The three vectors 


Or Or Or 


due a (4.83b) 
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point along the directions of the coordinate lines in the considered point. They form the covariant basis 
of the curvilinear coordinate system. 


2. Contravariant Basis 


The three vectors 


1 a or 1 OF yg OF 1 OF F (4.842) 
DiOv δω) DNOw du}? Ὁ δι΄ dv Lus 


with the functional determinant (Jacobian determinant see 2.18.2.6,3., p. 123) 


Dix.y.z Bi Bin. Bi 
Bol ήν (4.84b) 
D(u,v, w) ανα 0, 


are always perpendicular to the coordinate surfaces at the considered surface element and they form 
the so-called contravariant basis of the curvilinear coordinate system. 


Remark: In the case of orthogonal curvilinear coordinates, i.e., if 


ti tie ti: tn tie tis| η» ap am aS ο: AS 
D(x. y. Ἡ tig tig fu ἴῃ» tis} gs gr Or Or Or Or 
DU quay = | tar t22 t23 | = | tor t22 tos "Z= : ad =0, "E =0 (4.85) 
D(u, v, w) ; y - | Ou Ov Ou Ow Ov Ow 
e 131 132 t33 t31 t32 t33 


then the directions of the covariant and contravariant basis are coincident. 


4.3.4.2 Covariant and Contravariant Coordinates of Tensors of Rank 1 


In order to be able to apply the summation convention of Einstein the following notation is introduced 
for the covariant and contravariant basis: 


δ᾽. , Or , C NP d 
Bp I m πα Ue 486 
1 LE OF E 1 Or 3r 42 1 or | Or - 23 (4.86) 
D\dv Ow 7^ D\aw * δι I> Dau 8v τ 

Then the following representations hold for v: 
$—Vgo-Vie-Vg-Vg o v—Vj + Vg? + Vg? (4.87 


The components V* are the contravariant coordinates, the components V; are the covariant coordinates 
of the vector v. For these coordinates the equalities 


vr = g" Vi and Vi = gi V (4.88a 


are valid, where 


σι = Jik = Fk ` Hi and g^ =g" -gh.g (4.88b 
respectively. Furthermore using the Kronecker symbol the equality 

de F! = On, (4.89a 
holds, and consequently 

g” Gm = km: (4.89b 


The transition from V^ to V, or from V, to V^ according to (4.88b) is described by raising or lowering 
the indices by oversliding. 


Remark: In Cartesian coordinate systems covariant and contravariant coordinates are equal to each 
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other. 


4.3.4.3 Covariant, Contravariant and Mixed Coordinates of Tensors of 
Rank 2 
1. Coordinate Transformation 


In a Cartesian coordinate system with basis vectors ΕἸ , €; and δη a tensor T of rank 2 can be represented 
as a matrix 


ti tig tis 
T= (5 too ta (4.90) 
t31 {30 t33 
To introduce curvilinear coordinates u4, uz, uz the following vector is used: 
T = © (Uy, U2, U3)E1 + £2(U1, U2, U3)E2 + £3(U1, Us, Ug) e3 . (4.91) 
The new basis is denoted by the vectors gi, j» and ga. Now it holds: 


" or Oni, ὅσα. ὅσα, Oxy 


i Ou Ou m Ow ο. Ou as Qui is (58) 
Substituting £j = g' , then follows g and g' as covariant and contravariant basis vectors. 
2. Linear Vector Function 
In a fixed coordinate system with the tensor T given as in (4.90) by the equality 

Ü= T7 (4.93a) 
the following vector representations 

T= VJE =V" fe, T= Weg" = μα, (4.93b) 


define a linear relation between the vectors Ὁ and ὦ. So (4.934) is to be considered as a linear vector 
function. 
3. Mixed Coordinates 
Changing the coordinate system, the equality (4.93a) will have the form 

ὦ - ΤΟ. (4.944 
The relation between the components of T and T is the following: 
P Ouk OX 

thi = 3 . 742 tm.: 

Or, Oui 

Introducing the notation 

ty τὰ (4.94c 
one talks about mixed coordinates of the tensor; k contravariant index, | covariant index. For the com- 
ponents of vectors Ὁ and w holds 

W* = T*V!. (4.94d 
If the covariant basis gy is replaced by the contravariant basis g^, then one gets similarly to (4.94b) and 
(4.94c) 


(4.94b 


r ὅση Ou 
= Em δε, mn > 
and (4.94d) is transformed into 
Wr = TV. (4.95b 
For the mixed coordinates Τε! and T* holds the formula 


Τὴ = g"g T, - (4.95c 


T; 


(4.95a 
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4. Pure Covariant and Pure Contravariant Coordinates 
Substituting in (4.95b) for V; the relation V; = gimV"™, then one gets 


Wy = Τι GimV™ = Tem V” , (4.96a 
also considering that 
Te dim = Tims (4.96b 


The Tym are called the covariant coordinates of the tensor T, because both indices are covariant. Sim- 
ilarly one gets the contravariant coordinates 


qum aa g TE . (4.97 
The explicit forms are: 
Ty = mme. a, (4.98a) qu -. Ole Bu (4.98b 


= ‘A, “mn: 
OLm δον 


δι, Ow 


4.3.4.4 Rules of Calculation 


In addition to the rules described on 4.3.2, 5., p. 283, the following rules of calculations are valid: 

1. Addition, Subtraction Tensors of the same rank whose corresponding indices are both covariant 
or contravariant can be added or subtracted elementwise, and the result is a tensor of the same rank. 
2. Multiplication The multiplication of the coordinates of a tensor of rank n by the coordinates of 
a tensor of rank m results in a tensor of rank m + n. 

3. Contraction If making the indices of a covariant and a contravariant coordinates of a tensor of 
rank n (n 7 2) equal, then can be used the Einstein summation convention for this index, and one gets 
a tensor of rank n — 2. This operation is called contraction. 

4. Oversliding Oversliding of two tensors is the following operation: Multiply both, then making a 
contraction so that the indices by which the contraction is made belong to different factors. 

5. Symmetry A tensor is called symmetric with respect to two covariant or two contravariant stand- 
ing indices if when exchanging them the tensor does not change. 

6. Skew-Symmety Α tensor is called skew-symmetric with respect to two covariant or two con- 
travariant standing indices if when exchanging them the tensor is multiplied by —1. 

W The alternating tensor (see 4.3.3.2, 3.,p. 284) is skew-symmetric with respect to two arbitrary co- 
variant or contravariant indices. 


4.3.5 Pseudotensors 

The reflection of a tensor plays a special role in physics. Because of their different behavior with respect 
to reflection polar and axial vectors are distinguished (see 3.5.1.1, 2., p. 181), although mathematically 
they can be handled in the same way. Axial and polar vectors differ from each other in their determi- 
nation, because axial vectors can be represented by an orientation in addition to length and direction. 
Axial vectors are also called pseudovectors. Since vectors can be considered as tensors, the general 
notion of pseudotensors is introduced. 


4.3.5.1 Symmetry with Respect to the Origin 


1. Behavior of Tensors under Space Inversion 
1. Notion of Space Inversion The reflection of the position coordinates of points in space with 
respect to the origin is called space inversion or coordinate inversion. In a three-dimensional Cartesian 
coordinate system space inversion means the change of the sign of the coordinates: 

(x,y,z) > (—2,-y, --ᾱ). (4.99) 
By this a right-hand coordinate system becomes a left-hand system. Similar rules are valid for other 
coordinate systems. In the spherical coordinate system holds: 


(τ, 9,9) > (--τιπ — 9, 4- m). (4.100) 
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Under this type of reflection the length of the vectors and the angles between them do not change. The 
transition can be given by a linear transformation. 

2. Transformation Matrix According to (4.66), the transformation matrix A = (αμν) of a linear 
transformation of three-dimensional space has the following properties in the case of space inversion: 


Gy = --ὃμν, det A = —1. (4.101a) 
For the components of a tensor of rank n (4.69) 
lum (S an (4.101b) 


holds. That is: In the case of point symmetry with respect to the origin a tensor of rank 0 remains a 
scalar, unchanged; a tensor of rank 1 remains a vector with a change of sign; a tensor of 
rank 2 remains unchanged, etc. 


i M z TA 
; 180° , D q 
Y t.i | 
y [ € r^ ~ | 
uy t, ΄ 


Figure 4.3 


2. Geometric Representation 

The inversion of space in a three-dimensional Cartesian coordinate system can be realized in two steps 
(Fig.4.3): 

1. By reflection with respect to the coordinate plane, for instance the x, z plane, the coordinate system 
x,y,z turns into the coordinate system x, —y, z. A right-hand system becomes a left-hand system (see 
3.5.3.1,2., p. 209). 

2. By a rotation of the system x, y, z around the y-axis by 180° we have the complete coordinate system 
x,y,z reflected with respect to the origin. This coordinate system stays left-handed, as it was after the 
first step. 

Conclusion: Space inversion changes the orientation of a polar vector by 180°, while an axial vector 
keeps its orientation. 


4.3.5.2 Introduction to the Notion of Pseudotensors 


1. Vector Product under Space Inversion Under space inversion two polar vectors a and b 
are transformed into the vectors —a and — b, i.e., their components satisfy the transformation formula 
(4.101b) for tensors of rank 1. However, if considering the vector product c — a x b as an example 
of an axial vector, then one gets c = c under reflection with respect to the origin. This is a violation 
of the transformation formula (4.101a) for tensors of rank 1. Therefore the axial vector c is called a 


pseudovector or generally a pseudotensor. 

W The vector products r x 0,7 x F, V x U= rot U with the position vector r, the speed vector v, the 
power vector F and the nabla operator V are examples of axial vectors, which have “false” behavior 
under reflection. 

2. Scalar Product under Space Inversion If using space inversion for a scalar product of a 
polar and an axial vector, then again there is a case of violation of the transformation formula (4.101b) 
for tensors of rank 1. Because the result of a scalar product is a scalar, and a scalar should be the 
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same in every coordinate system, here it is a very special scalar, which is called a pseudoscalar. It has 
the property that it changes its sign under space inversion. Pseudoscalars do not have the rotation 
invariance property of scalars. 

W The scalar product of the polar vectors F (position vector) and v (speed vector) by the axial vector 
ὦ (angular velocity vector) results in the scalars F- @ and v- ὤ, which have the “false” behavior under 
reflection, so they are pseudoscalars. 

3. Mixed Product under Space Inversion The mixed product (a x b): c (see 3.5.1.6, 2., p. 185) 
of polar vectors a, b, and c is a pseudoscalar according to (2.), because the factor (a x b) is an axial 
vector. The sign of the mixed product changes under space inversion. 

4. Pseudovector and Skew-Symmetric Tensor of Rank 2 The tensor product of axial vectors 
a = (a1, @2,a3)" and b = (bı, be, b3)” results in a tensor of rank 2 with components t;; = a;b; (i,j = 
1,2, 3) according to (4.74a). Since every tensor of rank 2 can be decomposed into a sum of a symmetric 
and a skew-symmetric tensor of rank 2, according to (4.81) 


1 1 
ty = 5 (αι; + ajbi) 4 7 (aib; — ajb;) (i, j= 1,2,3) (4.102) 
holds. The skew-symmetric part of (4.102) contains exactly the components of the vector product 


(a x b) multiplied by 1, so the axial vector c = (a x b) with components c1, c», c3 can be considered 


as a skew-symmetric tensor of rank 2 


0 ο σα C23 = a2b3 — azb = c, 
C-—c-—|-c 0 c3 (4.103a) where C31 = a3b, — ayb3 = co, (4.103b) 
—C13 —C23 0 C12 = 403 — αοὖι = C3, 


whose components satisfy the transformation formula (4.101b) for tensors of rank 2. 


Consequently every axial vector (pseudovector or pseudotensor of rank 1) ¢ = (ci, c», c3)? can be con- 
sidered as a skew-symmetric tensor C of rank 2: 
0 ὦ -ῶ 
C=c=|-c3 0 ο |. (4.104) 
C9) —& 0 


5. Pseudotensors of Rank n The generalization of the notion of pseudoscalar and pseudovector 
is a pseudotensor of rank n. It has the same property under rotation as a tensor of rank n (rotation 
matrix D with det D = 1) but it has a (—1) factor under reflection through the origin. Examples of 
pseudotensors of higher rank can be found in the literature, e.g., [4.1]. 


4.4 Quaternions and Applications 


Quaternions were defined by Sir Wiliam Rowan Hamilton in 1843. The basic question which resulted 
the discovering of quaternions was that how could division of vectors in the three dimensional Euclidian 
space be defined. It is possible by embedding them into Β΄, and introducing the quaternion multipli- 
cation, what leads to the division ring of quaternions. 


Quaternions, like complex numbers, both are special cases of a Clifford-algebra of order n, with 2” 
generalized numbers as basis (see [4.5], [22.22]): 


gn 


A= Σ ira (i, hyper-complex elements, a; complex numbers). (4.105a) 
[mr 


The following special cases have practical importance: 
n = 1: 2-dimensional complex numbers with 

i = l,i = i (i imaginary unit), a),a (real numbers). (4.105b) 
n = 2: Quaternions as 4-dimensional numbers with hyper-complex elements 


i = 1, ip =i, i = j, i4 = k (hyper-complex elements), a4,a5,03,a4 (real numbers) (4.105c) 
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and the rules of multiplication (4.107). In physics the PAULI’s spin matrices and spinors are represented 
as quaternions. 

n = 3: Biquaternions (s. 4.4.3.6, 1., p. 306) 

n — 4: Clifford-numbers are known in physics as DIRAC-matrices. 

Quaternions are used most often to describe rotations. The advantages of the quaternions are: 

e the rotation is performed directly around the required axis, 

e the gimbal-lock problem does not occur. Gimbal is a pivoted support allowing the rotation around 
a single axis (e.g. gyrocompass), and gimbal lock means that the axes of two of the three gimbals are 
driven into parallel configuration. 

The disadvantage of quaternions is that only rotations can be described with them. To represent trans- 
lations, scaling or projections matrices are needed. This disadvantage can be overcome by biquater- 
nions, by which every motion of rigid bodies can be represented. 

Quaternion are used in computer-graphics, satellite-navigation, in vector analysis, in physics and in 
mechanics. 


4.4.1 Quaternions 
4.4.1.1 Definition and Representation 
1. Imaginary Units 


Quaternions are generalized complex numbers in the form 


w 1... (4.106) 
where w, x, y, z are real numbers, and the generalized imaginary units are i, j,k, which satisfy the 
following rules of multiplication: 


D-p-k'--L1 ij=k=-ji, jk-i--kj ki-j--ik (4.107) 


E 


ΠΝ The multiplication of the generalized units is shown in 


k 
ij—1/ k |-j the accompanying table of multiplication. Alternatively 
j-k|-1|i the multiplication rules can be represented by the cycle 
i >j 


J 

ΚΙ j |-i|-1 shown in Fig.4.4. Multiplication in the direction of an 
arrow results in a positive sign, opposite to the arrow di- 
Multiplication table Figure 4.4 rection results in a negative sign. 


Consequently, the multiplication is not commutative but associative. The four-dimensional Euclidian 
vector space IR^ provided with quaternion-multiplication is denoted by H in honour of R.W. HAMIL- 
TON. Quaternions form an algebra, namely the division ring of quaternions. 


2. Representation of Quaternions 

There are different representation of quaternions: 

e as hyper-complex numbers q = w + ix + jy + kz = qo + q with scalar part qo = Scq and vector part 
q = Vecq, 


e as four dimensional vector q = (w, x, y, z)? = (qo, q)” consisting of the number w € R. and the vector 


(zy) € R?, 
e in trigonometric form q = r(cosy + ng sing), where r = |g| = Vw? + a? + y? + 2? is the length of 


w 1 
, and ng = x,y,z) 
πμ Tey n? 


the four dimensional vector in R4, and cos p= . Ng is a unit vector 


in R?, depending on q. 
Remark: The multiplication rules for quaternions differ from the usually introduced rules in IR? and 
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Ε΄ (see (4.109b), (4.114), (4.115)). 


3. Relation between Hyper-complex Number and Trigonometric Form 
qo qo q lal 


q=%0+q= || ( | 2) = |q| ( Fc a) =r(cosy+ngsiny), (4.108) 
lal al la| lal ld] a 
if |q| # 0. If|q| = 0, then there is 


q = qo = lol (4.108b) 
if qo #0. 

4. Pure Quaternions 

A pure quaternion has a zero scalar part: go = 0. The set of pure quaternions 1 is denoted by Ho. It is 
often useful to identify the pure quaternions q with the geometric vector q € R?, ie. 


do — f |qo| = [go] cos 0 for qo > 0, 
lgol ἐ |4ο[(--1) = |qo| cos. for qo < 0, 


, if q represents a pure quaternion, 
if q is interpreted as a geometric vector. 


q= qo + E (4.109a) 


The multiplication rule for p, q € Ho is 
pa--p:dctpxd, (4.109b) 
where : and x denote the dot-product and cross-product in R?, respectively. The result of (4.109b) is 


to interpret as a quaternion. 


ὃν ὃν δ, z 
WB Let V = Jz zc ài Lt aK be the nabla-operator (see 13.2.6.1, p. 715), and let V = vı (x, y, z)i + 
y 
υο(α, y, 2) j + va(z, y, z z)k be a vector-field. Here i, 1, K are unit-vectors being parallel to the coordinate 
axes in a Cartesian coordinate system. If V and V are interpreted as pure quaternions then according 
to (4.107) their product is: 


το. Ov, Qu) Qus εὖ Ov3 Όυ (T Ov, vz T Qv) Qu, 
Or ὃν δι  VOy ὃς Jio δε] ὃν. yj 


This quaternion can be written in vector interpretation: 
Vy = -div V + rot V, 


but the result should be considered as a quaternion. 

5. Unit Quaternions 

A quaternion q is a unit quaternion if |g] = 1. The set of unit quaternions is denoted by H;. H; is 
a so-called multiplicative Lie-group. The set H, can be identified with the three dimensional sphere 
S? ={xe Ri: |x| =1} 

4.4.1.2 Matrix Representation of Quaternions 


1. Real Matrices 
If the number 1 is identified with the identity matrix 


(4.110a) 


furthermore 
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mn 


l> 


0 0001 
0 a | 0 0-10 
1 kt 9199 
0 -1000 
then a quaternion q = w + ix + jy + kz can be represented as a matrix 
w -ᾱ-Ἡ z 
a| £ w —z-—y 
“ly 2 vr 
—z y -r w 


2. Complex Matrices 


Quaternions can be represented by complex matrices: 
QA(0 -ἲ ;Af(0-1 kA —i 0 
40)? 9 711075: ES i 


ο Af wr-iz —ir—y 
q=w+is+jy+kz t (SO vs 


So 


Remarks: 


(4.110b) 


(4.110c) 


(4.111a) 


(4.111b) 


1. On the right hand side of equations (4.111a,b) i represents the imaginary unit of complex numbers. 
2. Matrix representation of quaternions is not unique, i.e. it is possible to give representations different 


from the ones in (4.110b,c) and (4.111a,b). 

3. Conjugate and inverse Element 

The conjugate of quaternion q = w + ix + jy + kz is the quaternion 
j—-w-ir-—jy-—kz. 

Obviously 
la? -aqd—4dq— w^ c xy ασ. 

Consequently every quaternion q € H V {0} has an inverse element 

-1 q 


q —13 
lal? 


4.4.1.3 Calculation Rules 
1. Addition and Subtraction 


Addition and subtraction of two or more quaternions are defined as 
αι tq2— 43... 


(ωι + iz + jyi + kx) + (we + ire 


t 
(wx 4 
+k(z1 + 22 — z3 + ...). 
Quaternions are added and subtracted as vectors in R4, or as matrices. 
2. Multiplication 


The multiplication is associative, so 


dido = ( wi tity + jy + κσι)(ω» + ive + juo + kz) 


I 
Τ 
w — wz +...) Επι + z2 — z3 +...) gn ya — ya... 


(wwz — απο — yiya — 2122) (wre + W221 + iz» — zy) 


+ j(wiy2 + wey + zia — 2923) «πο + W221 ο 


Using the usual vector products in IR? (see 3.5.1.5, p. 184) it can be written in the form 


9142 = (Gor + d,)(d02 + dy) = Yor Go2 — di: da + qı X Ga, 


(4.112a) 


(4.112b) 


(4.1120) 


(4.113) 


(4.114) 


(4.115) 
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where d; - d» is the dot-product, and d, x d» is the cross-product of the vectors G1, do € R®. Next 
is to identify the R? with the space Ho of the pure quaternions. 

Remark: Multiplication of quaternions is not commutative! 

The product qıq2 corresponds to the matrix multiplication of matrix L, with vector q2, and it is equal 
to the product of matrix R4, with q1: 


Wy -Ώι -ϑι —4 w2 W2 ο —Y2 —22 ΠΠ 

- [71 Wi σι Yi πο | _ _ | %2 W2 22 —Y2 Tı 
1192 =Lq q2 = eo own allay | Re =| ge ee ας dh wl 4.116) 

uoa 1 τῶι ya Y2 2 W2 T2 yı 

21 -ῃι τι Wy 22 ο Y2 —% W2 ZL 


3. Division 
The definition of division of two quaternions is based on the multiplication: q1, q € H, q» 4 0, 


qi E q2 


=H =a. (4.117) 

q2 lal 
The order of the factors is important. 

1 
= j, ο, = 4(1 -- on lol = 1, Jo = — and s 

E Let qg = 1+j, o = σα k), then |go| = 1, d; al +k) and so 

qi q2 1 $3 q2 1 bs ics 

= = 1+i+j+k — 1-i+j+k). 

αμ νο ο I ρα = yl τον 
4. Generalized Moivre Formula 
Let q € H, whrer q = qo + q = r(cosy + nqsing) with r = |g| and v = arccos " , cosy = " i 

. = q q 

án = fl ο ο. 
sing = πι then for arbitrary k € N : 

gh = rest? = r^(cos(k p) + ng sin(k q)) . (4.118) 
5. Exponential Function 
For q = qo + q € Hits exponential expression is defined as 

oo që 

e = D πιω e® (cos |a] + nq sin |α]). (4.119) 
Properties of the exponential function: 
For any q € H holds: 

ete? =1, (4.120a) e! Z0,  (4120b) ef = ον τα = euet, (4.120c) 

ea" = —], especially e" = οτ = e" = --Ι. (4.120d) 

Unit quaternion u and 0 € R : e"" = cos + usin v. (4.1200) 


If qı qo = q2 qı then e?! *? = eù e?? . But it does not follow from e? **? = e! e? that qi q2 = qo qi. 
E Since ({π}(π) = kr? A —kz? = (jr) (iz) therefore also holds 


c" c* — (cos m)(cosm) = (—1)(—1) = 1, but ei = (os (v27) + ne sin (v2r)) Xd. 


6. Trigonometric Functions 
For q € H let 


cos q := ee + ea) ; sing := -04 Gi — gx . (4.121) 


NI 
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cos q is an even function, against which sin q is an odd function. 
Addition formula: It is valid for any q = qo +q € H 


COS q = COS qo cos q —sinqopsinq, sing = sin qo cos q + cos qo sing. (4.122) 


7. Hyperbolic Functions 
For q € H let 


1 
cosh q := 5 (et + ε) ! sinh q := -nq (ο - ε) , (4.123) 
cosh q is an even function, against which sinh q is an odd function. 
Addition formula: It is valid for any q = qo + q € H 
cosh q = cosh qo cosq — sinhgosinh q, sinh g = sinh qo cos q + cosh qo sinh q . (4.124) 


8. Logarithmic Function 
For q = @ +q = r(cosp +ngsiny) € H and k € Z the k-th branch of the logarithmic function is 


defined as 


Inr nq(o-c2kx),|g| Z 0 or |g| 2 0 and qo > 0, 
— a q q ; 5 
log q : { not defined for |qg| —0 and q <0. (4.125) 
Properties of the logarithmic function: 
οὐδε! — q for any q € H, for which log, qis defined, (4.126a) 
logg ef = q for any q € H with |q| # (U+ υπ, 1 € Z, (4.126b) 
log,1=0, (41260) logy i= 2 logoj = zh logok = sk. (4.126d) 


In the case when log q and log qz or qı and gz commute, then, if k is suitable defined, the following 
known equality (4.127) holds: 

log(qı q2) = log qı + log qo. (4.127) 
For the unit quaternions q € H; holds |g| = 1 and q = cosy + nq sin y and so 

log q := logo q =ngy for q 4-1, (4.128) 


9. Power function 
Let q € H anda € R., then 


gh --- eret, (4.129) 


4.4.2 Representation of Rotations in IR? 

Spatial rotations are performed around an axis, the so-called rotation axis. It goes through the origin. 
It is oriented by a direction vector a 4 0 (on the axis). The positive direction on this axis is chosen 
by a. The positive rotation (rotation angle ϱ > 0) is a counterclockwise rotation with respect to the 
positive direction. The direction vector is usually given normed, i.e. |a| = 1. 


The equality 

w=Rv, (4.130a) 
means vector W arises from vector V by the rotation matrix R, i.e., the rotation matrix R transforms 
vector V into W. Since rotation matrices are orthogonal matrices holds 


R!-R? (4.130b) 
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and (4.130a) is equivalent to 

¥V=R 'w=R'w. (4.130c) 
Remark: At spatial transformations it is necessary to distinguish between the followings: 
a) geometric transformations, i.e. when geometric objects are transformed with respect to a fixed co- 
ordinate system, and 
b) coordinate transformations, i.e. the object is fixed while the coordinate system is transformed with 
respect to the object (see 3.5.4, p. 229). 
Now, geometric transformations are handled with quaternions. 


4.4.2.1 Rotations of an Object About the Coordinate Axes 


In a Cartesian coordinate system the axes are oriented by the basis vectors. The rotation around the 
x axis is given by matrix R,, around the y axis by R, and around the z axis by R., , where: 


Y Ὁ 0 cosB 0 sing cosy — sin y 0 
R,(a) :— k cosa -sna ) , Ει(β) -( 0 1 0 | , Β.(η) = (9m COS y o) . (4.131) 


0 sina cosa — sin f 0 cos 8 0 0 1 
The relation between a rotation of an object and the rotation of the coordinate system (see 3.5.3.3,3., 
p. 213)) is 
R, (a) = D(a) ,R,(8) = ῬΤ(β).Β.(η) = DI). (4.132) 


Remark: The representation of the rotation matrices in homogenous coordinates is given in 3.5.4.5,1.,2., 
p. 234. 


4.4.2.2 Cardan-Angles 
Every rotation R around an axis passing through the origin can be given as a sequence of rotations 
around the coordinate axes in a given coordinate system (see also 3.5.3.5, p. 214), where here 
e the first rotation is around the z axis by an angle ac, then 
ο the second rotation is around the y axis by an angle 5c, then 
e the third rotation is around the z axis by an angle yc. 
The angles ac, Bc, Yc are called Cardan-angles. Then the rotation matrix is 
R = Rc := R;(5c)R, (8c) Ro (ac) (4.1332) 


cos Bc cos ye sin ac sin Sc cos yc — cos ac sin yc  cosac sin Bc cos yc + sin ac sin yc 
= | cos8csinyc sin ac sin fc sin yc + cos ac cos ye cos ac sin Bo sin yc — sin ac cos yc | .(4.133b) 
— sin Bo sin ac cos Bc cos ac cos Ba 
Adventages: 
e very popular representation of rotations, 
e clear structure. 
Disadventages: 
e the order of rotations is important i.e. in general holds 
R,(ac)R,(6c)R:(5c) # R:(3c) R,(Bc)R(ac) . (4.133c) 
ο the representation is not unique since R(ac, bc, yc) = R(—ac + 180°, Bc + 180°, yo + 180°) , 
e not suitable for rotations after each other (e.g. at animations), 
e gimbal lock can happen (rotation of an axis by 90° goes into an other axis) 
W Gimbal Lock case: rotation around the y axis by 90° 


( 0 sin(ac—^c)  cos(ac — Yc) | 


R(ac,90°,¥c) 2 | 0 cos(ac — 3c) —sin(ac — vc) (4.133d) 


-1 


It can be seen that one degree of freedom is lost. In practical applications it can lead to unpredictable 
motions. 
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Remark: It can be realized that Cardan-angles are called sometimes as Euler-angles, in the literature 
however their definitions can be different (see 3.5.3.6, p. 215). 


4.4.2.3 Euler Angles 


The Euler-angles v, Ὁ, p are often introduced as follows (see 3.5.3.6, p. 215): 
e the first rotation around the z axis by angle wv, 
e the second rotation around the image of x axis by angle Ὁ), 
e the third rotation around the image of z axis by angle y. 
The rotation matrix is 
R = Rer :- R.(y)R,(V)R.(v)) (4.134a) 
cos cosy — sin Y cosÜsing — cos ysin ọ — sin cos cosy sinwsind 
sin Y cos y + cosy cos sing — siny sin y + cosi cos cos — cossint | .(4.134b) 
sin Ü sin q sin Ü cos y cos Ὁ 


4.4.2.4 Rotation Around an Arbitrary Zero Point Axis 


The counterclockwise rotation around a normed vector ἃ = (a+, ay, az) with |a] = 1 by an angle ϱ is 
made in 5 steps: 

1. Rotation of a around the y axis until reaching the x,y plane: a’ = Rã using R4 according to 
(4.135a). The result: vector a’ is in the z, y plane. 

2. Rotation of a’ around the z axis until becoming parallel to the x axis a’’ = Rea’ by R3 according 
to (4.135b). The result: vector a’’ is parallel to the x axis. 


Axr 0 az 
a2 + a? a2 + a? a2 + a? αν 0 
R, = 0 1 0 , (4.135a R = = [13 3:22 .  (4.135k 
1 Ls Ἢ : ( 5a) 2 αν a? Γαξθ ( 5b) 
- - 0 - - 0 0 1 
a2 + a? a2 + a? 
3. Rotation around the x axis by an angle y by Βα: 
1 0 0 
R; =R,(y) = | 0 cosy -sing |. (4.135c) 
0 siny cosy 


Rotations R4 and R» are inverted in the following two steps. 
4. Inverse rotation of Re, i.e. rotation around the z axis by the angle 9 where sin 9 = ay, cos -- 


γα» + a? according to (4.135d). 


5. Inverse rotation of R4, i.e. rotation around the y axis by —o where 


sin(—a) = [He , cos(—a) around the y axis according to (4.135e). 
a2 + a2 Vd Γαλ 
Ay —à; 
yal-cta2 -ay 0 y a2 + a2 a2 4- a2 
R= ay [a2 a20 |, (4135d) ΒΤ = 0 1 ? . (4.1356) 
0 0 1 Z 
a2 + a2 a2 4- a2 


Finally the composition matrix is: 


R(& ϱ) = ἃ; ἃ; Β:Β»ΒΙ = (4.1358) 
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cos p + a2(1 — cosy) ακαφ(1 -- cos p) — a;sing a,a,(1— cosy) + aysin y 
aya,(l— cosy) d-a;sinq cosy + a2(1— cosy) aya;(l— cosy) — assin y |. (4.135g) 
a;d,(l— cos p) — aysin azay(l— cosy) +a,siny cosy + a2(1 — cosy) 


Matrix R(&, v) is an orthogonal matrix, i.e. its inverse is equal to its transpose: R^! (&, ϱ) = R™(a, ϱ). 
The following formulas are also valid: 


Rx = R(a, ϱ)Χ 
(cos y)X + (1 — cos 2 | τα xx (4.136a) 
a 
= (cos y)X + (1 — cos y)Xg + (sin Pu x X. (4.136b) 


In these formulas the vector X is decomposed into two components, one is parallel, the other is perpen- 
X-a_ ' eee ss 

αρ a, the perpendicular part is r = X — Χα. The orthogonal 

a? 

part is in a plane whose normal vector is a, so its image is cos gr + sin yr*, where r* is obtained from 


dicular to à. The parallel part is Xg — 


-- DENT DR PE MR ; oe 
T by a 90° rotation in positive direction: r* = —a x r. The result of the rotation of vector X is 


X-a X-a 1 
Xa + cos yr + sin gr^ = | a+ (cosg) (x 3 a) + (sin Ir xr (4.136c) 
a a 
x(X-X,-—axx. (4.136d) 
Advantages: 
e „Standard representation" in computer-graphics, 
ο CARDAN-angles should not be determined, 
e no gimbal lock. 


Disadvantage: Not suitable for animation, i.e. for interpolation of rotations. 
4.4.2.5 Rotation and Quaternions 


If the unit vector ἃ in (4.135f) is identified as the pure quaternion a (while the angle of rotation p 
remains the same), then one gets: 


αὖ ἡ αἱ —-45—4$ Ἅφιῳ = 2404: 5 20193 + 2092 
R(a, ϱ) = | 204% + 29098 d-i +È- di 29203 - 24091 = R(q) (4.13Τα) 
20193 — 29041 2q2q3 + 2904ι d) — di — d5 ^ di 


where gg = cos T andq = (q, q»; g) = αν, ay, az)? sin? „i.e. gis the unit quaternion q = g(a, p) = 
cos τ + asin £ € H; . If vector X is considered as R? 5 X = zji + πο] + x4k € Ho, then 


R(a,g)x = R(q) x = qx. (4.137b) 
Especially the columns of the rotation matrix are vectors εις: 


1 0 0 
Rev- (afo)a (1): JODES qjq ακπ). (4.137€) 


Consequences: 


^ 


e The matrix of rotation can be determined with the help of quaternion q — cos » T asin A 
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ο For the rotated vector R(a, y)x holds R(a, y)x = qxq in the sense of quaternion multiplication 
and identifying IR? with the set of pure quaternions Ho . 


For every unit quaternion q € H; q and —q determine the same rotation, so H; is a double covering 
of SO(3). Performing rotations one after the other corresponds to multiplication of quaternions, i.e. 


R(q) Βία) = Rp q1); (4.138) 
and the conjugate quaternion corresponds to the inverse rotation: 

R-(q) = R(). (4.139) 
H Rotation by 60° around the axis defined by (1, 1,1)". First the direction vector should be normed: 

1 3 1 
a= ys l, 1 Then with sing = sin60° = and cosy = cos60° = 5 the rotation matrix 


ERI becomes 
1 


i ͵ 2-12 
R(—~(1,1,1)7, 60°} =—{ 2 2-1). 
(αι ) ) 34.12 2 


The quaternion describing the rotation is: 


q= agents ar) = cos 30° 4 ; (1 4- j 4- k) sin 30° 


V3 v3 
-Yy eere o Y a ο 
Furthermore 
1) (V8 VB.) (vB VB 
ΠΕ l g itd SHE ΠΩ 


(2 Bi j J (s | τ Bit i) 
24, 24, 12 
+ 36 


εις ' : 
νά Να. Ν΄ l 
The two other columns are determined analogously: 


a(i) ος. ο 


2 6 2 6 


dUL τα. κκ Sein) σαι ΠΕΙ 


1 6 2 6 


0)" δν δὴ 13) 


4.4.2.6 Quaternions and Cardan Angles 


The rotation matrix in Cardan angles (see (4.133a,b), p. 295) is exactly a matrix of rotation with a unit 
quaternion g € H,. 


Ἀσίας, Bc, yc) = R(yc) R,(&c) Rz (ac) (4.1402) 
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cos Bc cos yc sin ac sin Bc cos yc — cos ac sin yc cosacsin fo cos yc + sin ac sin yc 
= ( cos 8csinyo sin ac sin fc sin yc + cos ac cos yc | cosac sin fo sin ης — sin ag cos Yo | 4.140» 
— sin Bo sin ac cos Ba cos ac cos Ba 
dj-tdi—di—4di 2q4»—2dod; 2q193 + 240% 
= [ru]; = | 2042 + 20003 ας — +- a 24293 — 24001 = R(q) 4.140c 
24143 — 2qod» 2q»q5 + 240% dj—4i- didi 


0 0 
qi1jg q|0]g]l. 4.140d 
0 1 


Comparing the matrix elements one gets 
T21 š 1 

tangc = —, sinfg-—-ra, tanac = ; 

Tu T33 


(4.141a) 


In general, the solution is not unique, which is typical in trigonometric problems. However the unique- 
ness of angles can be reached by discussion of the defined domains. 
Reversed, it is easy to get the unit quaternion from the rotation matrix. 
Aqodi = T33 — T23, 4qoq2 = Tia — T31; 44093 = ra — "12, (4.141b 

4q — 1 = 4d — dj — d; — dj — 93 = Tu + T22 + T33. (4.141c 

Since q and —q define the same rotation, go can be determined as 
1 
do = ον Τη Ἔτου +T33 +1. (4.141d 


'The other components are 


T32 — T»; Tia — T: Ta —T 

= x go So... με (4.141e 
Aqo 440 4q0 

W Let the rotation matrix be the following: 


Va -3V6 ον» 
1 
Ree 1 1 
2| v2 ἀνθ την 
0 1 V3 


n= 


1. Determination of the Cardan angles: Based on the above formulas sin fc = --ται = 0, so fc = km, 
T. π Τα 1. 

k € Z. Furthermore tan yc E 1, so yc Hkr, k € Z, and from tan ac = 22 _ — it follows 
Ti 4 Tay V3 


T 
that ac = 6 + kr, k € Z. The angles are unique, if they are determined as the ,,possible smallest” 


; : T 
ones, i.e. the rotation whose angles have an absolute value < P So the angles are 


T T 
» βε-ο, : f 
ac 6 Pc Yo Ἴ 
2. Determination of the unit quaternion which results this rotation: 
; 1 1 1 1 1 
4qi— 1 3 (v2 H 3v6 H v3) also qo γί + 3 (v2 H 3v6 + v3) ~ 0,8924 = cos $. 


The (possible smallest) angle is p = 53.6474°, so sin £ = 0.4512. 


3. Determination of the further components of q and the direction of the axis of rotation 
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a= (dy, dy, az)": 
1 1 
T32 — T23 Cae 2) qı 
p= æ 0,2391 SO Ay ο © , 0299 
4qo 4qo sin? 
1 1 
ο να 
Ti3 — T31 2 9 q2 
o = = 5ο 0, 1 S = & 0, 2195, 
qo din do 0, 099 SO dy sin 3 0, 2195, 
1 1 
mn Σίντχνο) " 
g = = = 0,3696 so a, = —2. e 0,8192. 
sir 


4qo 4qo nf 

Remark: At the calculation of the components in (4.1416) it can be a problem when qo is zero or 
close to zero. In this case the unit quaternion can not be determined by the formulas in (4.141e). To 
understand this situation one discusses the trace of the rotation matrix: 


Tr R = ri 722 + r33 = 4q — 1. (4.142a) 


I 
If Tr R > 0, then qo = ον TrR+1> 0, and the formulas (4.141e) can be used without any problem. 


If TrR < 0, then go can be close to zero. In this case the greatest element of the main diagonal is 
considered. Assume, it is r11. Then |gi| is greater than |q2| or |g3|. The components qi, q2, q3 also can 
be determined from the elements of the main diagonal of the rotation matrix. Choosing the positive 
sign for the square-roots follows: 
1 1 1 
n= ον 1 Τη — r22 — T33, ϐ PM l- ro» — Tii — 133, da PM 1+ ras — Τα — Το .(4.142b) 


Calculation rules: From this facts the following calculation rules are derived: 


e If TrR < 0 and ry 2 ro and ry, > 733, then qı has the greatest absolute value, so 


ON 3 uie Ss ban) 
T32 Τρ Bc 21 + E ge (4.142c) 


qo = ; 
? Aqi Aq 4ᾳι 


e If TrR < 0 and rag > r1, and 723 > 733 , then qo has the greatest absolute value, so 


Du m a TF ΠΒ, di T23 t T32 , (4.1424) 


q 
» Aq» " Aq» Aq» 


e If YR < 0 and r33 > rij and r33 > 725, then q3 has the greatest absolute value, so 


" T3 — T12 ᾷ Tài c lua 
0 = |; 02-—. , 
Aqa 498 


T23 + T32 
4q3 


Since the CARDAN-angles define the rotations around the corresponding axes, one can find the assign- 
ments given in the following table. Then the rotation 


Βία, 6, y) = R((0,0, 1)", y)R((0, 1,0)", 8)R((1,0,0)", a) (41428 
corresponds to the unit quaternion 


d= Q. Q, Qr . (4.142g) 


(4.1420) 


qo = 
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rotation Cardan angle | around quaternion | 
Ro((1,0,0)", ac) ac x axis 2 | 
Rc((0, 1,0)", Bc) Bo y axis : | 
τ i : „c = YC 
Rc((0,0, 1)*, yc) ye zaxis | Q, := cos E + ksin 5 


: T 
W Knowing the Cardan angles ag = e 


be determined in the following way: 


Q, = cos > + isin i — cos i + isin τ ; 
bo a Bo T 
Qy = cos 9 Tising cos0 -- jsin0 — 1, 
Yc VG π EU 
z = 008 ks cos — + ks ; 
Q 85 sin -5 ) 8 in 8 


The final result coincides with that given on page 299: 


π 5 τον A T 
Bc = 0, 9c = T the quaternion describing this rotation can 


q:- QQQ c : + ksin 5) 1 (cos i + isin =) 


π 


π Tm 5 T. T 
= cos—-cos +1cos—-sin 
8 12 


8 12 
= 0,8924 + 0, 2391i 4 


0, 0991j + 0, 3696k . 


4.4.2.7 Efficiency of the Algorithms 


doing TE 
rJsmg 


2T 
+ sin 
12 


T 
| ksing * COS 


12 


To estimate the efficiency of the algorithms standard operations are defined from which the more com- 
plex operations are originated. For complicated comparisons with other methods see [4.12]. 


Let 
e M: number of multiplications, 


e A: number of additions and subtractions, 


e D: number of divisions, 


e 5: number of standard functions calls, e.g. trigonometric 
erable number of multiplications, divisions and additions, 
e C: number of comparisons of expressions, which increase 


unctions, which are composed of a consid- 


he computing time by interrupting the al- 


gorithm. 
Operation ΑΙΜΙΡΙΒΙΟ 
Quaternion into Matrix 12 | 12 
Matrix into Quaternion (TrR >0)| 6 |5 |1]|1]|1 
Matrix into Quaternion (IrR <0) | 6 [5 |1|14,3 
Rotation of a vector | A | M Remarks 


with rotation matrix | 6 | 9 


with unit quaternion | 24 | 32 


with unit quaternion | 17 | 24 


fast quaternion multiplication 


with unit quaternion | 18 | 21 


normal quaternion multiplication | 


changing into rotation matrix 
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Rotation of n vectors A M Remarks 
with rotation matrix 6n 9n 
with unit quaternion | 24n 32n | normal quaternion multiplication 
with unit quaternion 17n 24n fast quaternion multiplication 
with unit quaternion | 12+6n | 12+9n changing into rotation matrix 
composition of two rotations | A | M 

with rotation matrix 18 | 27 

with unit quaternion 12 | 16 


Summary: An algorithm based on quaternions is faster only when rotations are performed after each 
other. It occurs mainly in computer graphics at animations, i.e. at approximations of rotations. 


4.4.8 Applications of Quaternions 
4.4.3.1 3D Rotations in Computer Graphics 


To describe motion flows interpolation of rotations are used. Since the 3D-rotations can be represented 
by unit quaternions, algorithms are developed for interpolation of rotations in computer graphics. The 
easiest idea is to start analogously to the definition of linear interpolation in Euclidian-spaces. Basic 
algorithms are Lerp, Slerp and Squad. 
1. Lerp (linear interpolation) 
Let p,q € H; and t € [0, 1], then 

Lerp(p, q, t) = p(1 — t) t qt. (4.143) 
e This is a linear segment in R*, connecting p € Hı ~ S3 C Rf with q € H,~ οὓς R*. 
e This segment is inside of the unit sphere in R4, and does not represent any connecting curve on the 
unit sphere S? ~ H,. 
e Therefore the rotation is determined by normalizing the found quaternion. 
This simple algorithm is almost perfect. The only problem is that after finding the interpolation points 
on the secant between the given points and normalizing the found quaternions, the resulted unit quater- 
nions are not equidistant quaternions. This problem is solved by the following algorithm. 


2. Slerp (Spherical linear interpolation) 
Let p,q € Hi, t € [0, 1] and φ (0 <  « 7) the angle between p and q. Then 


A o| E — t)y) | iy ees 


Slerp(p, q, t) = ρ(ρ9)' = p *q m (4.144) 
sin y sing 


e Interpolation along the great circle on the unit 
sphere 5$? C R?, p and q are connected; 
ο The shortest connection is chosen, —Sc (pq) = 
(p, q) > 0 must hold (where (, ) denotes the dot prod- 
uct of p and q in R?). 
In Fig.4.5 the interpolations according to Lerp(a) 
and Slerp(b) are compared. 

Figure 4.5 


Special case p = 1: Let p = 1 = (1, 0, 0, 0)" and q = cosy + nq sing, then 
Slerp(p, q, t) = cos(t p) + nqsin(t v). (4.145) 
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Special case equidistant grids: Let = a then 
n 


k 1 
qk := Slerp(p, q, —) —(sin(g—kw)p+sin(kw)q), k=0,1,...,n. (4.146) 
n siny 
Interpretation of the Slerp interpolation: To show the equivalence of the two expressions in 
(4.144) first Q = p^!q = p = pq is calculated. Since p,q € H; the scalar part is 
p 
Qo = ScQ = Sc (pq) = (p, q) = cosy. (4.147) 


Since p = p: 1, and q = pp^!q = pQ, the interpolation between 1 and Q is multiplied by p to keep the 
interpolation between p and q. 


sin((1— t sin(ty sin((1 — t) sin(t sin(t, 
Q(t) = 4-99), Θα η. (tv) 
sin sino sing sing sin Y 
. sinycos(tp) — sin(tp) cos p + sin(ty) cos | 
sing ' Q sin p 


_ sin(ty) sin y 
= cos(ty) + Πρ sin(ty) = ege — eE _ Ct. (4.148) 
It follows that 


sin((1 — t)y sin(ty " - 
a(t) = pQ(t) =p ία ^, : ) =pQ' = p(p !qy = pd. (4.149) 
sin y sin Y 
3. Squad (spherical and quadrangle) 
For qi, qi+ı € Hi and t € [0, 1] the rule is 
Squad(q; qi+1; $i, Si+1; t) = Slerp(Slerp(qi, qii; t), Slerp(si, Si+1, t), 2t(1 — t)) (4.150) 
lexan: toate tg... 
ths ese eso ( og (4; E og(q; 4 2 l 


e The resulted curve is similar to a Bézier curve, but it keeps the spherical instead of the linear inter- 
polation. 

e The algorithm produces an interpolation curve for a sequence of quaternions qo, q1, ---, qN- 

ο The expression is not defined in the first and last interval, since g_1 is necessary to calculate so and 
qN44 to calculate sy . A possible way out is to choose sy = go and sy = qn, (or to define g_; and qy +1). 
There are additional algorithms based on quaternions: nlerp, log-lerp, islerp, quaternion de Casteljau- 
splines. 


4.4.3.2 Interpolation by Rotation matrices 
The Slerp-algorithm can be described completely analogously with the help of rotation matrices. The 
logarithm of a 3 x 3 rotation matrix R is needed (i.e. an element of group SO(3)) and it is defined by a 
group-theoretical context as the skew-symmetric matrix r (i.e. an element of the Lie group so(3)), for 
which e" = R. Then the Slerp-algorithm can be used to interpolate between rotation matrices Ro and 
R,,which is described as 

R(t) = Ro(Rj! R4)! = Ro exp(tlog(Rg! R)) . (4.151) 
In general it is more simple to use the quaternions based algorithm and to determine R(t) from q(t) 
according to the calculations of the rotation matrix representing the unit quaternion. 


4.4.3.3 Stereographic Projection 
If 1 € H, ~ S? is taken as the North pole of the three dimensional sphere 5? , then unit quaternions 
or elements of the three dimensional sphere can be mapped by the stereographic projection Hı 3 q > 
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1+q)(1—q)7! € Ho ~ R? into pure quaternions or into IR? respectively. The corresponding inverse 
1 8 
mapping is 


R? ~ Ho > p> (p-1)(p+1) teH ~ S. (4.152) 
4.4.3.4 Satellite navigation 


The orientation of an artificial satellite 
circulating around the Earth is to be de- 
ermined. The fixed stars are consid- 
ered to be infinitely far away, so their 
direction with respect to the Earth and 

c he satellite are identical (see Fig.4.6). 
satellite Any difference in measurements can be 
deduced from the different coordinate 
systems and so from the relative rota- 
Figure 4.6 ion of the coordinate systems. 


fixed coordinate 
system 


i-th fixed star, infinitely 
far away 


Let à; be the unit vector pointing into the direction of the i-th fixed star from in the Earth's fixed 
coordinate system, and b; be the unit vector pointing into the direction of the i-th fixed star in the 
satellite's fixed coordinate system. The relative rotation of both coordinate systems can be described 
by a unit quaternion h € H;: 

b; =hajh. (4.153) 
If more fixed stars are considered, and the data are overlapped by measuring errors, then the solution 
is determined by the least squares method, i.e. as the minimum of (4.154), where A is a unit quaternion 


and a; — à; and b; — b; are unit vectors: 


Q = Y |b;-hà;hP = Y (οι — hà;h)- (b; — hà; h) 
i=1 i=1 


= Y bi- ha;h)(b; haih) ΣΩ b;ha;h— hajhb;). (4.154) 
i-i 


i=1 


Since the group H, of the unit quaternions form a Lie group, the critical points of Q? can be determined 
by the help of derivative 


vh- h 
O,h = lim E ah (v, h quaternions, Ὁ real) (4.155) 
v0 9 
from 
8,Q?! =-S> (bivha,h+b, ha, (vh) +vha,hb,+ haj(vh)b,) =0. (4.156) 
i=1 
Here v , b; and ha, h are pure quaternions, so V = —v, and therefore (4.156) can be simplified: 
oR? = —4v - B haihx bi) =0. (4.157) 
i-l 
Since here v is arbitrary, this expression vanishes if 
Y hühxb,-0. (4.158) 
i=1 


Let R be the rotation matrix represented by the unit quaternion A, i.e. hà;h = Ra;. With the 3 x 3 
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matrix 


0 —a; αν 
K(a) = ( a, 0 :3 (4.159) 


—üy üy Ὁ 

defined by vector & = (ax, ay, az) € IR? for any vector b € R? one gets: 

K(a)b=axb,  K(K(àb)-baT -—àbT. (4.160) 
From this relation the critical points of the minimum problems are determined: 

Y K(Raàxb;-0 e Σια Κ᾽ -Rā BI) - 0 e RP - PTRT (4.161) 

i=l i=1 
where P = 57748; bT . If P is decomposed into the product P = RIS, where matrix S is symmetric 
and P = R, is a rotation matrix, then from (4.161) follows 

RR;S = SRR”, (4.162) 
and 

R = R, (4.163) 
is obviously a solution, since in this case R RFS = S = SR,RT, because R;RT = E. However there 
are three other solutions, namely 

R-RjR, (j= 1,2,3), (4.164) 
where R; denotes the rotation by 7 around the j-th eigenvector of S , i.e., there is RjSR; = S. That 
R = RR, is a solution of (4.162) which can be seen from R;R;RIS = SR;RIRT --» R,S = 
SR? <> EjSR;- S. 
The solution for which Q? is minimal is 

R= Rp, falls detP>0, 

~ LR Rp, falls detP <0, 

where R, is the rotation by 7 around the eigenvector of S associated with the eigenvalue of the smallest 
absolute value. 
4.4.3.5 Vector Analysis 


If the V operator (see (13.67), 13.2.6.1, p. 715) and a vector V (see 13.1.3, p. 704) are identified with 
VQ and v in quaternion calculus, i.e. 


(4.165) 


o ὃ 
=i +j- +k. 4.1 
Va E dmt ὅσ᾽ (2166 
v(z, y, 2) = vi (z, y, 2)i + vole, y, 2)j + valz, y, 2) (4.167 


with i,j,k (according to (4.107), p. 290), then the multiplication rule for quaternions (see (4.109b), 
p. 291) gives 


Voy =-V-V¥4+V x V — —div V + rot V, (4.168 
(see also Bl in 4.4.1.1, 4. p. 291). 
Substituting 
ὃ ὃ ὃ ὃ 
D Fi Hj tk and 4.169a 
δὲ ür Fay “ae 7 ES 
w(t, x, y, z) = wolt, v, y, z) + wilt, z, y, zji + wa(t, x, y, 2)} + w(t, x, y, z)k 


= wo(t, v, y, z) - w(t, v, y, z), (4.169b 
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then 
ὃ : 

Da - ài wo — div w + rot w + grad wo. (4.169c 

Especially, for an arbitrary twice continuously differentiable function f(t, v, y, 2) 
"-— € Pf Op Pf 
VaVa f VaVa f VV f ET T By? T ER As f und (4.170a 
93 o? 93 
VoVof =-vv £o - LT LT T Lag, (4.170b 


δα; Oy Oz? 
where A; denotes the Laplace operator in IR? (see (13.75) in 13.2.6.5, p. 716). 
of ο of 9f 
δ O2 Op o2 
where A, denotes the Laplace-operator in Β΄. Va, just as D is often called the Dirac- or Cauchy- 
Riemann operator. 
4.4.3.6 Normalized Quaternions and Rigid Body Motion 


1. Biquaternions 


DDf =DDf Auf (4.170c 


A biquaternion À has the form 

h=hot+teh, with ho, hy € H (4.171) 
Here ε is the dual unit, that commutes with every quaternion, furthermore εὖ = 0. The multiplication 
is the usual quaternion multiplication (see 4.115, p. 292). 
2. Rigid Body Motion 


By the help of unit biquaternions, i.e. biquaternions with 


YI E [m ho ho = 1 , 
hh — (h chy )(h hy) =1 : 4.172 
th = (ho + ehi)(ho + ehi) unis (4.172) 
rigid-body motion (rotation and translation after each other) can be described in IR?. 
Table 4.1 Rigid body motions with biquaternions 
Element Representation by 
point p = (pr, Py, p.) in space Ď= 1+ pe with p = pri + pyj + p-k 
rotations r € H; unit quaternions 
. 1 : F 3 
translations t = (tz, ty, tz) 1+ gee with t =t,i+tyj+t.k 
The unit biquaternions 
m 1 1 
h=ho+me= (1 | ste) r7 7 gine tcHo, rcH;, (4.173) 


give a double covering over the group SE(3) of the rigid-body motion in R? since and —h describe 
the same rigid-body motion. 
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4.5 Systems of Linear Equations 


4.5.1 Linear Systems, Pivoting 
4.5.1.1 Linear Systems 


A linear system contains m linear forms 
Y= 41%, T Gi232 res T Aintn T ᾱι 
ο = 19121 + 2239 +++: + AanXn + 090 
V 2131 22:02 2nUn or y=Ax+a (4.1742) 
Um = Ami 1 2:02 uo Amntn T Gm 
with 
a1 Q12 777 Qin αι, Ti, Yi, 
Q21 Q22 *** Gan a2, T2, yz; 
A= , a= > X=] 0.1, y=]. |. (4.174b) 
Am1 Am2 '** Amn Am Tn Ym 
The elements aj, of the matrix A of size (m, n) and the components a, (u = 1,2, . . . , m) of the column 
vector a are constants. The components x, (m = 1,2,...,n) of the column vector x are the independent 


variables, the components y, (u = 1,2,..., m) of the column vector y are the dependent variables. 


4.5.1.2 Pivoting 


1. Scheme of Pivoting 

If an element a; is not equal to zero in (4.1742), the variable y; can be exchanged for an independent one 
and the variable x; for a dependent one in a so-called pivoting step. The pivoting step is the basic ele- 
ment of pivoting, the method by which for instance systems of linear equations and linear optimization 
problems can be solved. The pivoting step is achieved by the schemes 


zi 20 5 Tho Zn d 
Yı |ü Q12 +++ Qik coo Gin 01 |21 Ma Yi ty 1 
y DRE α a . a Yi) O11 O1: 01k Qin αι 
U2|d21 Q22 *'** Agk *** Am Q2 

Y2|Q21 Q22 3 Qr An 40 

ο ο οκ Tbe EP 4.175) 
Yi | Qil Gig ++ | Qik Qin Qi’ i ( 
‘ Tk| Qil Q2 ''' Aik t7 Qin Qi 
πο eae oe ο ου... Mens 
5 a : Ym|Am1 Am2 *** Amk *** Amn Am 
Tk | Qil Aig tt Qik '' Qin Aj 


where the left scheme corresponds to the system (4.174a). 
2. Pivoting Rules 


The framed element in the scheme aj; (aj, # 0) is called the pivot element; it is at the intersection of 
the pivot column and pivot row. The elements αμν and αμ of the new scheme on the right-hand side 
will be calculated by the following pivoting rules: 


1 Auk F 
1. a, =—, (4.176a) 32. o = ue (u —1,...,mpz i), (4.176b) 
Qik Qik 
Qiy αι 
3. Ay = . Q= (v Ξ 2 ος UFR), (4.176c) 
Qik Qik 
Qiy 


4. αμν = ἂμν — Apk = Quv Ἔαμκαιν, Qp αμ + αμκαι 


Qik 


(for every u Ai and every v Z k). — (4.176d) 
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To make the calculations easier (rule 4) one writes the elements aj, in the (m 4- 1)-th row of the pivoting 
scheme (cellar row). With this pivoting rule one can change further variables. 


4.5.1.3 Linear Dependence 


The linear forms (4.174a) are linearly independent (see 9.1.2.3, 2., p. 553), if all y,, can be changed for 
an independent variable x, . The linear independence will be used, for instance to determine the rank 
of a matrix. Otherwise, the dependence relation can be found directly from the scheme. 


yo T2 Yı Ya 1 

After three pivoting steps T3 8 ^3 e a 6 

(for instance y4 > c4, 3| 1 100-2 
Yo — Tı, yi — T3) the 

ἬΝ , i το. 

8: z4 0-201 0 


No further change is possible because a32 = 0, and one can see the dependence relation y3 = 2yi — 
3y2 + 10. Also for another sequence of pivoting, there remains one pair of not exchangeable variables. 


4.5.1.4 Calculation of the Inverse of a Matrix 
If A is a regular matrix of size (n, n), then the inverse matrix A^! can be obtained after n steps using 
the pivoting procedure for the system y — Ax. 
351 [αι T2 $3 [να πο z3 | y3 v2 Ye | vs muy 
: γη 3 51 γι 3-1-5 4|13|-1|-5 αι] 18-1 -5 
WA-|245 , "E ; , 
(255) — $245" yw|2 0o Π]’ απο 0 1’ zm -2 0 1 
ysl 1| 22 πι 1—2 —2 a) 5 -2-32 x,|/-21 2 8 


2 8-21 
After rearranging the elements one gets A^! = ( -1-5 13 | . (The columns are to be arranged with 
0 1 -2 


respect to the indices of y;, the rows with respect to the indices of ας.) 


4.5.2 Solution of Systems of Linear Equations 
4.5.2.1 Definition and Solvability 


1. System of Linear Equations 


A system of m linear equations with n unknowns z1, 12, ... En 
α1ι2ι a1212 + F ainn = ay 
09111 25332 + + AgnXn ay . 
: or briefly Ακ --α, (4.177a) 
Ami C1 Am2%2 + +++ + mna = Am 


is called a linear equation system. Here the following designations are used: 


Q11 Q12 ''' Gin αι, Ti, 
ü21 022 *** Gan a2; T2, 

ΑΞΙ. p a=] o], x= |" ]. (4.177b) 
Am1 Am2 *** Amn Am Yn 


If the column vector a is the zero vector (a = 0), then the system of equations is called a homogeneous 
system, otherwise (a # 0) it is called an inhomogeneous system of equations. The coefficients ay of the 
system are the elements of the so-called matrix of coefficients A, and the components αμ of the column 
vector a are the constant terms (absolute terms). 
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2. Solvability of a Linear System of Equations 
A linear system of equations is called solvable or consistent or compatible if it has a solution, i.e., there 
exists at least one vector x = α such that (4.177a) is an identity. Otherwise, it is called inconsistent. 
The existence and uniqueness of the solution depend on the rank of the augmented matriz (A, a). One 
gets the augmented matrix by attaching the vector a to the matrix A as its (n 4- 1)-th column. 
1. General Rules for Inhomogeneous Linear Systems of Equations An inhomogeneous linear 
system of equations Ax — a has at least one solution if 

rank (A) = rank (A, a), (4.1782) 
is valid. Furthermore, if r denotes the rank of A, i.e., r = rank (A), then 

a) for r = n the system has a unique solution, (4.178b) 


b) for r < n the system has infinitely many solutions, (4.178c) 


i.e., the values of n — r unknowns as parameters can be chosen freely. 


ΒΑ: 

21 — 211 + 323 v4 + 225 = 2 The rank of A is 2, the rank of the augmented matrix of co- 
331 22 + 5x3 — 3x4 15 = 6 efficients (A, a) is 3, i.e., the system is inconsistent. 
2a, + T2 + 2x3 — 2134 — 325 = 8 


Both the matrices A and (A, a) have rank equal to 3. Be- 


: cause r = n = 3 the system has a unique solution: xı = 
m= 3 κ... .. 2 
>» πο zm 23 zs 
—2x, + 223 + 3x3 = —4 T tf T 
Both the matrices A and (A, a) have rank equal to 2. The 
HC: system is consistent but because r « m it does not have 
πι — t2 + Z3 — r= 1 a unique solution. Therefore n — r = 2 unknowns can 
xı — t2 — e+ t= 0 be considered as free parameters: 2) = cz, — b, t3 = 
πι — 20 — 2x3 + 214 = -ᾱ 1 : 
tat 2' (σι, 21 arbitrary values). 
HD: 

21 + 221 24 Sa cmd 'There is the same number of equations as unknowns but 
2r, — T2 2x3 214 = 2 the system has no solution because rank(A) — 2, and 
30, + 20 + T3 + 344 =3 rank (A , a) = 3. 

71 — 320 + 3843 + x4 —0 


2. Trivial Solution and Fundamental System of Homogeneous Systems 


a) The homogeneous system of equations Ax — 0 always has a solution, the so-called trivial solution 


C= Soi = By = 0. (4.179a) 
(The equality rank (A) = rank (A , 0) always holds.) 
b) If the homogeneous system has the non-trivial solutions @ = (a1,05,...,05) and B. = (Bi, B», . . ., 


Bn), ie., œ AQand 8 Z 0,then x — s o +t β is also a solution with arbitrary constants s and t, 
i.e., any linear combination of the solutions is a solution as well. 

Suppose, the system has exactly / non-trivial linearly independent solutions αι, Q2,..., αι. Then these 
solutions form a so-called fundamental system of solutions (see 9.1.2.3, 2., p. 553), and the general 
solution of the homogeneous system of equations has the form 


x = kya, + ἆ2αο ^ + kia (Ki, ko,..., ky arbitrary constants). (4.179b) 
If the rank r of the coefficient matrix A of the homogeneous system of equations is less than the number 
of unknowns n, i.e., 7 < n , then the system of equations has | = n — r linearly independent non-trivial 
solutions. If r = n, then the solution is unique, i.e., the homogeneous system has only the trivial 
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solution. 


To determine a fundamental system in the case r < n one chooses n — r unknowns as free parameters, 
and expresses the remaining unknowns in terms of them. If reordering the equations and the unknowns 
so that the subdeterminant of order r in the left upper corner is not equal to zero, then one gets for 
instance: 


21 = Pi (Lyi, Lrt2,---,Ln) 
X9 = Lo(Lpr41, Try yin) (4.180) 
Tr = © (rq, Try. αμ). 


Then one can get a fundamental system of solutions choosing the free parameters, for instance in the 
following way: 


Uri Tr+2 Tr43 C^ Ln 
1. fundamental solution: PQ) des 
2. fundamental solution: 0 1 0 «0. (4.181) 
(n —r)-th fundamental solution: 0 0 0 >> 1 
ΒΕ: The rank of the matrix A is equal to 2. The system can be 
x ο + ὅσ: t= 0 ] : 
- I τ | 2n I Bra =0 solved for xı and 2» resulting to: x, = 373 T4, 20 = 
3a t+ 8933 + πᾳΞ 0 . . 
zı 32$ — 9x3 + 724 = 0 25 - 2x4 (x3, 4 arbitrary). Fundamental solutions are œ= 
3 7 
Czy hb 0)? and ag = (—1, —2,0, 1)1. 


4.5.2.2 Application of Pivoting 


1. System of Linear Functions Corresponding to a Linear System of Equations 
In order to solve (4.177a), a system of linear functions y = Ax—ais assigned to the system of equations 
Ax = aso the use of pivoting (see 4.5.1.2, p. 307) is possible: 


Ax=a (4.182a) is equivalent to y-Ax-a-0. (4.182b) 


The matrix A is of size (m, n), a is a column vector with m components, i.e., the number of equations 
m must not be equal to the number of unknowns n. After finishing the pivoting one substitutes y = 0. 
'The existence and uniqueness of the solution of Ax — a can be seen directly from the last pivoting 
scheme. 

2. Solvability of Linear Systems of Equations 

The linear system of equations (4.182a) has a solution if one of the following two cases holds for the 
corresponding linear functions (4.182b): 

Case 1: All y, (u = 1,2,..., m) can be exchanged for some z,. This means the corresponding system 
of linear functions is linearly independent. 

Case 2: At least one y, cannot be exchanged for any xy, i.e., 

Yo = Ati + Aaya o Amm + Ao (4.183) 
holds and also Ag = 0. This means the corresponding system of linear functions is linearly dependent. 
3. Inconsistency of Linear Systems of Equations 
The linear system of equations has no solution if in case 2 above Ao Æ 0 holds. In this case the system 
has contradictory equations. 
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πι — 2353 +403 — x4— 2 

821 + 3239 — 323 + day = 3 | y Ye tz ys 1 

2x, — 9322 + 521 — 324 1 After three pivoting steps zi 3 g 3 2 2 1 

m" m i m d (for instance y — αι, 11 1 

| zi t2 x4 m4 Ys — z4, y2 — x2) fol- za -4 -4 3-4 -2 
y| 1-2. 4-1-2 lows: 3 1 3 
y2|-3 3-3 4-3 t| στο 0-35 3 
γαι 2-3 5-3 1 


This calculation ends with case 1: yi, ya, y3 and 23 are independent variables. Substituting y; = y2 = 
yz = 0, and z4 = t (—oo < t < oo isa parameter) consequently, the solution is: x4 = 2t+ 1, ο = 3t— 2, 
t= t, 24 = ὃν 


4.5.2.3 Cramer’s Rule 


There is the very important special case when the number of equations is equal to the number of un- 
knowns 


α1121 + 1215 4 F ainn = 04 
a21%1 + d2535 d F Amtn = a2 

(4.184a) 
Ani X1 + Qg2X2 ++++ + Anntn = An 


and the determinant of the coefficients does not vanish, i.e., 
D=detA Z0. (4.184b) 


In this case the unique solution of the system of equations (4.184a) can be given in an explicit and 
unique form: 


=p Ip o Map: (4.184c) 
D, denotes the determinant, which is obtained from D by replacing the elements a,, of the v-th column 
of D by the constant terms αμ. for instance 


Q11 Q1 Q13 *** Gin 
Q21 02 123 *** Gan 

Dej .. e . (4.184d) 
Ani Qn An3 ** * Ann 


If D = 0 and there is at least one D, 7 0, then the system (4.184a) has no solution. 
In the case D — 0 and D, — 0 for all v — 1,2,...,n, then it is possible that the system has a solution 
but it is not unique. (see Remark p. 311). 


221 To Lo + 323 = 9 2 13 
B τι — 2% + 23 = —-2 D=|1-21)=18, 
32, + 229 + 223 T 3 22 
9 13 2 93 219 
Ρι-|-2-21 13, Ds 1-21|-326, D3 = |1 —2 —2| = 39. 
T..2:2 3 72 3 2 7 
. : D D: D: 
The system has the unique solution x; - 1,20 5 2, 3 = 3. 


Remark: From practical consideration the Cramer rule is not useful for higher-dimensional problems. 
As the dimension of the problem increases, the number of required operations increases very fast, so, 
for numerical solutions of linear systems of equations one uses the Gauss algorithm or pivoting or an 
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iteration procedure (see 19.1.1, p. 949). 
4.5.2.4 Gauss's Algorithm 


1. Gauss Elimination Method In order to solve the linear system of equations Ax = a (4.177a) of 
m equations with n unknowns one can use the Gauss elimination method. With the help of an equation 
one unknown is to be eliminated from all the other equations. So one gets a system of m — 1 equations 
and n — 1 unknowns. This method will be repeated until the result is a system of equations in row 
echelon form, and from this form one can determine the existence and uniqueness of the solution easily, 
and the solution itself can be found if it exists. 

2. GaussSteps The first Gauss step is to be demonstrated on the augmented matrix of coefficients 
(A, a) (see examples on p. 309): 

Supposing a1; Æ 0, otherwise exchanging the first equation for another one. In the matrix 


Qi Q12 *** Ain | a1 

Q21 22 *** Aon | A2 (4 185 ) 
. a 

Om1 Am2 ** * Amn|Am 


the appropriate multiple of the first row is to be added to the others in order to make the coefficients of 


. l d9| α a 
xı equal to zero, i.e., multiply the first row by — —-, — —, ..., — — then add them to the second, 


αιι au ai 
third,..., m-th row. The transformed matrix has the form 


Q11 012 Qin | Q1 
0 αὐ» -- a, a5 
22 2n 2 
: (4.185b) 
/ i 1 
0 πιο ttt Oy |, 
After applying this Gauss step (r — 1) times the result is a matrix in row echelon form 
G»rxl o e Qin αι 
I i / 
Q3rpi >- Gan 95 
HH LA HH 
βγει > An a3 
(r—1) (r—1) (r—1) (4.186) 
Arri xev ὃν ay 
(r-1) 
0 TT 0 [m 
U^ dt uus ue .Ὁ 0 uu 0 av) 


3. Existence and Uniqueness of the Solution The Gauss steps are elementary row operations 
so they do not affect the rank of the matrix (A, a), consequently the existence and uniqueness of the 
solution and the solution itself do not change. Formula (4.186) implies that the following cases may 
occure concerning the solutions of the inhomogeneous linear system of equations: 


-- -- Ey ids 

Case 1: The system has no solution if any of the numbers ati ) ats S a’-) differs from zero. 
ου. ee -1 = Ρ « 

Case 2: The system has a solution if at 1 ) ata ya, a’) = 0 is valid. Then there are two 


Cases: 


a) r =n: The solution is unique. 
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b) r < n: The solution is not unique; n — r unknowns can be chosen as free parameters. 


If the system has a solution, then the unknowns are to be determined in a successive way starting with 
the last row of the system of equations with the matrix in row echelon form (4.186). 


W A: + 210 + 3x3 + 411 2 1 2 3 4| -2 
Pus | Ts | P fy = After three Gauss steps | 0—1 = -7 6 
dp f Ai f x f do. — 9 the augmented matrix of 0—4 4| —4 
.. u i 224 3 -— coefficients has the form 0 0 0 40—40 


The solution is unique and from the corresponding system of equations with a triangular matrix follows: 
T4 l,34 0,4 τν αν — 0. 


HB: Ti 309 1223 = 5 —] —3 -12|-5 
αι + 2x3 + ὅσα 2 After two Gauss steps the 0. 5 17 7 
5x2 + l7r3 = 7 augmented matrix of coeffi- 0.0 0/0 
3x; — T2 + 2x3 = 1 cients has the form 00 0 0 
Τπι — 421 -- 23= 0 0.0 ο 0 
There is a solution but it is not unique. Choosing one unknown as a free parameter, for instance x3 = t 
Ta XT 4 9 
(—oo < t < oo), and one gets 3 = t, 3 = 2 — —t, 11 = 2 — zt. 
5 5 5 65 


4.5.3 Overdetermined Linear Systems of Equations 


4.5.3.1 Overdetermined Linear Systems of Equations and Linear 
Least Squares Problems 


1. Overdetermined System of Equations 
Consider the linear system of equations 


Ax=b (4.187) 
with the rectangular matrix of coefficients A = (aij) (i = 1,2,...,m; j = L2,...,n; m > n). 
The matrix A and the vector b = (bi, 05, . .., b,,)* on the right-hand side are given, and the vector 
X = (@1,%2,...,2,)" is unknown. Because m > n holds this system is called an over-determined 


system. One can tell the existence and uniqueness of the solution and sometimes also the solution, for 
instance by pivoting. 

2. Linear Least Squares Problem 

If (4.187) is the mathematical model representing a practical problem (i.e., A, b, x are reals), then 
because of measuring or other errors it is impossible to find an exact solution of (4.187) such that it sat- 


isfies all of the equations. Substituting any vector x there will be a residual vector £r = (ri,T2,...,T,,)7 
given as 
r—-Ax-b, τ:0. (4.188) 


In this case x is to be determined to make the norm of the residual vector r as small as possible. Suppose 
now A, b, x are real. If considering the Euclidean norm, then 


m 


Sor? = r'r = (Ax — b)'(Ax- b) = min (4.189) 
i=1 


must be valid, i.e., the residual sum of squares must be minimal. Gauss already had this idea. The 
formula (4.189) is called a linear least squares problem. The norm ||r|| = v/rTr of the residual vector r 
is called the residue. 

3. Gauss Transformation 

The vector x is the solution of (4.189) if the residual vector r is orthogonal to every column of A. That 
is: 


ATr = AT(Ax —b) - 0 or ATAx = ΑΤΡ. (4.190) 
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Equation (4.190) is actually a linear system of equations with a square matrix of coefficients. One refers 
to it as the system of normal equations. It has dimension n. The transition from (4.187) to (4.190) is 
called Gauss transformation. The matrix AT A is symmetric. 

If the matrix A has the rank n (because m > n all columns of A are independent), then the matrix 
AT A is positive definite and also regular, i.e., the system of normal equations has a unique solution if 
the rank of A is equal to the number of unknowns. 


4.5.3.2 Suggestions for Numerical Solutions of Least Squares Problems 
1. Cholesky Method 


Because the matrix AT A is symmetric and positive definite in the case rank (A) = n, in order to solve 
the normal system of equations one can use the Cholesky method (see 19.2.1.2, p. 958). Unfortunately 
this algorithm is numerically fairly unstable although it works fairly well in the cases of a “big” residue 
||r|| and a “small” solution ||x||. 

2. Householder Method 

Numerically useful procedures in order to solve the least squares problem are the orthogonalization 
methods which are based on the decomposition A = QR. Especially useful is the Householder method, 
where Q is an orthogonal matrix of size (m, m) and R is a triangular matrix of size (m,n) (see 4.1.2, 
11. p. 271). 

3. Regularized Problem 

In the case of rank deficiency, i.e., if rank (A) < n holds, then the normal system of equations no longer 
has a unique solution, and the orthogonalization method gives useless results. Then instead of (4.189) 
the so-called regularized problem is considered: 


rlr + ax'x = min! (4.191) 
Here a > 0 is a regularization parameter. The normal equations for (4.191) are: 
(ATA + aI)x = ΑΤΡ. (4.192) 


The matrix of coefficients of this linear system of equations is positive definite and regular for a > 0, 
but the appropriate choice of the regularization parameter a is a difficult problem (see [4.7]). 


4.6 Eigenvalue Problems for Matrices 


4.6.1 General Eigenvalue Problem 

Let A and B be two square matrices of size (n, n). Their elements can be real or complex numbers. 
The general eigenvalue problem is to determine the numbers λ and the corresponding vectors x Z 0 
satisfying the equation 

Ax = \Bx. (4.193) 
The number λ is called an eigenvalue, the vector x an eigenvector corresponding to A. An eigenvector 
is determined up to a constant factor, because if x is an eigenvector corresponding to A, so is cx (c = 
constant) as well. In the special case when B — I holds, where I is the unit matrix of order n, i.e., 

Ax = \x or (A — AI)x = 0, (4.194) 
the problem is called the special eigenvalue problem. It occures very often in practical problems, espe- 
cially with a symmetric matrix A, and so it is to be discussed later in detail. More information about 
the general eigenvalue problem can be found in the literature (see [4.16]). 
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4.6.2 Special Eigenvalue Problem 
4.6.2.1 Characteristic Polynomial 


The eigenvalue equation (4.194) yields a homogeneous system of equations which has non-trivial solu- 
tions x Z 0 only if 


det (A — AT) = 0. (4.195a) 
By the expansion of det (A — AI) one gets 


ay, — À a12 ü13-* "dig 
det (A — AT) = ba n =À a23 -> ἄρῃ 

2n dng an3 ae ünn — 

Ρι(λ) = (21) A* $+ yA" + + ad + ag = 0. (4.195b) 


So the determination of the eigenvalues is equivalent to the solution of a polynomial equation. This 
equation is called the characteristic equation; the polynomial P,,(A) is the characteristic polynomial. Its 
roots are the eigenvalues of the matrix A. For an arbitrary square matrix A of size (n,n) the following 
statements hold: 

Case 1: The matrix A, has exactly n eigenvalues Αι, A2, ..., An, because a polynomial of degree n 
has n roots if they are considered with their multiplicity. The eigenvalues of a real symmetric matrix 
are real numbers, in other cases the eigenvalues can also be complex. 

Case 2: If all the n eigenvalues are different, then the matrix Αμ) has exactly n linearly independent 
eigenvectors x; as the solutions of the equation system (4.194) with \ = A;. 

Case 3: If A; has multiplicity n; among the eigenvalues, and the rank of the matrix Ai») — Alis 
equal to r; , then the number of linearly independent eigenvectors corresponding to A; is equal to the 
so-called nullity n — r; of the matrix of coefficients. The inequality 1 € n — r; € n; holds, i.e., fora 
real or complex quadratic matrix Απ) there are at least one and at most n real or complex linearly 
independent eigenvectors. 


2-3 1 2—A -3 1 
W A: 3 1 3], det(A-AI) = 31—A 3 ==) —M +9) = 0. 
-5 2-4 -5 2—4—A 
The eigenvalues are A, = 0, 4 = 1, 4 = —2. The eigenvectors are determined from the corresponding 


homogeneous linear system of equations. 
ολι- 0: 2a, — 3:9 + 23 =0 
8321 + 2+ 323 =0 
—5a, + 21ο — 421 = 0. 


, E ; 11 . 
One gets for instance by pivoting: xı arbitrary, 20 = τοΐ 25 = -21ι + 8311 = T07 Choosing 
10 
x, = 10 the eigenvector is xy = C1 3 |, where ΟἹ ¥ 0 is an arbitrary constant. 
—11 
e λ. = 1: The corresponding homogeneous system yields: «3 is arbitrary, 20 = 0,4 = 3£2— £3 Z3. 
—1 
Choosing «3 = 1 the eigenvector is Χρ = C5 ( o) , where C5 Æ 0 is an arbitrary constant. 
m 1 
. f . . 4 
e \3 = —2: The corresponding homogeneous system yields: x2 is arbitrary, 21 = πη, r3 = —4ri + 


3 
7 4 
352 = -372 Choosing x2 = 3 the eigenvector is Χα = C3 3 |, where C3 z 0 is an arbitrary 
-T 
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constant. 
30-1 3—A 0 -ι 
.. 14 Jl det (A — AI) = 14—A 1| = —233-F102? — 32A + 32 = 0. 
—10 3 -1 08 -λ 
The eigenvalues are 44 = 2, Ay = A3 = 4. 
e A, = 2: One obtains xz is arbitrary, 20 = --2ῃ, xı = x3 and chooses for instance 73 = 1. So the 


1 
corresponding eigenvector is xı = Ci ( —1 |, where C, ¥ 0 is an arbitrary constant. 


e Ay = λα = 4: One obtains 2ο, 23 are arbitrary, xı = —x3. There are two linearly independent 


0 -1 
eigenvectors, e.g., for 20 = 1,2} = 0 and a = 0, x3 = 1: x2 = Cy (1). X3 = C3 í η where 
0 1 
Cə # 0, C3 Z 0 are arbitrary constants. 


4.6.2.2 Real Symmetric Matrices, Similarity Transformations 


In the case of the special eigenvalue problem (4.194) for a real symmetric matrix A the following state- 
ments hold: 


1. Properties Concerning the Eigenvalue Problem 
1. Number of Eigenvalues The matrix A has exactly n real eigenvalues A; (i = 1,2,..., n), count- 
ing them by their multiplicity. 


2. Orthogonality ofthe Eigenvectors The eigenvectors x; and x; corresponding to different eigen- 
values A; Z A; are orthogonal to each other, i.e., for the scalar product of x; and x; 


x? x, = (xx) =0 (4.196) 
is valid. 
3. Matrix with an Eigenvalue of Multiplicity p For an eigenvalue which has multiplicity p (A = 
Ay = Ag =... = Ap), there exist p linearly independent eigenvectors x,,X5,...,X,. Because of (4.194) 
all the non-trivial linear combinations of them are also eigenvectors corresponding to A. Using the 
Gram-Schmidt orthogonalization process one can choose p of them such that they are orthogonal to 
each other. 
Summarizing: The matrix A has exactly n real orthogonal eigenvectors. 


110 


e À, = às = —1: From the corresponding homogenous system of equations one gets: 21 is arbitrary, 
29 is arbitrary, 73 = —x, — 20. Choosing first xı = 1, x9 = 0 then x, = 0, £2 = 1 one gets the linearly 


011 
E A= k 0 ] , det (A — AT) A? 3A +2 = 0. The eigenvalues are λι = Ay = —1 and λα = 2. 


-1 


0 
Cy ( 1 , where C; Æ 0 and Cs ¥ 0 are arbitrary 
-1 


1 
independent eigenvectors x = Οι ( o) and Χο 


constants. 

e À; = 2: One gets: x, is arbitrary, 22 = 2, 73 = xı, and choosing for instance xı = 1 one gets the 
1 

eigenvector X4 = C3 | 1 |, where C5 Æ 0 is an arbitrary constant. The matrix A is symmetric, so the 


eigenvectors corresponding to different eigenvalues are orthogonal. 
4. Gram-Schmidt Orthogonalization Process Let V, be an arbitrary n-dimensional Euclidean 
vector space. Let the vectors X,,X5,...,X, € V, be linearly independent. Then there exists an or- 
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thogonal system of vectors YuyueciY,€ V, which can be obtained by the vectors x; as follows: 
E (xi, y.) 
y,—XnY,—Xk- Σ, y;  (k-2,3,..,n). (4.197) 
i—l (Y y,) 
Remarks: 
1. Here (x,, y,) =x, y, is the scalar product of the vectors x, and y ,. 
2. Corresponding to the orthogonal system of the vectors y,,y,,...,y,, one gets the orthonormal 
ar τ Y: n Y, - y 
system X,,X5,...,X, with X, = =|, & = ——23.,..., €, = π-π, where |ly || = ,/(y,, y.) is 
r lly ll lly, MIA = cnt 


the Euclidean norm of the vector y ;. 


ES ( 2) : ( ] 
;¥,=X2 == and X, = —=| -1 |: 
` CHEAR 1/2 ΠΝ 


2. Transformation of Principal Axes, Similarity Transformation 
For every real symmetric matrix A, there is an orthogonal matrix U and a diagonal matrix D such that 
A = UDU". (4.198) 
The diagonal elements of D are the eigenvalues of A, and the columns of U are the corresponding 
normed eigenvectors. From (4.198) it is obvious that 
D = UTAU. (4.199) 
Transformation (4.199) is called the transformation of principal axes. In this way A is reduced to a 
diagonal matrix (see also 4.1.2, 2., p. 270). 
If the square matrix A (not necessarily symmetric) is transformed by a square regular matrix G such 
a way that 


GAG- (4.200) 


then it is called a similarity transformation. The matrices A and A are called similar and they have 
1e following properties: 


τε 


1. The matrices A and A have the same eigenvalues, i.e., the similarity transformation does not affect 
the eigenvalues. 


2. If A is symmetric and G is orthogonal, then Ais symmetric, too: 

A=GTAG with G'™G=I (4.201) 
The relation (4.201) is called an orthogonal-similarity transformation. In this context (4.199) means 
that a real symmetric matrix A can be transformed orthogonally similar to a real diagonal form D. 
4.6.2.3 Transformation of Principal Axes of Quadratic Forms 
1. Real Quadratic Form, Definition 
A real quadratic form Q of variables £1, 13,..., £n has the form 


n n 


Q=} Y agziz; =x" Ax, (4.202) 


i=1 j=1 
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where x = (#1, 22,...,2n)* is the vector of real variables and the matrix A = (a;;) is a real symmetric 
matrix. 

The form Q is called positive definite or negative definite, if it takes only positive or only negative values 
respectively, and it takes the zero value only in the case x; = r9 =... = £n = 0. 


'The form Q is called positive or negative semidefinite, if it takes non-zero values only with the sign 
according to its name, but it can take the zero value for non-zero vectors, too. 
A real quadratic form is called indefinite if it takes both positive and negative values. According to the 
behavior of Q the associated real symmetric matrix A is called positive or negative definite, semidefinite 
or indefinite. 
2. Real Positive Definite Quadratic Form, Properties 
1. In a real positive definite quadratic form Q all elements of the main diagonal of the corresponding 
real symmetric matrix A are positive, i.e., 

aij ^0 (¢=1,2,...,n) (4.203) 
holds. (4.203) represents a very important property of positive definite matrices. 
2. A real quadratic form Q is positive definite if and only if all eigenvalues of the corresponding matrix 
A are positive. 
3. Suppose the rank of the matrix A corresponding to the real quadratic form Q = x! Ax is equal to 
r. Then the quadratic form can be transformed by a linear transformation 

x = Cx (4.204) 


into a sum of pure quadratic terms, into the so-called normal form 


Q-X'Kx-Y pij (4.205) 
i—l 
where p; = (sign A;)k; and ki, k2, ...,k, are arbitrary, previously given, positive constants. 


Remark: Regardless of the non-singular transformation (4.204) that transforms the real quadratic 
form of rank r into the normal form (4.205), the number p of positive coefficients and the number 
q =r — p of negative coefficients among the p; of the normal form are invariant (the inertia theorem of 
Sylvester). The value p is called the index of inertia of the quadratic form. 

3. Generation of the Normal Form 


A practical method to use the transformation (4.205) follows from the transformation of principal axes 
(4.199). First it is to perform a rotation on the coordinate system by the orthogonal matrix U, whose 
columns are the eigenvectors of A (i.e., the directions of the axes of the new coordinate system are the 
directions of the eigenvectors). This gives the form 


Q= Ik=} Adj. (4.206) 


Here L is a diagonal matrix with the eigenvalues of A in the diagonal. Then a dilatation is performed 


by the diagonal matrix D whose diagonal elements are d; = Dir The whole transformation is now 
given by the matrix 

C=UD, (4.207) 
and one gets: 

Q = x’ Ax = (UDX)'A(U DX) = x" (D™UTA UD)& 

= x" DL Dx = x" Kx. (4.208) 

Remark: The transformation of principal axes of quadratic forms plays an essential role at the classi- 
fication of curves and surfaces of second order (see 3.5.2.11, p. 206 and 3.5.3.14, p. 228). 
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4. Jordan Normal Form 
Let A be an arbitrary real or complex (n, n) matrix. Then there exists a non-singular matrix T such 
that 

T'!AT-J (4.209) 
holds, where J is called the Jordan matriz or Jordan normal form of A. The Jordan matrix has a block 
diagonal structure of the form (4.210), where the elemnts J; of J are called Jordan blocks: 

Ji ^ 

J2 (0) λο (0) 
J= Si , (4.210) Je "s. | (4.211) 


Ji An 


They have the following structure: 
1. If A has only single eigenvalues λ), then J; = Aj and k = n, i.e., J is a diagonal matrix (4.211). 


2. If Aj is an eigenvalue of multiplicity pj, then there are Xf 

one or more blocks of the form (4.212) where the sum of 7 Asi fe) 

the sizes of all such blocks is equal to p; and 377, pj =n J;- uon ! (4.212) 
holds. The exact structure of a Jordan block depends on ET 

the structure of the elementary divisors of the character- ο J X 

istic matrix A — AI. ^j 


For further information see [4.15], [19.16] vol. 1. 


4.6.2.4 Suggestions for the Numerical Calculations of Eigenvalues 


1. Eigenvalues can be calculated as the roots of the characteristic equation (4.195b) (see examples on 
p. 915). In order to get them the coefficients a; (i = 0,1,2,...,n — 1) of the characteristic polynomial 
of the matrix A must be determined. However, one should avoid this method of calculation, because 
this procedure is extremely unstable, i.e., small changes in the coefficients a; of the polynomial result 
in big changes in the roots Aj. 

2. There are many algorithms for the solution of the eigenvalue problem of symmetric matrices. Two 
types can be distinguished (see [4.7]): 

a) Transformation methods, for instance the Jacobi method, Householder tridiagonalization, QR algo- 
rithm. 

b) Iterative methods, for instance vector iteration, the Rayleigh-Ritz algorithm, inverse iteration, 
Lanczos method, the bisection method. As an example the power method of Misesis discussed here. 


[2 
2 
[e] 


3. The Power Method of Mises Assume that A is real and symmetric and has a unique dominan: 
eigenvalue. This iteration method determines this eigenvalue and the associated eigenvector. Let the 
dominant eigenvalue be denoted by Ai, that is, 


πο a] > il e > Anl: (4.213 
Let X4,X»,..., X, be the associated linearly independent eigenvectors. Then: 
1. Ax; =Aix; (¢=1,2,...,n). (4.214 
2. Each element x € R” can be expressed as a linear combination of these eigenvectors x ;: 
X—60X,460X5-:--cc0,X, (G const; i= 1,2,...,n). (4.215 
Multiplying both sides of (4.215) by A k times, then using (4.214) follows 
k k k k k ^ : Xn ! 
A*x = GATX, + C2A$X3 o + CAT X, λι]οιχι + C2 (3) Χο tess +Cn (*) SC: 
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From this relation and (4.213) one can see that 
A*x 
ση — x,ask > oo, 


- that is, A"x z c Mx,. 
Me i T 
101 


This is the basis of the following iteration procedure: 
Step 1: Select an arbitrary starting vector x € R”. 
Step 2: Iterative computation of 

x^) = Ακ (k 20,1,2,...; x? is given). 
From (4.218) and keeping in mind (4.217) follows: 

x) = AFx( & c Mex 
Step 3: From (4.218) and (4.219) it follows that 

xt) = Αχ! = A(AFx(9), 
A(A*x) ~ A(ciMx4) = c A (Ax), 
c (AF Ax) = λι(οιλέκι) 56 Aix ?, therefore 


x (^D a λικίθ, 


(4.217) 


(4.218) 


(4.219) 


(4.220) 


that is, for large values of k the consecutive vectors x *^! and x“) differ approximately by a factor λι. 


Step 4: Relations (4.219) and (4.220) imply for x ; and Ai: 
(x9 : xe 


ux UD τ. 
e y AN (x®, x(®) 


W For example, let 


233—115 1.77 1 
A-|-115 925 213], κθ- [ο]. 
1.77 —2.13 1.56 0 


(4.221) 


κ] xD — x? x?! normalization x4) x?! normalization 
1 3.23 14.89 88.27 1 7.58 67.75 1 
0 |—1.15 —18.12 —208.03 —2.36 |—24.93 —256.85 —3.79 
0 1.77 10.93 82.00 0.93 8.24 79.37 1.17 
à 9.964 10.177 
x(9 x| normalization x9 x) normalization 
9.66 96.40 1 10.09 102.33 il 
— 38.78 —394.09 —4.09 —41.58 —422.49]} —4.129 | ~ x, 
11.67 117.78 1.22 | 1238 125.73|\_ 1229 
10.16 10.161 z A4 
Remarks: 


1. Since eigenvectors are unique only up to a constant multiplier, it is preferable to normalize the 
vectors αἱ) as shown in the example. 


2. The eigenvalue with the smallest absolute value and the associated eigenvector can be obtained by 
using the power method of Mises for A~!. If A^! does not exist, then 0 is this eigenvalue and any vector 
from the null-space of A can be selected as an associated eigenvector. 


3. The other eigenvalues and the associated eigenvectors of A can be obtained by repeated application 
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of the following idea. Select a starting vector which is orthogonal to the known vector x4, and in this 
subspace A» becomes the dominant eigenvalue that can be obtained by using the power method. In order 
to obtain As, the starting vector has to be orthogonal to both x, and x5, and so on. This procedure is 
known as matriz deflation. 


4. Based on (4.218) the power method is sometimes called vector iteration. 


4.6.3 Singular Value Decomposition 


1. Singular Values and Singular Vectors Let A be a real matrix of size (m, n) and its rank be 
equal to r. The matrices AAT and ATA have r non-zero eigenvalues λν, and they are the same for 
both of the matrices. The positive square roots d, = VA, (v = 1,2,...,r) of the eigenvalues A, o 
the matrix AT A are called the singular values of the matrix A. The corresponding eigenvectors u, 
of AT A are called right-singular vectors of A, the corresponding eigenvectors v, of AAT left-singular 
vectors: 


ATAu,— Xu, AAV, - Ανν, (v—12,..,r) (4.222. 
The relations between the right and left-singular vectors are: 

Au, =d,v,, Av, = dyu. (4.222b 
A matrix A of size (m, n) with rank r has r positive-singular values d, (v = 1,2,...,r). There exist r 
orthonormalized right-singular vectors u, and r orthonormalized left-singular vectors v,. Furthermore, 
there exist to the zero-singular value n — r orthonormalized right-singular vectors u,(v = r 4-1,...,m 
and m — r orthonormalized left-singular vectors v, (v = r+ 1,..., m). Consequently, a matrix o 


size (m, n) has n right-singular vectors and m left-singular vectors, and two orthogonal matrices can 
be made from them (see 4.1.4, 9., p. 275): 


U-(u,u;....u&) V = (Vi; Vaze Vina). (4.223 


2. Singular-Value Decomposition The representation 


d0 0 ::: sex Ὁ 
dy i 
é T TOWS 
e 0d,/0-+-0 
A-—VAUT (4.224a) with A= 0 r -0 (4.224b) 
m — r TOWS 
ETE δω; 


T——— —— 
r columns n — r columns 


is called the singular-value decomposition of the matrix A. The matrix A, as the matrix A, is of size 
(m, n) and has only zero elements except the first r diagonal elements a,, = d, (v = 1,2,...,r). The 
values d, are the singular values of A. 

Remark: If substituting A" instead of AT and consider unitary matrices U and V instead of orthogo- 
nals, then all the statements about singular-value decomposition are valid also for matrices with com- 
plex elements. 


3. Application Singular-value decomposition can be used to determine the rank of the matrix A 
of size (m,n) and to calculate an approximate solution of the over-determined system of equations 
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Ax = b (see 4.5.3.1, p. 313) with the so-called regularization method, i.e., to solve the problem 


m n 2 n 
[Ακ — b|? + a|Ix|? = M7 » αμ — | +aŅ αἲ = min!, (4.225) 


i=1 Le=1 k=1 
where a > 0 is a regularization parameter. 


5 Algebraand Discrete Mathematics 


5.1 Logic 


5.1.1 Propositional Calculus 


1. Propositions 
A proposition is the mental reflection of a fact, expressed as a sentence in a natural or artificial language. 
Every proposition is considered to be true or false. This is the principle of two-valuedness (in contrast 
to many-valued or fuzzy logic, see 5.9.1, p. 413). “1 


“True” and “false” are called the truth value of the 
proposition and they are denoted by T (or 1) and F (or 0), respectively. The truth values can be 
considered as propositional constants. 

2. Propositional Connectives 

Propositional logic investigates the truth of compositions of propositions depending on the truth of the 
components. Only the extensions of the sentences corresponding to propositions are considered. Thus 
the truth of a composition depends only on that of the components and on the operations applied. So 
in particular, the truth of the result of the propositional operations 


"NOT A" (24), (5.1 “A AND B" (ΑΛ B), (5.2) 

“AOR B” (Αν B), (5.3 “IF A, THEN B" (A — B) (5.4) 
and 

“A IF AND ONLY IF B” (A & B) (5:5) 


are determined by the truth of the components. Here “logical OR” always means “inclusive OR”, i.e., 
“AND/OR”. In the case of implication, for A = B also the following verbal forms are in use: 


A implies B, B is necessary for A, A is sufficient for B. 
3. Truth Tables 
In propositional calculus, the propositions A and B are considered as variables (propositional variables) 


which can have only the values F and T. Then the truth tables in Table 5.1 contain the truth functions 
defining the propositional operations. 


Table 5.1 Truth tables of propositional calculus 


Negation Conjunction Disjunction Implication Equivalence 


A || 7A A|B\AAB A|B|[AvB A|B|A-—B A JB[|AeB 


FI T FF F ΕΙΕ F FF HN FF T 
T| F F|T F FIT T FIT T F|T F 
T|F F T|F T τις F τις F 
T/T T TIT T T|T T T/T T 


4. Formulas in Propositional Calculus 

Compound expressions (formulas) of propositional calculus can be composed from the propositional 
variables in terms of a unary operation (negation) and binary operations (conjunction, disjunction, 
implication and equivalence). These expressions, i.e., the formulas, are defined in an inductive way: 


1. Propositional variables and the constants T, F are formulas. (5.6) 
2. If A and B are formulas, then (A) (AA B) (AVB) (A=B) (ASB) (5.7) 
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are also formulas. 


Το simplify formulas parentheses are omitted after introducing precedence rules. In the following se- 
quence every propositional operation binds more strongly than the next one in the sequence: 


a, A, V, >, 9. 


Often the notation A instead of “~A” is used, and the symbol A is omitted. By these simplifications, 
for instance the formula ((A V (=B)) = ((A A B) V C)) can be rewritten more briefly in the form: 


ΑΝ B ^ ABV C. 
5. Truth Functions 


Assigning a truth value to every propositional variable of a formula, the assignment is called an inter- 
pretation of the propositional variables. Using the definitions (truth 
a truth value can be assigned to a formula for 
every possible interpretation of the variables. 
Thus for instance the formula given above de- 


A|B|C|AVB|ABVC|AVB-— ABVC 
F FF T F F 
ΕΙΕΙΤ a T T 
F|T|F F F E 
FITT F T T 
ΤΙΕΙΕ Τ F F 
T FJ|T T T T 
T TIE T T T 
TAE | salt T T T 


termines a 


ables) of propositional operations 


ruth function of three variables (a 


Boolean function see 5.7.5, p. 413). 


W In this way, every formula with n proposi- 
tional variables determines an n place (or n 


ary) truth 


unction, i.e., a function which as- 


signs a truth value to every n tuple of truth 

9n t 
values. There are 27" n ary truth functions, 
in particular these are 16 binary ones. 


6. Elementary Laws in Propositional Calculus 
Two propositional formulas A and B are called logically equivalent or semantically equivalent, denoted 
by A — B, if they determine the same truth function. Consequently, the logical equivalence of propo- 
sitional formulas can be checked in terms of truth tables. So there is, e.g., AV B  ABVC = BVC, 
i.e., the formula A V B = AB V C does not in fact depend on A, as follows from its truth table above. 
In particular, there are the following elementary laws of propositional calculus: 


1. Associative Laws 
(AANB)AC=AA(BAC), 


2. Commutative Laws 
AA^AB-—BAA, 

3. Distributive Laws 
(Αν B)C = AC V BC, 

4. Absorption Laws 
A(AVB)—A, 

5. Idempotence Laws 
AA — A, 

6. Excluded Middle 
AA=F, 

7. De Morgan Rules 
AB=AVB, 


(5.8a) 


(5.9a) 


(5.12a) 


(5.13a) 


(AVB)VC=AV(BVC). (58b 
Αν Β-- Bv A. (5.9b 
ABVC =(AVC)\(BVC). (510b 
Av AB -- A. 5.11b 
AV A- A. 5.12b 
AVA=T. 5.13b 
AVB=AB 5.14b 
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8. Laws for T and F 


AT=A, (5.15a) AVF=A, (5.15b 

AF =F, (5.15c) ΑΝΤ-Τ, (5.154 

T=, (5.15e) F=T. (5.15f 
9. Double Negation 

À- A. (5.16 
Using the truth tables for implication and equivalence, gives the identities 

A>B=AVB (5.17a) and A & B= ABV AB. (5.17b 


Therefore implication and equivalence can be expressed in terms of other propositional operations. 
Laws (5.17a), (5.17b) are applied to reformulate propositional formulas. 

E The identity AV B > ABV C = B V C can be verified in the following way: AV B > ABV C = 
AVBV ABVC=ABV ABVC=ABV ABVC=(AVA)BVC=TBVC=BVC. 


10. Further Transformations 
A(AV B) = AB, (5.18a) ΑΝ AB=AVB, (5.18b) 


(AVC)(BV Ο(Αν B) = (Αν C)(BV ©), (518) ACV BCV AB = ACV BC. (5.184) 


11. NAND Function and NOR Function As it is known, every propositional formula determines 
a truth function. Checking the following converse of this statement: Every truth function can be rep- 
resented as a truth table of a suitable formula in propositional logic. Because of (5.17a) and (5.17b) 
implication and equivalence can be eliminated from formulas (see also 5.7, p. 395). This fact and the 
De Morgan rules (5.14a) and (5.14b) imply that one can express every formula, therefore every truth 


function, in terms of negation and disjunction only, or in terms of negation and conjunction. There are 
two further binary truth functions of two variables which are suitable to express all the truth functions. 
They are called the NAND function or Shef- 

Table 5.2 NAND function Table 5.3 NOR function fer function (notation “|”) and the NOR 
AIB | A|B AIB] AĻB function or Peirce function (notation “ |”), 
with the truth tables given in Tables 5.2 

F|F T ΕΤΕΡ T and 5.3. Comparison of the truth tables 
ET T F/T F for these operations with the truth tables 
TF T. TF F of conjunction and disjunction makes the 
T|T| F "E E F terminologies NAND function (NOT AND) 


and NOR function (NOT OR) clear. 


7. Tautologies, Inferences in Mathematics 

A formula in propositional calculus is called a tautology if the value of its truth function is identically 
the value T. Consequently, two formulas A and B are called logically equivalent if the formula A <= B 
is a tautology. Laws of propositional calculus often reflect inference methods used in mathematics. As 
an example, consider the law of contraposition, i.e., the tautology 


A—-BeB-A. (5.194) 
This law, which also has the form 
A=>B=B=A, (5.19b) 


can be interpreted in this way: To show that B is a consequence of A is the same as showing that A is a 
consequence of B. The Indirect proof (see also 1.1.2.2, p. 5) is based on the following principle: To show 
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that B is a consequence of A, one supposes P to be false, and under the assumption that A is true, one 
derives a contradiction. This principle can be formalized in propositional calculus in several ways: 


A=>B=ABSA (5.20a) or AS>B=AB>B or (5.20b) 


A=>B=AB=SF. (5.20c) 


5.1.2 Formulas in Predicate Calculus 


For developing the logical foundations of mathematics one needs a logic which has a stronger expressive 
power than propositional calculus. To describe the properties of most of the objects in mathematics 
and the relations between these objects the predicate calculus is needed. 
1. Predicates 
The objects to be investigated are included into a set, i.e., into the domain X of individuals (or uni- 
verse), e.g., this domain could be the set IN of the natural numbers. The properties of the individuals, as, 
e.g., ^nisa prime ", and the relations between individuals, e.g., *^ m is smaller than n ", are considered 
as predicates. An n place predicate over the domain X of individual is an assignment P: X^ > {F,W}, 
which assigns a truth value to every n tuple of the individuals. So the predicates introduced above on 
natural numbers are a one-place (or unary) predicate and a two-place (or binary) predicate. 

2. Quantifiers 
A characteristic feature of predicate logic is the use of quantifiers, i.e., that of a universal quantifier or 
“for every” quantifier V and existential quantifier or “for some" quantifier 3. If P is a unary predicate, 
then the sentence “P() is true for every x in X" is denoted by V x P(x) and the sentence“ There exists 
an 2; in X for which P(x) is true "is denoted by 3x P(x). Applying a quantifier to the unary predicate 
P, gives a sentence. If for instance IN is the domain of individual of the natural numbers and P denotes 
the (unary) predicate “n is a prime", then V n P(n) is a false sentence and 3n P(n) is a true sentence. 
3. Formulas in Predicate Calculus 

The formulas in predicate calculus are defined in an inductive way: 


1. If x1,... , &n are individual variables (variables running over the domain of individual variables) and 
P is an n-place predicate symbol, then 

P(xi,...,v4) isa formula (elementary formula). (5.21) 
2. If A and B are formulas, then 

(^A), (A^ B), (AV B), (A = B), (A & B), (Vr A) and (3x A) (5.22) 


are also formulas. 

Considering a propositional variable to be a null-place predicate, the propositional calculus can be 
considered as a part of predicate calculus. An occurrence of an individual variable x is bound in a 
formula if x is a variable in V x or in d x or the occurrence of x is in the scope of these types of quantifiers; 
otherwise an occurrence of x is free in this formula. A formula of predicate logic which does not contain 
any free occurrences of individual variables is called a closed formula. 

4. Interpretation of Predicate Calculus Formulas 

An interpretation of predicate calculus is a pair of 

e a set (domain of individuals) and 

e an assignment, which assigns an n-place predicate to every n-ary predicate symbol. 

For every prefixed value of free variables the concept of the truth evaluation of a formula is similar to 
the propositional case. The truth value of a closed formula is T or F. In the case of a formula containing 
free variables, one can associate the values of individuals for which the truth evaluation of the formula 
is true; these values constitute a relation (see 5.2.3, 1., p. 331) on the universe (domain of individuals). 
W Let P denote the two-place relation € on the domain IN of individuals, where IN is the set of the 
natural numbers then 
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ο P(x, y) characterizes the set of all the pairs (x, y) of natural numbers with x < y (two-place or binary 
relation on IN); here z, y are free variables; 

e V y P(x, y) characterizes the subset of N (unary relation) consisting of the element 0 only; here x is 
a free variable, y is a bound variable; 

e Jx Vy P(x,y) corresponds to the sentence “ There is a smallest natural number ”; the truth value is 
true; here x and y are bound variables. 


5. Logically Valid Formulas 
A formula is called logically valid (or a tautology) if it is true for every interpretation. The negation o 
formulas is characterized by the identities below: 


-VzrP(r)—-3r-P(x) or —-3zP(r)-Vzc-P(x). (5.23 
Using (5.23) the quantifiers V and 3 can be expressed in terms of each other: 

VrP(r)—- πα πρι) or 3x P(x) =-Va-P(z). (5.24 
Further identities of the predicate calculus are: 

VrVyP(r,y)-—VyNz P(x,y), (5.25 

da dy P(x,y) oa (5.26 

Va P(x) AVx Q(x) 2 Vr (P(x) A Q(x)), (5.27 

ee (x) V Q(x)). (5.28 


The following implications are also valid: 


s r) V Vx Ω(α) > Va (P(x) V Q(x)), (5.29 
z (P(x) ^ Q(x)) > 3x P(x) Ada Q(z), (5.30 
x (P(x) > Q(x)) > σα P(x) > Va Q(x), (5.31 
x (P(x) 5» Q(x) > (Vx P(t) & Vx Q(z)), (5.32 


a =>Vyda P(x,y). (5.33 
The converses of these implications are not valid, in particular, one has to be careful with the fact tha: 
the quantifiers V and 3 do not commute (the converse of the last implication is false). 


6. Restricted Quantification 
Often it is useful to restrict quantification to a subset of a given set. So, there is considered 


Va eX P(x) asashort notation of Vr(rc X > P(x)) and (5.34 
Jx c X P(x) asashort notation of. 3x (x € X ^ P(x)). (5.35 


5.2 Set Theory 
5.2.1 Concept of Set, Special Sets 


The founder of set theory is Georg Cantor (1845-1918). The importance of the notion introduced by 
him became well known only later. Set theory has a decisive role in all branches of mathematics, and 
today it is an essential tool of mathematics and its applications. 

1. Membership Relation 
1. Sets and their Elements The fundamental notion of set theory is the membership relation. A 
set Α is a collection of certain different things a (objects, ideas, etc.) that belong together for certain 
reasons. These objects are called the elements of the set. One writes “a € A” or “a ¢ A” to denote “a 
is an element of A” or “a is not an element of A”, respectively. Sets can be given by enumerating their 
elements in braces, e.g., M = {a,b,c} or U = {1,3,5,...}, or by a defining property possessed exactly 
by the elements of the set. For instance the set U of the odd natural numbers is defined and denoted by 
U — (x | vis an odd natural number]. For number domains the following notation is generally used: 
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—10,1,2;...) set of the natural numbers, 
10,1, 21,2, -2,...) set of the integers, 


— 
κο | ο 


DqeZ^qz ο set of the rational numbers, 
q 


N 
Z 
Q= 
R set of the real numbers, 
C set of the complex numbers. 
2. Principle of Extensionality for Sets Two sets A and B are identical if and only if they have 
exactly the same elements, i.e., 

A—-BeVzr(rcAexc B) (5.36) 
W The sets {3, 1,3,7,2} and (1,2,3, 7) are the same. 
A set contains every element only “once” , even if it is enumerated several times. 


2. Subsets 
1. Subset If A and B are sets and 
Vo (te A vc B) (5.37) 


holds, then A is called a subset of B, and this is denoted by A C B. In other words: A is a subset of B, 
if all elements of A also belong to B. If for A C B there are some further elements in B such that they 
are not in A, then A is called a proper subset of B, and it is denoted by A C B (Fig. 5.1). Obviously, 
every set is a subset of itself A C A. 
E Suppose A = (2,4,6,8,10] is a set of even numbers and B = {1,2,3,...,10} is a set of natural 
numbers. Since the set A does not contain odd numbers, A is a proper subset of B. 
2. Empty Set or Void Set [It is important and useful to introduce the notion of empty set or void 
set, 0, which has no element. Because of the principle of extensionality, there exists only one empty set. 
ΒΛ: The set (z|z € RA 2? + 2x + 2 = 0) is empty. 
W B: () C M for every set M, i.e., the empty set is a subset of every set M. 
For a set A the empty set and A itself are called the trivial subsets of A. 
3. Equality of Sets Two sets are equal if and only if both are subsets of each other: 

A—-BeACBABGCA. (5.38) 
This fact is very often used to prove that two sets are identical. 
4. Power Set The set of all subsets A of a set M is called the power set of M and it is denoted by 
P(M),ie,P(M)-(A| AC M). 
E For the set M = {a,b,c} the power set is 

P(M) = (0, {a}, {b}, {c}, (a, b}, {a,c}, (b ο}, (a, b, ο) }. 
It is true that: 
a) If a set M has m elements, its power set IP(M) has 2" elements. 
b) For every set M there are M, Ø € IP(M), i.e., M itself and the empty set are elements of the power 
set of M. 
5. Cardinal number The number of elements of a finite set M is called the cardinal number of M 
and it is denoted by card M or sometimes by |M]. 
For the the cardinal number of sets with infinitely many elements see 5.2.5, p. 335. 


5.2.2 Operations with Sets 


1. Venn diagram 

The graphical representations of sets and set operations are the so-called Venn diagrams, when repre- 
senting sets by plane figures. So, Fig. 5.1, represents the subset relation A C B. 

2. Union, Intersection, Complement 

By set operations new sets can be formed from the given sets in different ways: 
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Figure 5.1 Figure 5.2 Figure 5.3 


1. Union Let A and B be two sets. The union set or the union (denoted by A U B) is defined by 
AUB-—Ír|rcAvzreB], (5.39) 
in words “A union B” or “A cup B”. If A and B are given by the properties E, and E» respectively, the 
union set AU B has the elements possessing at least one of these properties, i.e., the elements belonging 
to at least one of the sets. In Fig. 5.2 the union set is represented by the shaded region. 
E {1, 2,3} U {2,3,5,6} = (1,2,3,5,6]. 
2. Intersection Let A and B be two sets. The intersection set, intersection, cut or cut set (denoted 
by AN B) is defined by 
ANB={x|x@eAAze B}, (5.40) 
in words “A intersected by B” or “A cap B”. If A and B are given by the properties E; and E» respec- 
tively, the intersection AN B has the elements possessing both properties E, and Es, i.e., the elements 
belonging to both sets. In Fig. 5.3 the intersection is represented by the shaded region. 


Β With the intersection of the sets of divisors T(a) and T (b) of two numbers a and b one can define the 
greatest common divisor (see 5.4.1.4, p. 373). Fora = 12 and b = 18 holds T (a) = (1,2,3,4,6,12) and 
T(b) = {1,32, 3,0, 9, 18}, so T(12) n T(18) contains the common divisors, and the greatest common 
divisor is g.c.d. (12,18) — 6. 
3. Disjoint Sets Two sets A and B are called disjoint if they have no common element; for them 
ANB=0 (5.41) 
holds, i.e., their intersection is the empty set. 
W The set of odd numbers and the set of even numbers are disjoint; their intersection is the empty set, 
i.e., 
{odd numbers} {even numbers} = 0. 
4. Complement Considering only the subsets of a given set M, then the complementary set or the 
complement Cy4(A) of A with respect to M contains all the elements of M not belonging to A: 
Cu(A) 2 (| ve M^x ¢€ A}, (5.42) 
in words “complement of A with respect to M", and M is called the fundamental set or sometimes the 
universal set. If the fundamental set M is obvious from the considered problem, then the notation A 
is also used for the complementary set. In Fig. 5.4 the complement A is represented by the shaded 
region. 


M 


>o Ες), «e 


Figure 5.4 Figure 5.5 Figure 5.6 


3. Fundamental Laws of Set Algebra 
These set operations have analoguous properties to the operations in logic. The fundamental laws of 
set algebra are: 
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1. Associative Laws 


(An B)nC — An (Bn C), (5.43) (AU B)UC — AU(BUC). (5.44 
2. Commutative Laws 

ANB=BNA, (5.45) AUB=BUA. (5.46 
3. Distributive Laws 

(AUB)NC=(ANC)U(BNC), (5.47) (AN B)UC=(AUC)N (BU C). (5.48 
4. Absorption Laws 

An(AUB)—A, (5.49) AU (An B) =A. (5.50 
5. Idempotence Laws 

An A - A, 5.51) ΑΙ) Α-- A. (5.52 
6. De Morgan Laws 

AnB-AUB, 5.53) AUB=ANB. (5.54 
7. Some Further Laws 

ANA=9, 5.55) AUA=M (M fundamental set), (5.56 

ANM=A, 5.57) AU — A, (5.58 

An — 0, 5.59) AUM=M, (5.60 

M-0 5.61) 0—M (5.62 

A=A. 5.63) 
This table can also be obtained from the fundamental laws of propositional calculus (see 5.1.1, p. 323 


using the following substitutions: ^ by N, V by U, T by M, and F by Ø. This coincidence is not acci- 
dental; it will be discussed in 5.7, p. 395. 
4. Further Set Operations 
In addition to the operations defined above there are defined some further operations between two sets 
A and B: the difference set or difference A \ B, the symmetric difference AAB and the Cartesian 
product A x B. 
1. Difference of Two Sets The set of the elements of A, not belonging to B is the difference set or 
difference of A and B: 

ANB-—(x|x€ A^x6 B). (5.644) 
If A is defined by the property E; and B by the property E», then A \ B contains the elements having 
the property Ει but not having property E». 
In Fig. 5.5 the difference is represented by the shaded region. 
B {1,2,3,4} {3,4,5} = {1,2}. 
2. Symmetric Difference of Two Sets The symmetric difference AA B is the set of all elements 
belonging to exactly one of the sets A and B: 

AAB —(z|(xe€ AAc¢ B)V(r e BAx¢ A)). (5.64b) 
It follows from the definition that 

AAB = (AN B)U(B\ A) = (AUB) \ (An B), (5.64c) 
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i.e., the symmetric difference contains the elements which have exactly one of the defining properties 
E; (for A) and E» (for B). 

In Fig. 5.6 the symmetric difference is represented by the shaded region. 

E (1.2.3.44^(3.4,5] = {1, 2, 5}. 

3. Cartesian Product of Two Sets The Cartesian product of two sets A x B is defined by 


Ax B= ((a,b) | ae A^be B}. (5.65a 
The elements (a, b) of A x B are called ordered pairs and they are characterized by 

(a,b) = (c, d) @a=cAb=d. (5.65b 
The number of the elements of a Cartesian product of two finite sets is equal to 

card (A x B) = (cardA)(cardB). (5.65c 


WB A: For A = {1,2,3} and B = {2,3} one gets A x B = {(1,2), (1,3), (2, 2), (2,3), (3, 2), (3,3) 
and B x A = {(2,1), (2,2), (2, 3), (3, 1). (3, 2), (3, 3)) with cardA = 3, cardB = 2, card(A x B) = 
card(B x A) — 6. 
ΒΒ: Every point of the x, y plane can be defined with the Cartesian product R x R (R is the set ο 
real numbers). The set of the coordinates x, y is represented by R x R, so: 

R? = R x R={(2z,y)| x € Rye R}. 
4. Cartesian Product of n Sets 


From n elements, by fixing an order of sequence (first element, second element, ..., n-th element) an 
ordered n tuple is defined. If a; € A; (i = 1,2,...,n) are the elements, the n tuple is denoted by 
(a1, 02,...,a4,), where a; is called the i-th component. 
For n — 3,4,5 these n tuples are called triples, quadruples, and quintuples. 
The Cartesian product of n terms Αι x A» x -+ x A, is the set of all ordered n tuples (a4, a2, . . . , an) 
with a; € Aj: 

ΑΙΧ...Χ An = {ίαι,... απ) | a; € Αι (i =1,...,n)}. (5.664) 
If every A; is a finite set, the number of ordered n tuples is 

card(A, x Ag x +++ x An) = cardA; cardAg : - : card An. (5.66b) 


Remark: The n times Cartesian product of a set A with itself is denoted by A". 
5.2.3 Relations and Mappings 


1. nary Relations 
Relations define correspondences between the elements of one or different sets. An n ary relation or 
n-place relation R between the sets A,..., An is a subset of the Cartesian product of these sets, i.e., 


RC A x... x An. If the sets Aj, i = 1,...,n, are all the same set A, then R C A" holds and it is 
called an n ary relation in the set A. 
2. Binary Relations 
1. Notion of Binary Relations of a Set The two-place (binary) relations in a set have special 
importance. 
In the case of a binary relation the notation aRb is also very common instead of (a,b) € R. 
Β = Asan example, the divisibility relation in the set A = {1,2,3,4} is considered, i.e., the binary 
relation 

T = {(a,b) |a,b€ AA ajisa divisor of b} (5.67a) 

= {(1, 1), (1,2), (1, 3), (1, 4), (2, 2), (2, 4), (3,3), (4, 4)). (5.67b) 

2. Arrow Diagram or Mapping Function Finite binary relations R in a set A can be represented 
by arrow functions or arrow diagrams or by relation matrices. The elements of A are represented as 
points of the plane and an arrow goes from a to b if aRb holds. 
Fig. 5.7 shows the arrow diagram of the relation T in A = {1, 2,3, 4}. 
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Figure 5.7 Scheme: Relation matrix 


3. Relation Matrix The elements of A are used as row and column entries of a matrix (see 4.1.1, 
1., p. 269). At the intersection point of the row of a € A with the column of b € B there is an entry 
1 if aRb holds, otherwise there is an entry 0. The above scheme shows the relation matrix for T in 
A = {1,2,3,4}. 


3. Relation Product, Inverse Relation 
Relations are special sets, so the usual set operations (see 5.2.2, p. 328) can be performed between 
relations. Besides them, for binary relations, the relation product and the inverse relation also have 
special importance. 
Let R C A x B and S C B x C be two binary relations. The product R o S of the relations R, S is 
defined by 

RoS —((a,c) | 3b (b € B^aRb^bSc)). (5.68 


The relation product is associative, but not commutative. 


The inverse relation R^! of a relation R is defined by 


R^! = {(b,a) | (a,b) € R}. (5.69 
For binary relations in a set A the following relations are valid: 

(RUS)oT =(RoT)U(SoT), (5.70a) (RnS)oT C(RoT)n(SoT), (5.70b 

(RUS)'!-R'US, (5.70c) (RnS)'!-R'!ns', (5.70d 

(Rosj --6-1ο ΠῚ. (5.70e) 


4. Properties of Binary Relations 
A binary relation in a set A can have special important properties: 


Ris called 
reflexive, if Va € AaRa, (5.71a) 
irreflexive, if Va € A-—aRa, (5.71b) 
symmetric, if Va,b € A (aRb > bRa), (5.71c) 
antisymmetric, if Va,b € A (aRb ^ bRa > a = b), (5.71d) 
transitive, if Va,b,c € A(aRb ^ bRc > aRc), (5.Τ16) 
linear, if Va,b € A (aRb V bRa). (5.71f) 


These relations can also be described by the relation product. For instance: a binary relation is transi- 
tive if Ro R C R holds. Especially interesting is the transitive closure tra(R) of a relation R. It is the 
smallest (with respect to the subset relation) transitive relation which contains Π. In fact 


ta(R)-|)m-numumu..., (5.72) 


n21 


where R” is the n times relation product of R with itself. 
W Let a binary relation R on the set {1, 2, 3, 4, 5} be given by its relation matrix M: 
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Μι 2 3 4 5 M?|1 2 3 4 5 Μ3|1 2 3 4 5 
1/1 00 1 0ο 1/110 1 1 l.|i 101 1 
210006 1 0 2/0 1001 2.01010 
3/0 0 1 0 1 3/0 1101 3 (0 111 1 
4/0 10 0 1 4 JO 101 0 4/0 10 1 1 
510 1000 δ 10060610 5 |0 1001 


Calculating M? by matrix multiplication where the values 0 and 1 are treated as truth values and 
instead of multiplication and addition one performs the logical operations conjunction and disjunction, 
then, M? is the relation matrix belonging to R?. The relation matrices of R?, R! etc. can be calculated 
similarly. 

The relation matrix of RU R? U B? (the matrix on the left) can 


MV M?vM?|1 2 3 4 5 be get by calculating the disjunction elementwise of the matri- 
1 1101 1 ces M, M? and M E Since the higher powers of M contains no 
2 0 101 1 new 1-s, this matrix already coincides with the relation matrix 
3 0 1 11 1 of tra( R). 
4 0 1.0 1 1 The relation matrix and relation product have important ap- 
5 ο 10 1 1 plications in search of path length in graph theory (see 5.8.2.1, 


p. 404). 
In the case of finite binary relations, one can easily recognize the above properties from the arrow 
diagrams or from the relation matrices. For instance one can recognize the reflexivity from "self-loops" 
in the arrow diagram, and from the main diagonal elements 1 in the relation matrix. Symmetry is 
obvious in the arrow diagram if to every arrow there belongs another one in the opposite direction, or 
if the relation matrix is a symmetric matrix (see 5.2.3, 2., p. 331). Easy to see from the arrow diagram 
or from the relation matrix that the divisibility T' is a reflexive but not symmetric relation. 


5. Mappings 

A mapping or function f (see 2.1.1.1, p. 48) from a set A to a set B with the notation f: A > B isa 
rule to assign to every element a € A exactly one element b € B, which is called f(a). 

A mapping f can be considered as a subset of A x B and so as a binary relation: 


f —((a.f(a))a e A} ς Ax B. (5.73) 


a) f is called a injective or one to one mapping, if to every b € B at most one a € A with f(a) = b 
exists. 

b) f is called a surjective mapping from A to B, if to every b € B at least one a € A with f(a) = b 
exists. 

c) f is called bijective, if f is both injective and surjective. 

If A and B are finite sets, between which exists a bijective mapping, then A and B possess the same 
number of elements (see also 5.2.5, p. 335). 

For a bijective mapping f: A — B exists the inverse relation f~': B — A, the so-called inverse 
mapping of f. 

The relation product of mappings is used for the one after the other composition of mappings: If f: 
A — B and g: B — C are mappings, then f o g is also a mapping from A to C, and is defined by 


(f ο g)(a) = g(f(a)) . (5.74) 


Remark: Becareful with the order of f and g in this equation (it is treated differently in the literature!). 


5.2.4 Equivalence and Order Relations 


'The most important classes of binary relations with respect to a set Α are the equivalence and order 
relations. 
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1. Equivalence Relations 

A binary relation R with respect to a set A is called an equivalence relation if R is reflexive, symmetric, 

and transitive. For aRb also the notations a ^g b or a ~ b are used, if the equivalence relation R is 

already known, in words a is equivalent to b (with respect to R). 

Examples of Equivalence Relations: 

B A: A= Z, m € N \ {0}. a ~p b holds exactly if a and b have the same remainder when divided by 

m (they are congruent modulo m). 

W B: Equality relation in different domains, e.g., in the set Q of rational numbers: A = » € piq = 
1 2 

pod (Pı, po. qı, qo integer; q1, q2 # 0), where the first equality sign defines an equality in Q, while the 

second one denotes an equality in Z. 

W C: Similarity or congruence of geometric figures. 

W D: Logical equivalence of expressions of propositional calculus (see 5.1.1, 6., p. 324). 

2. Equivalence Classes, Partitions 

1. Equivalence Classes An equivalence relation in a set A defines a partition of A into non-empty 

pairwise disjoint subsets, into equivalence classes. 


la]r :— {b| be A^a cg b) (5.75) 
is called an equivalence class of a with respect to R. For equivalence classes the following is valid: 

lar #0, an be α]η = [b]r, and ambe [ann [bn = 0. (5.76) 
These equivalence classes form a new set, the quotient set A/R: 

A/R = (la]n | a € A). (5.77) 
A subset Z C P(A) of the power set P(A) is called a partition of A if 

042, X, YeZ^X2YoxnY-0, |) XA. (5.78) 

ΧεΖ 


2. Decomposition Theorem Every equivalence relation R in a set A defines a partition Z of A, 

namely Z = A/R. Conversely, every partition Z of a set A defines an equivalence relation R in A: 
ücgbedXeZ(ae XA^be X). (5.79) 

An equivalence relation in a set A can be considered as a generalization of the equality, where * in- 

significant " properties of the elements of A are neglected, and the elements, which do not differ with 

respect to a certain property, belong to the same equivalence class. 

3. Ordering Relations 

A binary relation R in a set A is called a partial ordering if R is reflexive, antisymmetric, and transitive. 

If in addition R is linear, then R is called a linear ordering or a chain. The set A is called ordered or 

linearly ordered by R. In a linearly ordered set any two elements are comparable. Instead of aRb also 

the notation a € g b or a € bis used, if the ordering relation R is known from the problem. 

Examples of Ordering Relations: 

ΒΑ: The sets of numbers N, Z, Q, R are completely ordered by the usual < relation. 

W B: The subset relation is also an ordering, but only a partial ordering. 


W C: The lexicographical order of the English words is a chain. 

Remark: If Z = (A, B} is a partition of Q with the property a € A Ab € B =a < b, then (A, B) is 
called a Dedekind cut. If neither A has a greatest element nor B has a smallest element, so an irrational 
number is uniquely determined by this cut. Besides the nest of intervals (see 1.1.1.2, p. 2) the notion 
of Dedekind cuts is another way to introduce irrational numbers. 

4. Hasse Diagram 

Finite ordered sets can be represented by the Hasse diagram: Let an ordering relation € be given on a 
finite set A. The elements of A are represented as points of the plane, where the point b € A is placed 
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above the point a € A ifa < b holds. If there is no c € A for which a < c < b, one says a and b are 
neighbors or consecutive members. Then one connects a and b by a line segment. 


A Hasse diagram is a “simplified” arrow diagram, where all the loops, arrow- 4ο 
heads, and the arrows following from the transitivity of the relation are elimi- 
nated. The arrow diagram of the divisibility relation T of the set A = {1, 2,3, 4} 
is given in Fig. 5.7. T also denotes an ordering relation, which is represented 2. 03 
by the Hasse diagram in Fig. 5.8. 


5.2.5 Cardinality of Sets 


In 5.2.1, p. 327 the number of elements of a finite set was called the cardinality 
of the set. This notion of cardinality can be extended to infinite sets. Figure 5.8 


le 


1. Cardinal Numbers 

Two sets A and B are called equinumerous if there is a bijective mapping between them. To every set A 
a cardinal number |A| or card A is assigned, so that equinumerous sets have the same cardinal number. 
A set and its power set are never equinumerous, so no “ greatest " cardinal number exists. 

2. Infinite Sets 

Infinite sets can be characterized by the property that they have proper subsets equinumerous to the 
set itself. The "smallest" infinite cardinal number is the cardinal number of the set IN of the natural 
numbers. This is denoted by No (aleph 0). 


A set is called enumerable or countable if it is equinumerous to IN. This means that its elements can be 
enumerated or written as an infinite sequence a1, 2, . . .. 


A set is called non-countable if it is infinite but it is not equinumerous to IN. Consequently every infinite 
set which is not enumerable is non-countable. 

W A: The set Z of integers and the set Q of the rational numbers are countable sets. 

W B: The set R of the real numbers and the set C of the complex numbers are non-countable sets. 
These sets are equinumerous to P(N), the power set of the natural numbers, and their cardinality is 
called the continuum. 


5.3 Classical Algebraic Structures 
5.3.1 Operations 


1. nary Operations 

The notion of structure has a central role in mathematics and its applications. Next to investigate are 
algebraic structures, i.e., sets on which operations are defined. An n ary operation y on a set A is a 
mapping y: A” — A, which assigns an element of A to every n tuple of elements of A. 

2. Properties of Binary Operations 

Especially important is the case n = 2, which is called a binary operation, e.g., addition and multipli- 
cation of numbers or matrices, or union and intersection of sets. A binary operation can be considered 
as a mapping * : A x A — A, where instead of the notation “κ(α, 5)" in this chapter mostly the infix 
form “ax b? will be used. A binary operation x in A is called associative if 


(ax b)*c=ax (bxc), (5.80) 
and commutative if 
axb=bxa (5.81) 


holds for every a,b,c € A. 
An element e € A is called a neutral element with respect to a binary operation » in A if 


ü*e-—exa-a holds for every a€ A. (5.82) 
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3. Exterior Operations 
Sometimes exterior operations are to be considered. That are the mappings from K x A to K, where 
K is an "exterior" and mostly already structured set (see 5.3.8, p. 365). 


5.3.2 Semigroups 

The most frequently occurring algebraic structures have their own names. A set H having one associa- 
tive binary operation * , is called a semigroup. The notation: is H — (H, x). 

Examples of Semigroups: 

W A: Number domains with respect to addition or multiplication. 

W B: Power sets with respect to union or intersection. 

W C: Matrices with respect to addition or multiplication. 

WB D: The set Α” of all * words” (strings) over an 
semigroup). 


« 


alphabet " A with respect to concatenation (free 


Remark: Except for multiplication of matrices and concatenation of words, all operations in these 
examples are also commutative; in this case one talks about a commutative semigroup. 


5.3.3 Groups 

5.3.3.1 Definition and Basic Properties 

1. Definition, Abelian Group 

A set G with a binary operation x is called a group if 

e x is associative, 

ο x has a neutral element e, and for every element a € G there exists an inverse element a^! such that 
I2alta-e. (5.83) 


A group is a special semigroup. 


axa 


The neutral element of a group is unique, i.e., there exists only one. Furthermore, every element of the 
group has exactly one inverse. If the operation * is commutative, then the group is called an Abelian 
group. If the group operation is written as addition, +, then the neutral element is denoted by 0 and 
the inverse of an element a by —a. 

The number of elements of a finite group is called the order of the group (see 5.3.3.2,3., p. 338). 


Examples of Groups: 
W A: The number of domains (except IN) with respect to addition. 
B B: Q\ {0}, RV {0}, and CV {0} with respect to multiplication. 
B C: Su :={f:M—> MA fbijective} with respect to composition of mappings. This group is called 
symmetric. If M is finite having n elements, then δ, is written instead of Swm. Sn has n! elements. 
The symmetric group S, and its subgroups are called permutation groups. So, the dieder groups D, 
are permutation groups and subgroups of η. 
E D: The set D, of all covering transformations of a regular n-gon in the plane is considered. Here a 
covering transformationis the transition between two symmetric positions of the n-gon, i.e., the moving 
of the n-gon into a superposable position. Denoting by d a rotation by the angle 27/n and by o the 
reflection with respect to an axis, then D,, has 2n elements: 

D, = (ed, d?,..., d", o, do,..., d" lo]. 
With respect to the composition of mappings D, is a group, the dihedral group. Here the equalities 


d" = o? = e and ed = αἲ 1σ hold. 


E E: All the regular matrices (see 4.1.4, p. 272) over the real or complex numbers with respect to 
multiplication. 
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Remark: Matrices have a very important role in applications, especially in representation of linear 
transformations. Linear transformations can be classified by matrix groups. 

2. Group Tables or Cayley’s Tables 

For the representation of finite groups Cayley’s tables or group tables are used: The elements of the 
group are denoted at the row and column headings. The element a x b is the intersection of the row of 
the element a and the column of the element b. 
E If M = {1,2,3}, then the symmetric group Sw is also denoted by S3. $5 consists of all the bijective 
mappings (permutations) of the set {1, 2,3} and consequently it has 3! = 6 elements (see 16.1.1, p. 805). 
Permutations are mostly represented in two rows, where in the first row there are the elements of M 
and under each of them there is its image. So one gets the six elements of S3 as follows: 


...[1 23 [123 ft 23 
ε-(] 2 F n-(i 3 2): m- (3 2 1 
- 28 =f{123 _({1 23 
m= (5 1 3) n- (3 3 D) r= (3 12 


With the successive application of these mappings (binary operations) the following group table is 
obtained for δη: 


(5.84) 


e From the group table it can be seen that the identity per- 


ΠΩ 2 pa Pa P5 E : 
| ΞΕ mutation e is the neutral element of the group. 


€ |E pi P2 P3 Pa ps e In the group table every element appears exactly once in 
Pi | Pi E ps Pa ps po τος} every row and in every column. 

Pa | Pa Pa € Ps Pi Pa ο It is easy to recognize the inverse of any group element in 
Ps Ps Ps Pae Papi the table, i.e., the inverse of p, in 55 is the permutation ps, 
P 4 P 7 P 2 ps Pi ps € because at the intersection of the row of p, with the column 
Ps | Ps Ps Pı P2€ pa of ps is the neutral element ε. 


ο Ifthe group operation is commutative (Abelian group), then the table is symmetric with respect to 
the “main diagonal”; S5 is not commutative, since, e.g., pi © po F pa o pi. 
e The associative property cannot be easily recognized from the table. 


5.3.3.2 Subgroups and Direct Products 


1. Subgroups 
Let G = (G,*) be a group and U C G. If U is also a group with respect to *, then U = (U, κ) is called 
a subgroup of G. 
A non-empty subset U of a group (G, κ) is a subgroup of G if and only if for every a, b € U, the elements 
a * b and α are also in U (subgroup criterion). 
1. Cyclic Subgroups The group G itself and E = {e} are subgroups of G, the so-called trivial 
subgroups. Furthermore, a subgroup corresponds to every element a € G, the so-called cyclic subgroup 
generated by a: 
«αντι 9 550,0 2. (5.86 
If the group operation is addition, then one writes the integer multiple ka as a shorthand notation o 
the k times addition of a with itself instead of the power a^, i.e., as a shorthand notation of the k times 
operation of a by itself, 
<a> = (...,(-2)a, —a,0,a,2a,...). (5.87 
Here < a > is the smallest subgroup of G containing a. If < a > = G holds for an element a of G, 
then G is called cyclic. 
There are infinite cyclic groups, e.g., Z with respect to addition, and finite cyclic groups, e.g., the se 
Zm the residue class modulo m with residue class addition (see 5.4.3, 3., p. 377). 
W if the number of elements of a finite G group is a prime, then G is always cyclic. 
2. Generalization The notion of cyclic groups can be generalized as follows: If M is a non-empty 
subset of a group G, then the subgroup of G whose elements can be written in the form of a produc 
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of finitely many elements of M and their inverses, is denoted by « M >. The subset M is called the 
system of generators of < M >. If M contains only one element, then < M > is cyclic. 

3. Order of a Group, Left and Right Cosets In group theory the number of elements of a finite 
group is denoted by ord G. If the cyclic subgroup < a > generated by one element a is finite, then this 
order is also called the order of the element a, i.e., ord < a >= orda. 

If U is a subgroup of a group (G, *) and a € G, then the subsets 

aU := {ax uļju E U} and Ua:= (u«aju € U} (5.88) 
of G are called left co-sets and right co-sets of U in G. The left or right co-sets form a partition of G, 
respectively (see 5.2.4, 2., p. 334). 

All the left or right co-sets of a subgroup U in a group G have the same number of elements, namely 
ordU. From this it follows that the number of left co-sets is equal to the number of right co-sets. This 
number is called the index of U in G. The Lagrange theorem follows from these facts. 

4. Lagrange Theorem The order of a subgroup is a divisor of the order of the group. 

In general it is difficult to determine all the subgroups of a group. In the case of finite groups the 
Lagrange theorem as a necessary condition for the existence of a subgroup is useful. 

2. Normal Subgroup or Invariant Subgroup 

For a subgroup U, in general, aU is different from Ua (however |aU| = |ζ7α| is valid). If aU = Ua for all 
a € G holds, then U is called a normal subgroup or invariant subgroup of G. These special subgroups 
are the basis of forming factor groups (see 5.3.3.3, 3., p. 339). 

In Abelian groups, obviously, every subgroup is a normal subgroup. 

Examples of Subgroups and Normal Subgroups: 

B A: RV {0}, QV {0} form subgroups of CV {0} with respect to multiplication. 

W B: The even integers form a subgroup of Z with respect to addition. 


η 


W C: Subgroups of 53: According to the Lagrange theorem the group 5» having six elements can have 
subgroups only with two or three elements (besides the trivial subgroups). In fact, the group 53 has 
the following subgroups: E = {£}, ζι = (e, pi}, U2 = {e, po}, Us = {ει pa}, Us = {e, pa, ps}, S3- 

The non-trivial subgroups Ui, Uz, U3, and U are cyclic, since the numbers of their elements are primes. 
But the group $5 is not cyclic. The group δα has only U, as a normal subgroup, except the trivial 
normal subgroups. 

Anyway, every subgroup U of a group G with |U| = |G|/2 is a normal subgroup of G. 

Every symmetric group Sy and their subgroups are called permutation groups. 

W D: Special subgroups of the group GL(n) of all regular matrices of type (n, n) with respect to matrix 
multiplication: 


= 


SL(n) group of all matrices A with determinan 
O(n) group of all orthogonal matrices, 

SO(n) group of all orthogonal matrices with determinant 1. 

The group S L(n) is anormal subgroup of GL(n) (see 5.3.3.3, 3., p. 339) and SO(n) is a normal subgroup 
of O(n). 
E E: As subgroups of all complex matrices of type (n, n) (see 4.1.4, p. 272): 


U(n) group of all unitary matrices, 
SU(n) group of all unitary matrices with determinant 1. 
3. Direct Product 
1. Definition Suppose A and B are groups, whose group operation (e.g., addition or multiplication) 
is denoted by -. In the Cartesian product (see 5.2.2, 4., p. 331) A x B (5.65a) an operation * can be 
introduced in the following way: 

(αι; by) * (a2, b2) = (αι: az, bı : b2). (5.89a) 
A x B becomes a group with this operation and it is called the direct product of A and B. 
(e, e) denotes the unit element of A x B, (a~', b!) is the inverse element of (a, b). 
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For finite groups A, B 

ord (A x B) = ord A. ord B (5.89b) 
holds. The groups A’ := {(a,e)|a € A} and B’ := {(e,b)|b € B) are normal subsets of Ax B isomorphic 
to A and B, respectively. 
The direct product of Abelian groups is again an Abelian group. 
The direct product of two cyclic groups A, B is cyclic if and only if the greatest common divisor of the 
orders of the groups is equal to 1. 
E A: With Z; = (e, a} and Z; = (e, b, b°}, the direct product Zo x Z3 = {(e, e), (e, b), (e, 03). (a, e), (a, 
b), (a, b*)}, is a group isomorphic to Ze (see 5.3.3.3, 2., p. 339) generated by (a, b). 
WB B: On the other hand Z x Z = {(e, e), (e, b), (a, e), (a, ϐ)} is not cyclic. This group has order 4 
and it is also called Klein's four group, and it describes the covering operations of a rectangle. 
2. Fundamental Theorem of Abelian Groups Because the direct product is a construction 
which enables to make “larger” groups from “smaller” groups, the question can be reversed: When is 
it possible to consider a larger group G as a direct product of smaller groups A, B, i.e., when will G be 
isomorphic to A x B? For Abelian groups, there exists the so-called fundamental theorem: 
Every finite Abelian group can be represented as a direct product of cyclic groups with orders of prime 
powers. 


5.3.3.3 Mappings Between Groups 


1. Homomorphism and Isomorphism 
1. Group Homomorphism Between algebraic structures, not arbitrary mappings, but only “struc- 
ture keeping" mappings are considered: 
Let αι = (Οι. κ) and Gy = (60.9) are two groups. A mapping h: G4 — Gp is called a group homo- 
morphism, if for all a, b € (αι holds: 
h(a ϐ) = h(a) o h(b) (“image of product = product of images” ) (5.90) 
Β As an example, consider the multiplication law for determinants (see 4.2.2, 7., p. 279): 
det(AB) = (det A)(det B). (5.91) 
Here on the right-hand side there is the product of non-zero numbers, on the left-hand side there is the 
product of regular matrices. 
If h: G4 — Gp is a group homomorphism, then the set of elements of G4, whose image is the neutral 
element of Gy, is called the kernel of h, and it is denoted by ker h. The kernel of h is a normal subgroup 
of (αι. 
2. Group Isomorphism Ifa group homomorphism / is also bijective, then h is called a group iso- 
morphism, and the groups G4 and G% are called isomorphic to each other (notation: (αι = G3). Then 
ker h = E is valid. 
Isomorphic groups have the same structure, i.e., they differ only by the notation of their elements. 
W The symmetric group 55 and the dihedral group Ds are isomorphic groups of order 6 and describe 
the covering mappings of an equilateral triangle. 
2. Cayley's Theorem 
The Cayley theorem says that every group can be interpreted as a permutation group (see 5.3.3.2, 2., 
p. 338): 
Every group is isomorphic to a permutation group. 
The permutation group P, whose elements are the permutations πρ (g € G) mapping a to G,*g, isa 
subgroup of Sg isomorphic to (G, κ). 
3. Homomorphism Theorem for Groups 
The set of co-sets of a normal subgroup N in a group G is also a group with respect to the operation 
aN obN — abN. (5.92) 
It is called the factor group of G with respect to N, and it is denoted by G/N. 
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The following theorem gives the correspondence between homomorphic images and factor groups of a 
group, because of what it is called the homomorphism theorem for groups: 

A group homomorphism h: ΟἽ — G» defines a normal subgroup of G1, namely ker h = (a € G|h(a) = 
e}. The factor group (αι / ker h is isomorphic to the homomorphic image h(G,) = {h(a)|a € σι}. Con- 
versely, every normal subgroup N of G defines a homomorphic mapping naty: Gi — G/N with 
naty(a) = aN. This mapping naty is called a natural homomorphism. 

E Since the determinant construction det: GL(n) > R (0) is a group homomorphism with ker- 
nel SL(n), SL(n) is a normal subgroup of GL(n) and (according to the homomorphism theorem): 
GL(n)/SL(n) is isomorphic to the multiplicative group IRA (0) of real numbers (for notation see 5.3.3.2, 
2., p. 338). 


5.8.4 Group Representations 
5.3.4.1 Definitions 


1. Representation 
A representation D(G) of the group G is a map (homomorphism) of G onto the group of non-singular 
linear transformations D on an n-dimensional (real or complex) vector space V,,: 

D(G):a— D(a), αες. (5.93) 
The vector space V, is called the representation space; n is the dimension of the representation (see 
also 12.1.3, 2., p. 657). Introducing the basis {e;} (i = 1,2,...,n) in V, every vector x can be written 
as a linear combination of the basis vectors: 


n 
x=) we, x€V,. (5.94) 
il 
The action of the linear transformation D(a), a € G, on x can be defined by the quadratic matrix 
D(a) = (Di(a)) (i,k = 1,2,...,n), which provides the coordinates of the transformed vector x’ 
within the basis {e;}: 
n n 
x’ = D(a)x = So aie, i; = X Dix (a)xg. (5.95) 
j=l k-i 
This transformation may also be considered as a transformation of the basis {e;} — (e): 
n 
€; = e,D(a) = X` Dyila)eg. (5.96) 
k=1 


Thus, every element a of the group is assigned to the representation matriz D = (Dix(a)): 
D(G):a>D=(Dyla)) (i,k =1,2,...,n),a€G. (5.97) 

The representation matrix depends on the choice of basis. 

WB A: Abelian Point Group Cn. A regular polygon (see 3.1.5, p. 138) with n sides has a symmetry 


such that rotating it around an axis, which is perpendicular to the y 

plane of the figure and goes through its center M (Fig.5.9) by an an- 

gle yk = 2zk/n,k —0,1,...,n—1 the resulted polygon is identical 

to the original one (invariance of the system under certain rotations). 

The rotations Πμ(ρµ) form the Abelian group of points Cn. C, isa 

cyclic group (see 5.3.3.2, p. 337), i.e. every element of the group can 

be represented as a power of a single element F4, whose n-th power 

is the unit element e = Ro: x 
C= ο RP} Res (5.98a) 

Let the center of an equilateral triangle (n = 3) be the origin (see 

Fig.5.9), then the angles of rotations and the rotations are in accor- Figure 5.9 


A 
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dance with (5.98b). 


k = 0, qo = 0 or 2m, 
k —1, pı = 20/3, (5.98b) 
k = 2, p2 = 47/3. 


Ryo: A> A,B > B,C >C, 
Rı : A> B,B > C,C > A, (5.98c) 
Ry: ASC,BOA,COB. 


The rotations (5.98c) satisfy the relations 


Ry = È, RR = Εξ = Ro =e. (5.98d) 


They form the cyclic group C3. 
The matrix of rotation (see (3.432), p. 230) 


X _ (cosy —sing 
R(») = Py dd (5.989) 


of a geometric transformation of this triangle (for rotation of this figure in a fixed coordinate system see 
3.5.3.3,3., p. 213) gives the representation of group C3 if y is substituted by the angles given in (5.98b): 


Ῥ(ο) = R(0) = f d , D(R) = R(27/3) = (A By l (5.98f) 
D(R;) = R(47/3) = ( V i3 (5.982) 


The same relations hold for the matrices of this representation given in (5.98f) and (5.98g) as for the 
group elements Ry (5.98d): 


D(R2) = D(R,R)) = D(Rj)D(R)), D(Rj)D(R;) = D(e). (5.98h) 


W B: Dihedral Group Ds. The equilateral triangle is invariant with 
respect to rotations by angle π about its bisectors (see Fig.5.10). 
These rotations correspond to reflections S4, Sp, Sc with respect to 
a plane being perpendicular to the plane of the triangle and containing 
one of the rotation axes. 

S4: Rotations A => A, B => C,C— B; 

Sp: Rotations A > C, B > B,C > A; 5.99a) 

So: Rotations A > B, B > A,C > C. 
For the reflections there is: 

S,9,—e (c—A,B,C). 5.99b) 


Figure 5.10 


The product SS- (σ # τ) results in one of the rotations R1, Ro, e.g. using S4 Sp for the triangle 
AABC: 


Sa Sp(AABC) = SA(ACBA) = ACAB = ΒΙ(ΔΑΒΟ). (5.99c) 


consequently S4 Sg = Γι. Here S4, Sp, Sc correspond to the outcomes on Fig.5.10. 


342 ο. Algebra and Discrete Mathematics 


. . | e fü Ry Sa Sp Sc 

The cyclic group C3 and the reflections δα, Sg, Sc together ele Ri Ro 84 Sp Sc 
form the dihedral group Ds. The reflections do not forma — mm Rə e Se Sa Sp 
subgroup because of (5.99c). A summary of relations is rep- RaRo e Γι Sp So S4 (5.99d) 
resented in group-table (5.99d). SalS4 Sg Sc e Ri Rs 
Only the signs of the z-coordinates of points B and C are Sp|Sp So S4 Ro e Ri 
changed at reflection S4 (see Fig.5.9). This coordinate — §./So S, Sp R, Ro e 
transformation is given by the matrix 

D(S4)— ( "d i : (5.990) 
The matrices representing reflections $5 and Sc can be found in the group-table (5.994) and from the 
matrices of representation in (5.98f) and (5.98g) 


D(Sp) = D(Ra)D(S4) = (Ap d e" n 25 (5990 


D(Se) = D(RIDISa)= (73/2 V2) (o 3) - (Cio ue] (5.999) 


Matrices (5.98f) and (5.98g) together with matrices (5.99f) and (5.99g) form a representation of the 
dihedral group D3. 
2. Faithful Representation 
A representation is called faithful if G — D(G) is an isomorphism, i.e., the assignment of the element 
of the group to the representation matrix is a one-to-one mapping. 
3. Properties of the Representations 
A representation with the representation matrices D(a) has the following properties (a,b € G,I unit 
matrix): 

D(a*b) = D(a)-D(b), D(a!)-D (a, D(e) =I. (5.100) 
5.3.4.2 Particular Representations 


1. Identity Representation 
Any group G has a trivial one-dimensional representation (identity representation), for which every 
element of the group is mapped to the unit matrix I: a > I for all a € G. 
2. Adjoint Representation 
The representation D*(G) is called adjoint to D(G) if the corresponding representation matrices are 
related by complex conjugation and reflection in the main diagonal: 

D*(G) = D*(G). (5.101) 
3. Unitary Representation 
For a unitary representation all representation matrices are unitary matrices: 

D(G)-D*(G) =I, (5.102) 
where E is the unit matrix. 
4. Equivalent Representations 
Two representations D(G) and D'(G) are called equivalent if for each element a of the group the cor- 
responding representation matrices are related by the same similarity transformation with the non- 
singular matrix T = (Τι): 


D'(a) - T !.D(a). T, Dj(a)-— s T5! - Dala) + Tir, (5.103) 


jl=1 
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where Τη denotes the elements of the inverse matrix T^! of T. If such a relation does not hold two 


representations are called non-equivalent. The transition from D(G) to D'(G) corresponds to the trans- 
formation Τ᾽: (e1,e5,...,e,) — {e},e,...,e,} of the basis in the representation space V: 


e-—eT, & = S Tye, (i =1,2,...,n). (5.104) 


Any representation of a finite group is equivalent to a unitary representation. 
5. Character of a Group Element 


In the representation D(G) the character x(a) of the group element a is defined as the trace of the 
representation matrix D(a) (sum of the main diagonal elements of the matrix): 


x(a) = Tr(D) = Y D.) (a) (5.105) 


The character of the unit T e is given by the dimension n of the representation: x(e) = n. Since 
the trace of a matrix is invariant under similarity transformations, the group element a has the same 
character for equivalent representations. 
W Within the shell model of atomic or nuclear physics two out of three particles with space coordinates 
T; (i = 1,2,3) can be described by the wave function Φα(τ) while the third particle has the wave 
function vg(r) (configuration a?8()). The wave function i of the system is a product of the three one- 
particle wave functions: v = YaYays. In accordance with the possible distributions of the particles 
1,2,3 to the wave functions one gets the three functions 

Vi = Φα(Τι)φα(ζα)φρ(τη) Vo = PalFi) Pa (F2) Pa(T3) , V3 = Ya(Fi)PalF2)Pa(Fs) , (5.106a) 
which, when realizing permutations, transform among one another according to 5.3.3.1, 2., p. 337. This 
way one gets for the functions 2v a three dimensional representation of the symmetric group $5. 
According to (5.93) the matrix elements of the representation matrices can be found by investigating the 
action of the group elements (5.84) on the coordinate subscripts in the basis elements e;. For example: 

pii = piga(Yi)ea 2)eg (Ya) = eo(ri)ea(2)po (rs) = Doi(pi)s, 

pii» = Ῥιρα(Τι)φρ(τ»}ρα(Τα) = ea(ri)es(12)ea(ra) = Dis(pi)i, 

pits = pyga(ri)eo(£2)ga(Ya) = Φβ(Τι)φα((2}φΦα(Τα) = Ῥοα(ρι)ψα. (5.106b) 
Altogether one finds: 


100 0 
D(e) - (010), D(pi) -(: 
001 0 

100 0 

001 0 

1 


D(p3) -( ) D(pi) -( 
010 


For the characters one has: 
x(e) = 3, χίρι) = x(p2) = x(ps) = 1, x(pa) = x(ps) = 0. (5.106d) 


5.3.4.3. Direct Sum of Representations 


01 
ni 

23 (5.106c) 
00 

10 


The representations D? (G), DO (G) of dimension n; and n» can be composed to create a new repre- 
sentation D(G) of dimension n = n, + n? by forming the direct sum of the representation matrices: 


: D(a) ο 
a) c (2) 
D(a) = DY (a) 6 D” (a) = ( 0 Dla) (5.107) 
The block-diagonal form of the representation matrix implies that the representation space V,, is the 
direct sum of two invariant subspaces γι V: 


Van = Van Θ γης, n = ni + na. (5.108) 
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A subspace Vm (m < n) of V, is called an invariant subspace if for any linear transformation D(a), a € 
G, every vector x € Vm is mapped onto an element of Vm again: 
x —D(a)x with x,x € Vn. (5.109) 
The character of the representation (5.107) is the sum of the characters of the single representations: 
x(a) = x” (a) + x9 (a). (5.110) 


5.3.4.4 Direct Product of Representations 
If e; (i= 1,2,...,ni) and εἰ (k = 1,2,...,n2) are the basis vectors of the representation spaces Vn, 
and V,,,, respectively, then the tensor product 

e&,— (ee) (i212,.., m; k=1,2,...,n2) (5.111 


forms a basis in the product space Vn, Ὁ Vn, of dimension n, : na. With the representations DO(G 


and D (G) in Vn, and V,,, respectively an n; * n;-dimensional representation D(G) in the produc 
space can be constructed by forming the direct or (inner) Kronecker product (see 4.1.5,9., p. 276) o 
the representation matrices: 
D 1 2) 
D(G) = DÓ(G)$ DG), (D@)au = Di (a) DP (a) 

with ik-12,..,m; 1 1,2... πο. (5.112 
The character of the Kronecker product of two representations is equal to the product of the characters 
of the factors 


x) (a) = x(a) - x(a). (5.113 


5.3.4.5 Reducible and Irreducible Representations 
If the representation space V,, possesses a subspace Vm (m < n) invariant under the group operations 
the representation matrices can be decomposed according to 
Di(a A (m TOWS 

0 D»(a)/] { n— m rows 
by a suitable transformation T of the basis in V4. Di(a) and D(a) themselves are matrix representa- 
tions of a € G of dimension m and n — m, respectively. 


T- D(a). T= ( (5.114) 


A representation D(G) is called irreducible if there is no proper (non-trivial) invariant subspace in Υπ. 
The number of non-equivalent irreducible representations of a finite group is finite. If a transformation 
T of a basis can be found which makes V,, to a direct sum of invariant subspaces, i.e., 


Vna = V19:- Vs, (5.115) 
then for every a € G the representation matrix D(a) can be transformed into the block-diagonal form 
(A = 0 in (5.114)): 

D(a) 0 
T?. D(a): T =D (a) Φ... BD) (a) = Ἂς : (5.116) 
0 D(a) 


by a similarity transformation with T. Such a representation is called completely reducible. 


Remark: For the application of group theory in natural sciences a fundamental task consists in the 
classification of all non-equivalent irreducible representations of a given group. 
W The representation of the symmetric group S3 given in (5.106c), p. 343, is reducible. For example, 
in the basis transformation {6ι, 60.61) —> (e; =e, +e,+e3, e; =e, e} = es} one obtains for the 
representation matrix of the permutation ps (with V; = ej, Y2 = es, Y3 = e): 


100 


— = Di(p3) 0 
Dips) = (27) = (C pa) ΕΞ 
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with A = (9) , Di(p3) = 1 as the identity representation of $5 and Do(p3) = (1 Ji : 
5.3.4.6 Schur's Lemma 1 


If C is an operator commuting with all transformations of an irreducible representation D of a group 
(C. D(a)| = C- D(a) - D(a): C 20, a € G, and the representation space V, is an invariant subspace 
of C, then C is a multiple of the unit operator, i.e., a matrix (cip) which commutates with all matrices 
of an irreducible representation is a multiple of the matrix I, C = à- L, A € C. 


5.3.4.7 Clebsch-Gordan Series 


In general, the Kronecker product of two irreducible representations D (G), D) (G) is reducible. By 


a suitable basis transformation in the product space D® (G) Ὁ D? (G) can be decomposed into the 


direct sum of its irreducible parts D? (o = 1,2,...,n) (Clebsch-Gordan theorem). This expansion 
is called the Clebsch-Gordan series: 
D(a) ὦ DO Sr d D(9 (a). (5.118) 
a=1 


Here, Ma is the multiplicity with which the irreducible representation D?) (G) occurs in the Clebsch- 
Gordan series. 
The matrix elements of the basis transformation in the product space causing the reduction of the 
Kronecker product into its irreducible components are called Clebsch-Gordan coefficients. 


5.3.4.8 Irreducible Representations of the Symmetric Group Sm 
1. Symmetric Group Sm 
The non-equivalent irreducible representations of the symmetric group Sm are characterized uniquely 
by the partitions of M, i.e., by the splitting of M into integers according to 
[A] = [An àz A], Art Ate + An = M, ALS Ag S++ SA 20. (5.119) 


The graphic representation of the partitions is done by arranging boxes in Young diagrams. 


W For the group S4 one obtains five Young diagrams a 4 
as shown in the figure. [3,1] [22] [2.1.1] [1] 
The dimension of the representation [λ] is given by H 
IL (X-A +j- i) H 
nl = MISES 5.120 
n ΠΕ (A +k — i)! (5-120) 


The Young diagram [A] conjugated to [A] is constructed by the interchange of rows and columns. 
In general, the irreducible representation of Sm is reducible if one restricts to one of the subgroups 
Syu-i SM-2; t+ 
W In quantum mechanics for a system of identical particles the Pauli principle demands the construc- 
tion of many-body wave functions that are antisymmetric with respect to the interchange of all coordi- 
nates of two arbitrary particles. Often, the wave function is given as the product of a function in space 
coordinates and a function in spin variables. If for such a case due to particle permutations the spatial 
part of the wave function transforms according to the irreducible representation [A] of the symmetric 


group, then it has to be combined with a spin function transforming according to [A] in order to get a 
total wave function which is antisymmetric if two particles are interchanged. 


5.3.5 Applications of Groups 


In chemistry and in physics, groups are applied to describe the "symmetry" of the corresponding 
objects. Such objects are, for instance, molecules, crystals, solid structures or quantum mechanical 
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systems. The basic idea of these applications is the von Neumann principle: 
If a system has a certain group of symmetry operations, then every physical observational quantity of 
this system must have the same symmetry. 


5.3.5.1 Symmetry Operations, Symmetry Elements 


A symmetry operation s of a space object is a mapping of the space into itself such that the length of 
line segments remains unchanged and the object goes into a covering position to itself. The set of fixed 
points of the symmetry operation s is denoted by Fix s, i.e., the set of all points of space which remain 
unchanged for s. The set Fix s is called the symmetry element of s. The Schoenflies symbolism is used 
to denote the symmetry operation. 
Two types of symmetry operations are distinguished: Operations without a fixed point and operations 
with at least one fixed point. 


1. Symmetry Operations without a Fixed Point, for which no point of the space stays un- 
changed, cannot occur for bounded space objects, but now only such objects are considered. A sym- 
metry operation without a fixed point is for instance a parallel translation. 

2. Symmetry Operations with at least One Fixed Point are for instance rotations and reflec- 
tions. The following operations belong to them. 


a) Rotations Around an Axis by an Angle y: The axis of rotation and also the rotation itself is 
denoted by C, for y = 27 /n. The axis of rotation is then called of n-th order. 


b) Reflection with Respect to a Plane: Both the plane of reflection and the reflection itself are 
denoted by c. If additionally there is a principal rotation axis, then one draws it perpendicularly and 
denote the planes of reflections which are perpendicular to this axis by σι, (h from horizontal) and the 
planes of reflections passing through the rotational axis are denoted by σι (v from vertical) or og (d 
means dihedral, if certain angles are halved). 

c) Improper Orthogonal Mappings: An operation such that after a rotation Ον a reflection σι, 
follows, is called an improper orthogonal mapping and it is denoted by S,. Rotation and reflection 
commute. The axis of rotation is then called an improper rotational axis of n-th order and it is also 
denoted by Sn. This axis is called the corresponding symmetry element, although only the symme- 
try center stays fixed under the application of the operation S,. For n = 2, an improper orthogonal 
mapping is also called a point reflection or inversion (see 4.3.5.1, p. 287) and it is denoted by i. 


5.3.5.2 Symmetry Groups or Point Groups 
For every symmetry operation S, there is an inverse operation S7}, which reverses S “back”, i.e., 
SSt=S'S =e. (5.121) 
Here e denotes the identity operation, which leaves the whole space unchanged. The family of symmetry 
operations of a space object forms a group with respect to the successive application, which is in general 
a non-commutative symmetry group of the objects. The following relations hold: 
a) Every rotation is the product of two reflections. The intersection line of the two reflection planes is 
the rotation axis. 
b) For two reflections σ and σ’ 
σσ' = σσ (5.122) 
if and only if the corresponding reflection planes are identical or they are perpendicular to each other. 
In the first case the product is the identity e, in the second one the rotation Co. 
c) The product of two rotations with intersecting rotational axes is again a rotation whose axis goes 
through the intersection point of the given rotational axes. 
d) For two rotations C» and C5 around the same axis or around axes perpendicular to each other: 


CC} = Cy. (5.123) 
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The product is again a rotation. In the first case the corresponding rotational axis is the given one, in 
the second one the rotational axis is perpendicular to the given ones. 


5.3.5.3 Symmetry Operations with Molecules 

It requires a lot of work to recognize every symmetry element of an object. In the literature, for instance 
in [5.10], [5.13], it is discussed in detail how to find the symmetry groups of molecules if all the symmetry 
elements are known. The following notation is used for the interpretation of a molecule in space: The 
symbols above C in Fig. 5.11 mean that the OH group lies above the plane of the drawing, the symbol 
to the right-hand side of C means that the group OC;H; is under C. 

The determination of the symmetry group can be made by the following method. 

1. No Rotational Axis 

a) If no symmetry element exists, then G = {e} holds, i.e., the molecule does not have any symmetry 
operation but the identity e. 

Β The molecule hemiacetal (Fig.5.11) is not planar and it has four different atom groups. 

b) If ø is a reflection or i is an inversion, then G = {ε,σ} =: C, or G = (e, i) = C; hold, and with this 
it is isomorphic to Za. 

W The molecule of tartaric acid (Fig.5.12) can be reflected in the center P (inversion). 


H 
OH 
Y HO —— Cllr: COE 
P 
CH, — = Clb Οἱ 

i | | CO, H= C —— OH 

H H 

Figure 5.11 Figure 5.12 Figure 5.13 


2. 'There is Exactly One Rotational Axis C 

a) If the rotation can have any angle, i.e., C = Cx, then the molecule is linear, and the symmetry 
group is infinite. 

ΒΑ: For the molecule of sodium chloride (common salt) NaCl there is no horizontal reflection. The 
corresponding symmetry group of all the rotations around C is denoted by Cov- 

WB B: The molecule Ου has one horizontal reflection. The corresponding symmetry group is generated 
by the rotations and by this reflection, and it is denoted by Don- 

b) The rotation axis is of n-th order, C — C, , but it is not an improper rotational axis of order 2n. 

If there is no further symmetry element, then G is generated by a rotation d by an angle m/n around 
Chn, i.e., G =< d > Zn. In this case G is also denoted by Ch. 

If there is a further vertical reflection σι, then G = < d, oy >= D,, holds (see 5.3.3.1, p. 336), and G is 
denoted by Cnv. 

If there exists an additional horizontal reflection op, then G = < d, oy » € Zn X Z» holds. G is denoted 
by Cnn and it is cyclic for odd n (see 5.3.3.2, p. 337). 

W A: For hydrogen peroxide (Fig.5.13) these three cases occur in the order given above for 0 < ὃ < 
1/2,ó = 0 and 6 = 7/2. 

W B: The molecule of water H2O has a rotational axis of second order and a vertical plane of reflection, 
as symmetry elements. Consequently, the symmetry group of water is isomorphic to the group Ds», 
which is isomorphic to the Klein four-group V4 (see 5.3.3.2, 3., p. 338). 

c) The rotational axis is of order n and at the same time it is also an improper rotational axis of order 
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2n. We have to distinguish two cases. 

a) There is no further vertical reflection, so G = Zən holds, and G is denoted also by Sən. 
W An example is the molecule of tetrahydroxy allene with formula C3(OH), (Fig.5.14). 
B) If there is a vertical reflection, then G is a group of order 4n, which is denoted by Dən. 


WB τι = 2 gives G = Dj, i.e., the dihedral group of order eight. An example is the allene molecule 
(Fig.5.15). 


- C 


Figure 5.14 Figure 5.15 Figure 5.16 


3. Several Rotational Axes If there are several rotational axes, then one has to distinguish further 
cases. In particular, if several rotational axes have an order n > 3, then the following groups are the 
corresponding symmetry groups. 

a) Tetrahedral group Τα: Isomorphic to S4, ordTy = 24. 

b) Octahedral group Ομ: Isomorphic to S4 x Za, ordO;, = 48. 

c) Icosahedral group Τη: ordI; = 120. 

These groups are the symmetry groups of the regular polyhedron discussed in 3.3.3, Table 3.7, p. 155, 
(Fig.3.63). 

W The methane molecule (Fig.5.16) has the tetrahedral group T; as a symmetry group. 


5.3.5.4 Symmetry Groups in Crystallography 


1. Lattice Structures 

In crystallography the parallelepiped represents, independently of 
the arrangement of specific atoms or ions, the elementary (unit) cell 
of the crystal lattice. It is determined by three non-coplanar ba- 
sis vectors a; starting from one lattice point (Fig. 5.17). The infi- 
nite geometric lattice structure is created by performing all primitive 


translations tn: 


t, = ma + nã + 383, n = (πα, Πο ng) n; € Z. (5.124) 


Figure 5.17 


Here, the coefficients n; (i = 1,2,...) are integers. 


All the translations t, fixing the space points of the lattice L = {t,} in terms of lattice vectors form 
the translation group T with the group element T(t,,), the inverse element T~!(t,,) = T(—t,), and 
the composition law T (t5) * T(tm) = T(t, + tm). The application of the group element T(t,) to the 
position vector r is described by: 


T(E) — F + t. (5.125) 
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2. Bravais Lattices 


Taking into account the possible combinations of the relative lengths of the basis vectors aj and the pair- 
wise related angles between them (particularly angles 90° and 120°) one obtains seven different types 
of elementary cells with the corresponding lattices, the Bravais lattices (see Fig. 5.17, and Table 5.4). 
This classification can be extended by seven non-primitive elementary cells and their corresponding 
lattices by adding additional lattice points at the intersection points of the face or body diagonals, pre- 
serving the symmetry of the elementary cell. In this way one may distinguish one-side face-centered 
lattices, body-centered lattices, and all-face centered lattices. 


Tabelle 5.4 Primitive Bravais lattice 


Elementary cell | Relative lengths | Angles between 
of basis vectors | basis vectors 


triclinic a, É 5 Faz a x B X390? 
monoclinic αι Fay Faz a= 7=90° 728 
rhombic a, É αυ Faz α-β-η-90 
trigonal ay = à3 = a3 a = B = y < 120°(F 90°) 
hexagonal ài = à» # ag a = B = 90°, y = 120° 
tetragonal ài = à» σαι a = 8 = y = 90° 

cubic a, = ag = a3 a = 8 = y = 90° 


3. Symmetry Operations in Crystal Lattice Structures 

Among the symmetry operations transforming the space lattice to equivalent positions there are point 
group operations such as certain rotations, improper rotations, and reflections in planes or points. But 
not all point groups are also crystallographic point groups. The requirement that the application of a 


group element to a lattice vector ty leads to a lattice vector t% € L (L is the set of all lattice points) 
again restricts the allowed point groups P with the group elements P(R) according to: 


P={R: Rt E L}, ΕΙ. (5.126) 
Here, R denotes a proper (R € SO(3)) or improper rotation operator (R = IR’ ε Ο(3). R' € 
SO(3), I is the inversion operator with /r = —r, r isa position vector). For example, only n-fold 


rotation axes with n = 1,2,3,4 or 6 are compatible with a lattice structure. Altogether, there are 32 
crystallographic point groups P. 

The symmetry group of a space lattice may also contain operators representing simultaneous applica- 
tions of rotations and primitive translations. In this way one gets gliding reflections, i.e., reflections in 
a plane and translations parallel to the plane, and screws, i.e., rotations through 27 /n and translations 
by ma/n (m = 1,2,...,n — 1, & are basis translations). Such operations are called non-primitive 
translations V (R), because they correspond to “fractional” translations. For a gliding reflection R is 
a reflection and for a screw R is a proper rotation. 

The elements of the space group G, for which the crystal lattice is invariant are composed of elements 
P of the crystallographic point group P, primitive translations T (t) and non-primitive translations 


V(R): 
G= {{R|V(R)+ta: Re P, t, e L)). (5.127) 


The unit element of the space group is {e|0} where e is the unit element of R. The element {εἰζη} 
means a primitive translation, {R|0} represents a rotation or reflection. Applying the group element 
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{ΠΕ} to the position vector T one obtains: 


4. 


From the 14 Bravais lattices, L = { tn} , the 32 crystallographic point groups P = {R} and the allowed 


non- 
grow 


are not a subgroup of another point group but contain further point groups as a subgroup. Each ο 
these seven point groups form a crystal system (holohedry). The symmetry of the seven crystal systems 
is reflected in the symmetry of the seven Bravais lattices. The relation of the 32 crystallographic classes 
to the seven crystal systems is given in Table 5.5 using the notation of Schoenflies. 

Remark: The space group G (5.127) is the symmetry group of the “empty” lattice. The real crysta 
is obtained by arranging certain atoms or ions at the lattice sites. The arrangement of these crysta 


cons 


{Rt,}F = ΠΕ + ty. (5.128 
Crystal Systems (Holohedry) 


primitive translations V(R) one can construct 230 space groups G = (R[V(R) + ta}. The poin 
ps correspond to 32 crystallographic classes. Among the point groups there are seven groups that 


ituents exhibits its own symmetry. Therefore, the symmetry group Go of the real crystal possesses 


a lower symmetry than G (G D Go), in general. 


Table 5.5 Bravais lattice, crystal systems, and crystallographic classes 


Notation: Cn — rotation about an n-fold rotation axis, D,, — dihedral group, T, — tetrahedral group, 


5.3. 


On — octahedral group, Sn — mirror rotations with an n-fold axis. 


Lattice type | Crystal system | Crystallographic class 
(holohedry) 

triclinic C; Ci, C; 

monoclinic Con C2, Ch, Con 

rhombic Don Cw, D», Don 

tetragonal Dan Ca, S4, Can, Da, Cav, Doa, Dan 

hexagonal Den Cs; Can; Con; Do, Cov; Dan, Den 

trigonal Dza C3, S6, Ds; Cav, Daa 

cubic On T, Th, Τα, O, On 


5.5 Symmetry Groups in Quantum Mechanics 


Linear coordinate transformations that leave the Hamiltonian H of a quantum mechanical system (see 


9.2.4 


The commutation property of g and É implies that in the application of the product of the operators 
g and H to a state φ the sequence of the action of the operators is arbitrary: 


acy rn 


g(Ho) = Η(σΦ). (5.130 
Hence, one has: If gg, (o = 1,2,...,n) are the eigenstates of H with energy eigenvalue E of degener- 
1, le., 
Hog, = Evra (o — 1,2,...,n); (5.131 
the transformed states gY p, are also eigenstates belonging to the same eigenvalue E: 


then 


, 2., p. 593) invariant represent a symmetry group G, whose elements g commute with H: 


(9. A] 2 gH - Hg -0, geG. (5.129 


gH ona E Hora = Egy. (5.132 
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The transformed states gogo can be written as a linear combination of the eigenstates «ga: 


n 
σφεα = ? , Ῥρα(σ)φερ- (5.133) 
8-1 
Hence, the eigenstates Yga form the basis of an n-dimensional representation space for the representa- 
tion D(G) of the symmetry group G of the Hamiltonian H with the representation matrices (Dag(g)) . 
This representation is irreducible if there are no “hidden” symmetries. One can state that the energy 
eigenstates of a quantum mechanical system can be labeled by the signatures of the irreducible repre- 
sentations of the symmetry group of the Hamiltonian. 


Thus, the representation theory of groups allows for qualitative statements on such patterns of the en- 
ergy spectrum of a quantum mechanical system which are established by the outer or inner symmetries 
of the system only. Also the splitting of degenerate energy levels under the influence of a perturbation 
which breaks the symmetry or the selection rules for the matrix elements of transitions between energy 
igenstates follows from the investigation of representations according to which the participating states 
and operators transform under group operations. 


[e] 


The application of group theory in quantum mechanics is presented extensively in the literature (see, 
e.g., [5.6], [5.7], [5.8], [5.10], [5.11]). 
5.3.5.6 Further Applications of Group Theory in Physics 


Further examples of the application of particular continuous groups in physics can only be mentioned 
here (see, e.g., [5.6], [5.10]). 
U(1): Gauge transformations in electrodynamics. 


SU (2): Spin and isospin multiplets in particle physics. 

SU (3): Classification of the baryons and mesons in particle physics. Many-body problem in nuclear 
physics. 

SO(3): Angular momentum algebra in quantum mechanics. Atomic and nuclear many-body problems. 
SO(4): Degeneracy of the hydrogen spectrum. 

SU (4): Wigner super-multiplets in the nuclear shell model due to the unification of spin and isospin 


degrees of freedom. Description of flavor multiplets in the quark model including the charm degree of 
freedom. 


SU(6): Multiplets in the quark model due to the combination of flavor and spin degrees of freedom. 
Nuclear structure models. 


U (n): Shell models in atomic and nuclear physics. 

SU (n), SO(n): Many-body problems in nuclear physics. 

SU (2) & U(1): Standard model of the electro weak interaction. 

SU(5) D SU(3) & SU(2) ® U(1): Unification of fundamental interactions (GUT). 

Remark: The groups SU (n) and SO(n) are Lie groups, i.e. continuous groups (see, 5.3.6, p. 351 and 
e.g., [5.6]). 

5.3.6 Lie Groups and Lie Algebras 


5.3.6.1 Introduction 

Lie groups and Lie algebras are named after the Norwegian mathematician Sophus Lie (1842-1899). In 
this chapter only Lie groups of matrices are considered since they are most important in applications. 
Main examples of matrix-Lie groups are: 


— 


e the group O(n) of orthogonal matrices, 


ο the subgroup SO(n) of orthogonal matrices of determinants +1, i.e. the orthogonal matrices de- 
scribing rotations in R”, 
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e the Euclidean group S E(n), which describes rigid-body motions. 

These groups have many applications in computer graphics and in robotics. 

'The most important relation between a Lie group and the corresponding Lie algebra will be described 

by the exponential mapping. This relation is explained by the following example. 

W The solution of initial value problems of first order differential equations or of a system of differential 

equations can be determined with the help of the exponential function. 

The initial value problem (5.134a) for y = y(t) has the following solution (5.134b): 
dy 
dt 


Similarly, for the system of first order differential equations with unknown vector y — y(t) and with 
the constant coefficient matrix X the initial value problem (5.135a) 


=xy (rconst) with y(0) = yo, (5.134a) y(t) = ety, (5.134b) 


dy ly, dy: lyn V 

F = (= a Jis τ ) — Xy (matrix X const) with y(0) = Yo, (5.135a) 
has the solution (5.135b) with the matrix-exponential function εἰ 

S s = m 

¥(t)=e'¥o, d d si = Inxn + σι at kyk, (5.135b) 


The special matrix-exponential function e’* for a given quadratic n x n matrix X has the following 
properties: 


X — | 


ee nxn Where Inxn denotes the unit matrix. 


ο eX is invertible, because det eX = eSpuX 0), 


ο chiX eX — ει ΣΧ — et2Xet1X for every tı, ta € IR, but in general is εἴ οκ: A eX2eX1 A eX1tX2 


-tX eX _ etXe-tX 


ο In particular e = laxa. 


* ee =X. 


t=0 
t=0 


d 
e 
[η 


Consequently, the elements e'* (for a fixed X) form a multiplicative group with respect to matrix 
multiplication. Since t € R, the matrices e^* form a one dimensional group. At the same time it is one 
of the simplest examples of Lie groups. It will be shown that matrices X and tX are elements of the Lie 
algebra belonging to this Lie group (see 5.3.6.4, p. 356). In this way the exponential function generates 
the Lie group from the elements of the Lie algebra. 
5.3.6.2 Matrix-Lie Groups 
For matrix-Lie groups it is not necessary to define Lie groups in general. For general Lie groups there 
should be introduced the notion of differentiable manifolds, which is not needed here. For matrix-Lie 
groups the following definitions are important, while in further discussions the main topic will be the 
general linear group. 
1. General Linear Group 
1. Group A group (see 5.3.3, p. 336) is a set G with a map 

GxGoG, (g,h)egxh, (5.136a) 
which is the so called group operation or group multiplication with the following properties: 
e Associativity: for every g, h, k € G 

g*(h»xk)=(g*h)*k, (5.136b) 
e Existence of identity: There is an element e € G , such that for every g € G 

gxe=exg=q, (5.136c) 
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e Existence of an inverse: For every g € G there is an element h € G such that 

gxh=hxg=e. (5.136d) 
Remark 1: If g * h = h * g for everyg,h € G, then the group is called commutative. The matrix 
groups considered here are not commutative. It follows obviously from the definition, that the product 
of two elements of the group also belongs to the group, so the group is closed with respect to group 
multiplication. 


Remark 2: Let M, (IR) the vector space of all n x n matrices with real entries. M, (IR) is obviously 
not a group with respect to matrix multiplication, since not every n x n matrix is invertible. 

2. Definition of the General Linear Group The set of all real, invertible, n x n matrices, which 
obviously form a group with respect to matrix multiplication, is called the general linear group and is 


denoted by GL(n, R). 


2. Matrix-Lie Groups 


1. Convergence of Matrices A sequence {A,,}°_, of matrices Am = (ats a where Am € 


M, (R) converges to the n x n matrix A, if every sequence of entries {(α{}}}}55 | converges to the 


corresponding matrix entry ap in the sense of convergence of real numbers. 
2. Definition of the Matrix-Lie Groups A matrix-Lie group is a subgroup G of GL(n, R) with 
the property: Let {Am}; be an arbitrary sequence of matrices from G converging to a matrix A € 
M, (R) in the sense of convergence in M, (IR). Then either A € G or A is not invertible. 
This definition can be also formulated in the following way: A matrix-Lie group is a subgroup which is 
also a closed subset of G.L(n , R). (It does not mean, that G must be closed in M,(R)). 
3. Dimension of the Matrix-Lie Group The dimension of a matrix-Lie group is defined as the 
dimension of the corresponding Lie algebra (see 5.3.6.4, p. 356). The matrix-Lie group GL(n , R) has 
dimension n?. 
3. Continuous Groups 
Matrix-Lie groups can be introduced also with the help of continuous groups (see [22.22], [5.9], [5.7]). 
1. Definition A continuous group is a special infinite group whose elements are given uniquely by 
a continuous parameter vector Y = (Y1, φον... Pn): 

a — a(q). (5.137) 


E Group of rotation matrices in IR? (see (3.432), p. 230): 


_ [cosy —siny) _ à » É 
il dre hend =a(y) mit 0€ o € 2n. (5.138) 
The group elements depend only on one real parameter «ο. 
2. Product The product of two elements a, = α(φῃ), a» = a(p,) ofa continuous group with elements 
a = α(ᾳ) is given by 
αι * a2 = a3 = α(φα) with (5.139a) 
φα =f(¥1,%2), (5.139b) 
where the components of f (p1, ρα) are continuously differentiable functions. 
W The product of two rotation matrices a = α(φι) and a = α(φο) with 0 € φι,φο < 2m (a(y) as in 
(5.138), is a3 = α(φι) κ α(φο) = a(qs) with es = f(v1.92) = οι + p2. Using the Falk’s scheme (see 
4.1.4, 5., p. 273) and addition theorems one gets: 
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| α(φο) 
α(ρι)[α(φα) = a(ei + v) 


or detailed 


COS (Q3 — sin ο 

sin p2 COS Yo 
cosy; — sin yı | COS p1 COS Y2 — Sin yı SİN Y2 — COS Y1 SİN Y2 — sing, cosq» ` 
sin qi cos 91 | sin %1 COS p2 + COSY) sin y2 — — sin q Sin Y2 + cos Y1 COS Yo 


3. Dimension The parameter vectors y are elements of a vector space which is called parameter 
space. In this parameter space there is a domain which is given as the domain of the continuous group, 
and it is called the group space. The dimension of this group space is considered as the dimension of 
the continuous group. 

Wl A: The group of the real quadratic n x n invertible matrices has the dimension n?, since every entry 
can be considered as a parameter. 

WB B: The group of the rotation matrices (with respect to matrix multiplication) D in (5.138) has 
dimension 1. The rotation matrices are of type 2x2, but their four entries depend only on one parameter 
p (0< e € 2n). 

4. Lie Groups 


1. Definition of the Lie Group A Lie group is a continuous group where all elements of the group 
are given as continuous functions of the parameters. 


2. Special Matrix-Lie Groups and their Dimension 
B A Group SO(n) of Rotations R: The group SO(n) of rotations R acts on the elements X c R” 
with matrix multiplication as X' = RX € R”. SO(n) is an n(n — 1)/2-dimensional Lie group. 
E B Special Euclidean Group S E(n) : The special Euclidian group S E(n) consists of elements 
g — (R, b) with R € SO(n) and b € R” and with group multiplication gı o g2 = (R4 Re, Rib; + bi). 
It acts on the elements of Euclidean spaces IR" as 
X'— RX« b. (5.140) 
S E(n) is the group of rigid-body motions of n-dimensional Euclidean space, it is an n(n+1) /2-dimensional 
Lie group. Discrete subgroups of S E(n) are e.g. the crystallographic space groups, i.e. the symmetry 
group of a regular crystal-lattice. 
B C Scaled Euclidean Group SIM (n): The scaled Euclidian group SIM (n) consists of all pairs 
(eR, b) with a € R, R € SO(n), be R”, with group multiplication gı o go = (e^ * ^R, Ro, Ribs + 
B1). It acts on the elements of R” by translation, rotation and dilatation (=stretching or shrinking): 
X’=cRX+b. (5.141) 
The scaled Euclidean group has the dimension 1 + n(n + 1)/2. 
E D Real Special Linear Group SL (n , IR): The real special linear group consists of all (real) n x n 
matrices with determinant 4-1. It acts on the elements of R" with X' — Lx by rotation, distortion and 
shearing so that the volume remains the same and parallel lines remain parallel. The dimension is 
2 
u^ 
B E Special Affine Group: The special affine groups of R”, which consists of all pairs (e^ L, b) 
with L € SL(n) and bem , acts on the objects in IR" as rotation, translation, shearing, distortion 
and dilatation. This Lie group is the most general group of deformations in Euclidean spaces mapping 
parallel lines into parallel lines; it has dimension n(n 4 1). 
E F Group SO(2): The group SO(2) describes all rotations about the origin in R°: 
50) = ioe E ο. pe R} (5.142) 


sing cosy 


5.8 Classical Algebraic Structures 355 


E G Group SL(2): Every element of SL(2) can be represented as 


cos@ —sin@\ (e' 0 1 £ 
eee cos 0 ) Γ EN 5 i) : (5.143) 


B H Group SE(2): The elements of the group SE (2) can be represented as 3 x 3 matrices: 

cos@ — sin xı η 

sin@ cosÜ zo | with 0 € R and X= τ c R?. (5.144) 

ο οι A 

Remark: Beside real matrix-Lie groups complex matrix-Lie groups also can be considered. So, e.g. 
S L(n, C) is the Lie group of all complex n x n matrices with determinant +1. Similarly there are 
matrix-Lie groups whose entries are quaternions. 
5.3.6.3 Important Applications 


1. Rigid Body Movement 
1. The group S E(3) is the group of rigid-body motions in the Euclidean space R?. That is why it is so 
often applied in control of robots. The 6 independent transformations are defined usually as follows: 


1. Translation in x-direction, 4. Rotation about the x-axis, 
2. Translation in y-direction, 5. Rotation about the y-axis, 
3. Translation in z-direction, 6. Rotation about the z-axis. 


These transformations can be represented by 4 x 4 matrices applied to homogeneous coordinates (see 
3.5.4.2, p. 231) in 3 dimensions, i.e. (x, y, z)™ € R? is represented as a vector (x, y, z, 1)? with four 
coordinates (see 3.5.4.2, p. 231). 

Matrices corresponding to the transformations 1 until 6 are: 


100a 1000 1000 
0100 010b 0100 
Mi-|ggip|* “= |o010]% Me |op3a (5.1452) 
0001 0001 0001 
1 0 0 0 d ieee cosy —siny 0 0 
| | 0 cosa —sina 0 _ 1 siny cosy 00 
Mi= | sina cosa 0 |? Ms= ΚΤ Ἴνα. 0 0 10 .(5.145b) 
0 0 0 1 0 0 01 


The matrices M4, M;, Mg describe the rotations in m consequently SO(3) is a subgroup of S E(3). 
The group SE(3) acts on X = (a, y, z)™ € R? with homogeneous coordinates (X, 1)? as follows: 


Xx’) (RV) /(X)\_ (RX+Vv : 
“νε WE 
where R € SO(3) is a rotation, and V = (a, b, c)? is a translation vector. 
2. Affine Transformations of 2-Dimensional Space 
The group GA(2) of affine transformations of the 2-dimensional space is a 6-dimensional matrix Lie 
group with the following 6 dimensions: 


1. Translation in x-direction, 4. Stretching or shrinking with respect to the origin, 
2. Translation in y-direction, 5. Shearing (stretching with resp. to y, with resp. to x), 
3. Rotation about the origin, 6. 45°-shearing with respect to 5. 


Also these transformations are described by matrices in homogeneous coordinates (x, y, i)? for (v, y)? 


10a 100 cosa — sina 0 
M,=[{010], M:=|01b], M;= | sina cosa 0], (5.147a) 
001 001 0 0 1 
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ε 00 c" 00 cosh v sinh v 0 
ΜΙ = ι e o) , M;= ( 0 e^ o) » M= [si cosh v o) : (5.147b) 


001 0 01 0 0 1 
This group has as essential subgroups the translation group, given by M4 and Mg, the Euclidean group 
S E(2), given by My, M» and Ms, the similarity group, given by Mi, M», Ms, Mg. 
Application: The group GA(2) can be applied to describe all transformations of a planar object 
which is recorded under slight angle modifications by a camera moving in the 3 dimensional space. 
If also large changes in angles of perspectivity can occur, then group P(2) the group of all transforma- 
tions of projective spaces can be used. The matrix-Lie group is generated by the matrices M, until Mg 
and by the two further matrices 


100 100 
M; = (029). Μι = (119). (5.14Tc) 
801 051 


These two additional matrices correspond to a change of the horizon or vanishing of an edge of the plane 
picture. 
5.3.6.4 Lie Algebra 
1. RealLie algebra 
A real Lie algebra A is a real vector space with an operation 
[5] 2X A A, (5.148) 


which is called the Lie bracket and for which the following properties are valid for all a, b, c € A: 


ο |., ] is bilinear, 
ο [a, b] = —[b, αἰ, i.e. the operation is skew-symmetric or anticommutative, 
e the so called Jacobi identity is valid (as a replacement of the missing associativity) 
(a. [b, cl] + [e, la, b] + [b, [ο, a]] = 0. (5.149) 
Obviously [a , a] = 0 holds. 
2. Lie Bracket 
For (real) n x n matrices X and Y a Lie bracket is given by the commutator, i.e. 
(X, Y] := XY - YX. (5.150) 
3. Special Lie-Algebras 
There are associated Lie algebras to matrix-Lie groups. 
1. A function g : R — GL(n) is a one-parameter subgroup of G L(n), if 
e g is continuous, 
9 (0) = Inxn; 
ο g(t +s) = g(t)g(s) for every t, s € R. 


In particular: 
2. If g is a one-parameter subgroup of GL(n), then there exists a uniquely defined matrix X such that 


g(t) — ο (see 5.3.6.1, p. 351). (5.151) 
3. For every n x n matrix A the logarithm log A is defined by 
jer 
log A = SE ——_(A-I)™, (5.152) 
m=1 


if this series is convergent. In particular, the series converges if ||A — I|| < 1. 
4. Correspondence between Lie Group and Lie Algebra 
The correspondence between a matrix-Lie group and the associated Lie algebra is as follows. 
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1. Let G be a matrix-Lie group. The Lie algebra of G, which is denoted by g, is the set of all matrices 
X such that e"* € G holds for all real numbers t. 

In a given matrix-Lie group the elements close to the unit matrix can be represented as g(t) = e'* with 
X € g, and { close to zero. If the exponential map is surjective, as in the case of SO(n) and S E(n), 
then the elements of the group can be parameterized with the help of the matrix-exponential function 


dg dg 
by elements of the corresponding Lie algebra. The matrices x g`" and g~t ο. respectively are called 


tangent vectors or tangent elements to g € G. Calculating these elements for t = 0, one gets X itself, 
ie. gis the tangent space Τα at the identity matrix I. 

2. It can be shown that the Lie algebra assigned to a Lie group in this way is a Lie algebra also in the 
abstract sense. 

Let G be a matrix-Lie group with the associated matrix-Lie algebra g and X and Y elements of g. 
Then: 

e sX c g for any real numbers s, 

eX+Ye€g, 

e (X, Y}=XY-YX €g. 

ΒΑ: The Lie algebra so(2) associated to the Lie group SO(2) is calculated from the representation 
cos θ — sin 0 


of the elements g(0) — ( sif -posD 


) by SO(2) with the help of the tangential elements 


dg 4 — sinf — cos 0 cos@ sind 0-1 
do PT = ( cos 0 E ο um "e = ti 0 ) : (5.153a 
Consequently 
0-1 
so(2) -- 45 10)» SE Rz: (5.153b 
Conversely, from 
X= à 2) comes ο = coss G Ü --sins n a) - Ce pd | (5.153c 


E B: The following matrices form a basis for the Lie algebra so(3): 


00 0 001 0-10 
x;-[00-1], X= 000], x,=(10 0). (5.154) 
01 0 -100 000 


Remark: The surjectivity of the exponential mappings so(3) — SO(3) and se(3) — SE(3) implies 
the existence of a (many-valued) logarithmic function. Nevertheless this logarithm function can be 
applied to interpolation. 

E.g. if rigid-body motions Βι, B» € SE(3) are given, then log Bi, log B» can be calculated which are 
elements of the Lie algebra so(3). Then between these logarithms linear interpolation (1 — t) log By + 
tlog B» can be taken and then the exponential map can be applied in order to get an interpolation 
between the rigid-body motions B, and Bz by 


exp ((1 — t) log Bi + tlog B5). (5.155) 


B C: The matrix-Lie algebra se(3) associated to the matrix-Lie group S E(3) is generated by the 
matrices: 


0001 0000 0000 
0000 0001 0000 

Ei-l0000 E»—-|0000 Es-|0001 (5.1562) 
0000 0000 0000 
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00 0 0 0 010 0-100 
00-10 0 000 1000 

E —-|50100|» Es-|.1000|; Es7|0 0 00 (5.156b) 
00 0 0 0 000 0 0 00 


5. Inner Product 

For a given finite dimensional matrix-Lie group it is always possible to find an orthonormal basis for 
the associated Lie algebra if a suitable inner product (scalar product) is defined. In this case from any 
basis of the Lie algebra an orthonormal basis can be obtained by the Gram-Schmidt orthogonalization 
process (see 4.6.2.2, 4. p. 316). 

In the case of a real matrix-Lie group the Lie algebra consists of real matrices and so an inner product 
is given by 


(X, Y) = Spur (ΧΥΝΥΤ (5.157) 


with a positive definite real symmetric matrix W. 
W A: The group of rigid-body motions S E(2) can be parametrized as 


cos@ — sinf xı 
g(x1, z2, 0) = ο... (sine cos 0 2) with (5.1582) 
0 0 1 
001 000 0—10 
X12[000], X.={001)], X3;=]/1 0 0]. (5.158b) 
000 000 000 


Here X4, X5, X; form an orthonormal basis of Lie algebra se(2) with respect to an inner product given 
by the weight matrix 


100 
w=(010 (5.158c) 
002 
W B: A basis of Lie algebra sl(2, IR) is 
0-1 10 01 
x a x,- (αι) and X= (19): (5.159) 


These elements form an orthonormal basis with respect to the weight matrix W = Igy. = ( a 4 i 


5.3.6.5 Applications in Robotics 
1. Rigid Body Motion 
The special Euclidean group SE(3), which describes the rigid-body motions in IR?, is the semidirect 
product of group SO(3) (rotation about the origin) and R? (translations): 
SE(3) = SO(3) x R*. (5.160) 


In a direct product the factors have no interaction, but this is a semidirect product since rotations act 
on translations as it is clear from matrix multiplication: 


R6GV/R& V. (RoR, Rot, t " 
τ Bm :|-ί 0 1 | (5.161) 


i.e. the first translation vector is rotated before the second translation vector is added. 
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2. Theorem of Chasles 
This theorem tells that every rigid-body motion which is not a pure translation can be described as a 
(finite) screw motion. A (finite) screwing motion along an axis through the origin has the form 


Op. 
A(6) = i =) (5.162a) 
0 1 


where X is a unit vector in the direction of the axis of rotation, 0 is the angle of rotation and p is the 
angular coefficient. Since X is the axis of rotation RX = X, i.e. X is an eigenvector of matrix R belonging 
to unit eigenvalue 1. 

When the axis of rotation does not go through the origin, then a point u of the axis of rotation is chosen 
which is shifted into the origin, then after the screwing it is shifted back: 


os Op. ^u Op. m 
t 1) R gn t fig _{R z% tI- R)u , (5.162b) 
0 1 0 1 
The theorem of Chasles tells that an arbitrary rigid-body motion can be given in the above form, i.e. 
d Op. E 
Rt| [R 34,X t (0- Β)ά (5.163) 
0 1 0 1 


for given R, t and appropriate p and d . Assuming that the angle of rotation 0 and the axis of rotation 
X are already known from R 

-Xx.t (5.164) 
is valid, so the angular coefficient p can be calculated. Then the solution of a linear system of equations 
gives ū: 

2 Op, > p 

(I-R)t= 3,x-t (5.165) 
This is a singular system of equations, where X is in its kernel. Therefore the solution ü is unique except 
to a manyfold of X. In order to determine U it is reasonable to require that 11 is perpendicular to X. 
When the rigid body motion is a pure rotation, then it is not possible to determine an appropriate 
vector uU. 
3. Mechanical Joints 
Joints with one degree of freedom can be represented by a one-parameter subgroup of the group S E(3). 
For the general case of screw joints the corresponding subgroup is 


0p, » 
Ag j-|R mti RE], (5.166) 
0 1 
where X is the axis of rotation, ϐ is the angle of rotation, p gives the angular coefficient and d is an 
arbitrary point on the axis of rotation. 
The most often occurring types of joints are the rotational joints which can be described by the following 
subgroup: 
_/R (I- R)u 
app a) 


The subgroup corresponding the shift joints is 


A(0) = m a , (5.168) 


(5.167) 
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where t describes the direction of the shifting. 

4. Forward Kinematics 

'The goal in the case of industrial robots is the moving and control of the end effectors, which is done 
by joints in a kinematic chain. If all joints are of one parameter and the robot consists e.g. of 6 joints, 
then every position of the robot can be described by the joint-variables gT = (01, 02, θη, 04, θη, 06). 
The output state of the robot is described by the null vector. Then the motions of the robot can be 
described so that first the farest joint together the end effector are moved and this motion is given by 
the matrix A(05). Then the 5-th joint is moved. Since the axis of this joint should not be influenced 
by the motion of the last joint, this motion is given by the matrix A(05). In this way all the joints are 
moved, and the complete motion of the end effector is given by 


Κ(Θ) = Αι(Θι) As (02) As (05) A4 (04) As (05) Αα(θ0). (5.169) 
5. Vector Product and Lie Algebra 


A screw is given by 


0p. : 

α[ὴ-.|- $,*-(I-RM|. (5.170) 
0 1 

and it represents rigid body motions parameterized by the angle 0. Obviously, 0 — 0 gives the identity 

transformation. If the derivative is calculated at 0 = 0, ie. the derivative at the identity, then the 

general element of the Lie algebra is the following: 


dR p dR Ps - 
1A e ce Mes cuam >X -Q 
νε, =| m m g” = on M ; (5.171a) 
dO |, 0 0 6—0 0 0 
dR. : ; , : 
where Q = πρ) is a skew symmetric matrix. It can be shown that R is an orthogonal matrix, so 
RR” = I and RR? = I holds and therefore 
d dR dR?” di 
RR? ΕΤ ΕΒ : 5.171l 
j TOS πω τα, Ρ 
Since R = I for 6 = 0 
dR {ΒΤ 
πο + ο) Ξ0: (5.171c) 
So every skew symmetric matrix 
0 =w; Wy 
Ω -- w 0 s) (5.171d) 
—Wy Wr 0 


can be identified with a vector W” = (Wr, Wy, wz). In this way the multiplication of any three dimen- 
sional vector p by matrix Q corresponds to the vector product with vector W : 

Qp-u xp. (5.171) 
Consequently QJ is the angular velocity of the rigid body with an amplitude w. 
Hence a general element of the Lie algebra se(3) has the form 

"E: 

= E ae (5.1714) 
These matrices form a 6-dimensional vector space which is often identified with the 6-dimensional vec- 
tors of the form 


ge ‘ea (5.172) 


ν 
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5.3.7 Rings and Fields 
In this section, there are discussed algebraic structures with two binary operations. 
5.3.7.1 Definitions 
1. Rings 
A set R with two binary operations + and * is called a ring (notation: (R ,+ , «)), if 
ο (R,+) is an Abelian group, 
ο (R,*) is a semigroup, and 
e the distributive laws hold: 

ax(b+c)=(a*b)+(axc), (b+c)*a= (bxa)+ (c*a). (5.173) 
If (R, *) is commutative or if (R, κ) has a neutral element, then (R, +, κ) is called a commutative ring 
or a ring with identity (ring with unit element), respectively. 
Α commutative ring with a unit element and without zero divisor is called the domain of integrity. 
A nonzero element of a ring is called zero divisor or singular element if there is a nonzero element of the 
ring such that their product is equal to zero. 
In a ring with zero divisor the following implication is generally false: a * b = 0 (a —0 V b=0). 
If Risa ring with a unit element, then the characteristic of the ring R is the smallest natural number k 
such that kl =1+1+...+1=0 (k times 1 equals to zero), and it is denoted by char R = k. If such 
a k does not exist, then char R = 0. 
char R = k means that the cyclic subgroup (1) of the additive group (R, +) generated by 1 has order 
k, so the order of every element is a divisor of k. 
If char R = k and for all r € R, then r +r 4- ...4- r (k times)is equal to zero. The characteristic of a 
domain of integrity is zero or a prime. 
2. Division Ring, Field 
A ring is called division ring or skew field if (R \ {0}, κ) is a group . 
If (RN {0}, κ) is commutative, then R is a field. So, every field is a domain of integrity and also a division 
ring. Reversed, every finite domain of integrity and every finite division ring is a field. This statement 
is a theorem of Wedderburn. 


Examples of rings and fields 

W A: The number domains Z, Q, R, and C are commutative rings with identity with respect to ad- 
dition and multiplication; Q , R, and C are also fields. The set of even integers is an example of a ring 
without identity. 

W B: Theset M, of all square matrices of order n with real (or complex) elements is a non-commutative 
ring with respect to matrix addition and multiplication. It has a unit element which is the identity ma- 


trix. Mn has zero divisors, e.g. for n = 2, (i 7 e 1 = (a 3 ie. both matrices (i η and 


e "d are zero divisors in Mo. 


E C: The set of real polynomials p(x) = anx” + a, 42"^! +- -- + a,x + αρ forms a ring with respect 
to the usual addition and multiplication of polynomials, the polynomial ring IR[r]. 

More generally, instead of polynomials over IR, polynomial rings over arbitrary commutative rings with 
identity element can be considered. 

W D: Examples of finite rings are the residue class rings Z, modulo n. Z,, consists of all the classes 
[a] of integers having the same residue on division by n. ([α]μ is the equivalence class defined by the 
natural number a with respect to the relation ~g introduced in 5.2.4, 1., p. 334.) The ring operations 
®,© on Zn are defined by 
a], & [b], = (a+b, and [a], © [b], = [a - Bn. (5.174) 
If the natural number n is a prime, then (Z, , 9, Ὁ) is a field. Otherwise Z, has zero divisors, e.g. in 
Ze (numbers modulo 6) [3]s - [3]ᾳ = [0]g. Usually Z,, is considered as Z, = (0,1,...,n — 1}, ie. the 
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residue classes are replaced by representatives(see 5.4.3,3., S. 377). 

3. Field Extensions 

If K and L are fields and K C L, then L is an extension field or an over-field of K. In this case L can 
be considered as a vector space over K. 

If L is a finite dimensional space over K, then L is called a finite extension field. If this dimension is n, 
then L is called also an extension of degree n of K (Notation: [L : K] =n). 

E.g. C is a finite extension of R. C is two-dimensional over R, and (1, i) is a basis. IR is an infinite- 
dimensional space over Q. 

For a set M C L, K(M) denotes the smallest field (an over-field of K) which contains the field K and 
the set M. 

Especially important are the simple algebraic extensions (a), where a € L is a root of a polynomial 
from K[r]. The polynomial of lowest degree with a leading coefficient 1 having a as a root is called the 
minimal polynomial ofa over K. If the degree of the minimal polynomials of a € L is n, then K (a) is 
an extension of degree n, i.e. the degree of the minimal polynomials is equal to the dimension of L as 
a vector space over ΚΚ. 

E.g. C = R(i) and i € C is the root of the polynomial x? + 1 € R{z], i.e. C is a simple algebraic 
extension and [C : R] = 2. 

A field, which does not have any proper subfield, is called a prime field. 

Every field K contains a smallest subfield, the prime field of K. 

Out of isomorphism, Q (for fields of characteristic 0) and Z, (p prime, for fields of characteristic p) are 
the single prime fields. 


5.3.7.2 Subrings, Ideals 


1. Subring 

Suppose R = (R,+,*)isaringandU C R. IfU with respect to + and * is also a ring, then U = (U, +, κ) 
is called a subring of R. 

A non-empty subset U of a ring (R, +, κ) forms a subring of R if and only if for all a, b € U also a + (—b) 
and a * b are in U (subring criterion). 

2. Ideal 

A subring { is called an ideal if for allr € R and a € I also r κα and a * r are in J. These special 
subrings are the basis for the formation of factor rings (see 5.3.7.3, p. 363). 

The trivial subrings (0) and R are always ideals of R. Fields have only trivial ideals. 

3. Principal Ideal 

If all the elements of an ideal can be generated by one element according to the subring criterion, then 
it is called a principal ideal. All ideals of Z are principal ideals. They can be written in the form 
mZ = (mg|g € Z} and they are denoted by (m). 


5.3.7.3 Homomorphism, Isomorphism, Homomorphism Theorem 


1. Ring Homomorphism and Ring Isomorphism 
1. Ring Homomorphism: Let Rı = (fà, +, κ) and Ry = (R2, ος, οι) be two rings. A mapping 
h: R4 > [ο is called a ring homomorphism if for all a,b € ΠῚ 

h(a--b) — h(a) οι h(b) and h(a*b) = h(a) o, h(b) (5.175) 
hold. 
2. Kernel: The kernel of h is the set of elements of Rı whose image by h is the neutral element 0 of 
(Re, +), and it is denoted by ker h: 

kerh = (a € Ri|h(a) = 0). (5.176) 
Here ker h is an ideal of R4. 
3. Ring Isomorphism: If h is also bijective, then h is called a ring isomorphism, and the rings Rı 
and Rə are called isomorphic. 
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4. Factor Ring: If] is an ideal of a ring (R, +, κ), then the sets of co-sets {a + I|a € R} of I in the 
additive group (R, +) of the ring R (see 5.3.3, 1., p. 337) form a ring with respect to the operations 
(a+ [οι (ὁ -- I) 2 (a--b) -g-I and (α- D) o. (b 4- I) = (as b) +I. (5.177) 
This ring is called the factor ring of R by I, and it is denoted by R/T. 
The factor ring of Z by a principal ideal (m) is the residue class ring Zm = Z/(m) (see examples of rings 
and fields on p. 361). 
2. Homomorphism Theorem for Rings 
If the notion of a normal subgroup is replaced by the notion of an ideal in the homomorphism theorem 
for groups, then the homomorphism theorem for rings is obtained: A ring homomorphism A: Rı — [ο 
defines an ideal of R4, namely ker h = (a € Rı|h(a) = 0}. The factor ring Ft; / ker h is isomorphic to the 
homomorphic image h(R,) = {h(a)|a € R1). Conversely, every ideal I of R, defines a homomorphic 
mapping natr: Ry + Ha3/I with nat;(a) = a--I. This mapping nat; is called a natural homomorphism. 


5.3.7.4 Finite Fields and Shift Registers 
1. Finite Fields 


The following statements give an overview of the structure of finite fields. 
1. Galois Field GF For every power of primes p" there exits a unique field with p" elements (out 
of an isomorphism), and every finite field has p" elements. The fields with p" elements are denoted by 
GF(p") (Galois field). 
Note: For n > 1 GF(p") and Zp» are different. 
In constructing finite fields with p" elements (p is prime, n > 1), the ring of polynomials over Z, (see 
5.3.7,2., p. 361, W C) and irreducible polynomials are needed: Z, [v] consists of all polynomials with 
coefficients from Z, . The cocfficients are calculated modulo p. 
2. Algorithm of Division and Euclidean Algorithm In a ring of polynomials K [x] the division 
algorithm is applicable (dividing polynomials with a remainder), i.e. for f(x), g(x) € K[x], degf(x) < 
degg(x) there exist q(x), r(x) € K[v] such that 

g(a) = α(α): f(x) 4 r(x) and degr(r) < deg f(x). (5.178) 
This relation is denoted by r(x) = g(x)(modf(x)). Repeatedly performed division with remainders is 
known as the Euclidean algorithm for rings of polynomials and the last nonzero remainder gives the 
greatest common divisor of f(x) and g(x). 
3. Irreducible Polynomials A polynomial f(x) € K[v] is irreducible if it can not be represented 
as a product of polynomials of lower degrees, i.e. (analogously to the prime numbers in Z) f(x) isa 
prime in K[r]. E.g. for polynomials of second or third degree irreducibility means, that they do not 
have roots in K. 
It can be shown that there are irreducible polynomials of arbitrary degree in K[v]. If f(x) € K[x] is 
an irreducible polynomial, then 

K[v]/ f(x) :— (p(z) € Κ|α| | deg p(x) < deg f(x)} (5.179) 
is a field, where addition and multiplication are performed modulo f(x), ie. g(a) * h(x) = g(x) - 
h(x) (mod f(x)). 
If K = Z, and deg f(x) = n, then K [z]/ f (x) has p" elements, i.e. GF(p") = Ζρ[α]/ f(x), where f(x) is 
an irreducible polynomial of degree n. 
4. Calculation Rule in GF(p") In GF(p") the following useful rule is valid: 

(a 4- b)" =a? +0 τεμ. (5.180) 
So, in GF(p") —Z,[r]/ f(x) there is an element a = x, a root of the polynomial f(x) irreducible in 
Z, (x), and GF(p") = Z,[x]/ f (x) = Z,(a). It can be proven that Z, (o) is the splitting field of f(x). 
The splitting field of a polynomial from Z,[:] is the smallest extension field of Z, which contains all 
roots of f (x). 
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5. Algebraic Closure, Fundamental Theorem of Algebra A field K is algebraically closed if 
all roots of the polynomials from K [x] are in K. The fundamental theorem of algebra tells that the field 
C of complex numbers is algebraically closed. An algebraic extension L of K is called the algebraic 
closure of K if L is algebraically closed. The algebraic closure of a finite field is not finite. So there are 
infinite fields with characteristic p. 


6. Cyclic and Multiplicative Group The multiplicative group K* = K \ {0} of a finite field 
K is cyclic, ie. there is an element a € K such that every element of K* is a power of a: K* = 
{1,a,a7,...,a7-*}, if K has q elements. 

An irreducible polynomial f(x) € K[x] is called primitive, if the powers of a represents all nonzero 
elements of L --- K[v]/ f (z), i.e. the multiplicative group L* of L can be generated by x. 

With a primitive polynomial f(x) of degree n it is possible to construct a ,, Table of logarithm” for 
GF(p") from GF (p)[x], which makes calculations easier. 

H Construction of field GF(2?) and its table of logarithm. 

f(x) =1+2 + a? is irreducible over Z[r], since neither 0 nor 1 are roots of it: 


GF(2*) = Za[v]/ f(x) = (ao + aix + azz? | ao, a1, a2 € Zo ^ 3? 214 x). (5.181) 
f (x) is primitive, so a table of logarithm can be created for GF(2?): 
Two expressions are assigned to every polynomial αρ + ayx + asa? from Ζο[α]/ f(x). The coefficient 


vector αρ, a1, a? and the so called logarithm which is a natural number i such that z^ = ag + a,x + a2? 
modulo 1 + x + z?. The table of logarithm is: 


leg 
o 
oe 


e Addition of the field elements (KE) in GF(8): 


᾽ ο Addition of the coordinate vectors (KV) componentwise mod 2 
9 (in general mod p). 
e Multiplication of the field elements (KE) in GF(8): 
à e Addition of logarithms (Log) mod 7 (in general mod (p" — 1)). 
a qund "τ 
5 Example: Z rt Tolg —g? 
6 


235425 ê 


Remark: Finite fields are extremely important in coding theory as linear codes, where vector spaces 
in form (GF(q))" are considered. A subspace of such a vector space is called linear code (see 5.4.6.2,3., 
p. 385). The elements (code words) of a linear code are also n-tuples with elements from a finite field 
GF(q"). In applications in code theory it is important to know the divisors of X" — 1. 

The splitting field of X" — 1 € K[X] is called the n-th cyclotomic field over K. 

If the characteristic of K is not a divisor of n and a is a primitive n-th unit root, then: 

a) The extension field K (a) is the splitting field of X" — 1 over K. 

b) In K (a), the field X" — 1 has exactly n pairwise different roots which form a cyclic group, and among 
them there are y(n) primitive n-th unit roots, where y(n) denotes the Euler function (5.4.4,1., p. 381). 
By the k-th powers (k < n, g.c.d.(k,n)=1) of a primitive n-th unit root o all unit roots can be got. 


2. Applications of Shift Registers 


Calculations with polynomials can be performed well by a linear feedback shift register (see Fig.5.18). 


With a linear feedback shift register based on the feedback polynomial f(a) = fo+ fixe: -+ f, ia" η 
x’ and from the state polynomial s(z) = sọ + δια +--+ + 5, τα} one gets the state polynomial 


s(x): x — δρα. f(x) = s(x) : x (mod f(z)). 

Especially, if s(x) = 1, after i steps (i-times applications) the state polynomial is x’ (mod f(x)). 

Bl Demonstration with the example from page 364: The primitive polynomial f(x) = 14-x--2? € Zo[v] 
is chosen as feedback polynomial. Then the shift register with lengh 3 has the following sequence of 
states: 
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i 
+H 
5 


-fo f T 
4 So f+ — 81 mt ot Spo |I Y --η Sia 
Figure 5.18 
From the initial state: 10051 (mod f (1) 
the states follow as: 0102 x (mod f (x)) 
001222 (modf(x)) 
110423 =l+z (mod f (z)) 
011521 =24+2? (mod f (z)) 
111525 Έ 1 γα γα (mod f(z)) 
101536 =1+4+2? (mod f (z)) 
1005237 =1 (mod f (x)) 


The states are considered as coefficient vectors of a state polynomial so + s1% + sox. 
In general: A linear feedback shift register with length r gives a sequence of states of maximal length 


with period 2” — 1 if and only if the feedback polynomial is a primitive polynomial of degree r. 
5.3.8 Vector Spaces * 
5.3.8.1 Definition 


A vector space over a field F consists of an Abelian group V = (V, +) of “ vectors” written in additive 
form, of a field F = (F,+,*) of “ scalars” and an exterior multiplication F x V — V, which assigns 
to every ordered pair (k, v) for k € F and v € V a vector kv € V. These operations have the following 
properties: 


(V1) (u+v)+w=u+(v+w) for allu,v,w € V. (5.182 
(V2) Thereisa vector 0 € V such that v +0 = v for every v € V. (5.183 
(V3) To every vector v thereisa vector — v such that v + (—v) = 0. (5.184 
(V4) v+w=w+vu for every v,w € V. (5.185 
(V5) 1v-— v for every v € V, 1 denotes the unit element of F. (5.186 
(V6) r(sv) — (rs)u for every r,s € F and every v € V. (5.187 
(ντ) (r-s)u 2 rv 4- sv forevery r,s € F and every v € V. (5.188 
(V8) r(v-- w) — rv -- rw for every r € F and every v,w € V. (5.189 


If F — R holds, then it is called a real vector space. 

Examples of vector spaces: 

ΒΑ: Single-column or single-row real matrices of type (n, 1) and (1, n), respectively, with respect to 
matrix addition and exterior multiplication with real numbers form real vector spaces R” (the vector 
space of column or row vectors; see also 4.1.3, p. 271). 

W D: All real matrices of type (m,n) form a real vector space. 

E C: All real functions continuous on an interval [a,b] with the operations 


(f εί.) = fl) +90) and (kf)(z) — κ. f(x) (5.190) 


*In this paragraph, generally, vectors are not printed in bold face. 
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form a real vector space. 
Function spaces have a fundamental role in functional analysis (see Ch. 12, p. 654). For further exam- 
ples see 12.1.2, p. 655. 


5.3.8.2 Linear Dependence 

Let V be a vector space over F. The vectors vi, v, ..., Um € V are called linearly dependent if there 
are ky, ko,..., km € K not all of them equal to zero such that 0 kıvı + kava + +++ + kmUm holds. 
Otherwise they are linearly independent. Linear dependence of at least two vectors means that one of 
them is a multiple of the other. 


If there is a maximal number n of linearly independent vectors in a vector space V, then the vector 
space V is called n dimensional. This number n is uniquely defined and it is called the dimension. 
Every n linearly independent vectors of V form a basis. If such a maximal number does not exist, then 
the vector space is called infinite dimensional. The vector spaces in the above examples are n, m : n, 
and infinite dimensional. 

In the vector space R”, n vectors are independent if and only if the determinant of the matrix, whose 
columns or rows are these vectors, is not equal to zero. 

If (v1, v2,-.-,Un} form a basis of an n-dimensional vector space over F, then every vector v € V has a 
unique representation v = kv, + kova 4- +++ + knUn with Κι, ko... kn € F. 

Every set of linearly independent vectors can be completed into a basis of the vector space. 


5.3.8.3 Linear Operators 


1. Definition of Linear Operators 
Let V and W be two real vector spaces. A mapping f : V — W from V into W is called a linear 
mapping or linear transformation or linear operator (see also 12.1.5.2, p. 658) from V into W if 


f(u+v) = fu- fv forall u,v € V, (5.191) 
f(Au) =Afu for all u € V and all real A. (5.192) 

B 
WB A: The mapping fu :— f u(t) dt, which transforms the space Cla, 8] of continuous real functions 


a 
into the space of real numbers is linear. 
In the special case when W = R’, as in the previous example, linear transformations are called linear 
functionals. 
ΒΒ: Let V = R” and let W be the space of all real polynomials of degree at most n — 1. Then the 


mapping (ao, d1, ... ,Q4.1) = do + Gx + aga” +- -+ + a, χα} is linear. In this case each n-element 
vector corresponds to a polynomial of degree € n — 1. 

B C: If V = R” and W = R”, then all linear operators f from V into W (f: R” - R”) can 
be characterized by a real matrix A = (αικ) of type (m,n). The relation Ax = y corresponds to the 


system of linear equations (4.174a) 


Qj G12 t Gin Ti yı 
Q21 Q22 ''* Gan T2 82 
απ Am2 *** Amn Tn Ym 


2. Sum and Product of Two Linear Operators 
Let f: V — W, g: V — W and h: W — U be linear operators. Then the 


sum f+g:V — W is defined as (f + g)u = fu + gu forall u € V and the (5.193) 
product hf: V — U is defined as (hf)u = h( fu) forall u € V. (5.194) 
Remarks: 


1. If f, g and h are linear, then f + g and fh are also linear operators. 
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2. The product (5.194) of two linear operators represents the consecutive application of these operators 
fandh. 


3. The product of two linear operators is usually non-commutative even if the products exist: 


hf z fh. (5.195a) 
Commutability exists, if 
hf —fh=0 (5.195b) 


holds. In quantum mechanics the left-hand side of this equation hf — fh is called the commutator. In 
the case (5.195a) the operators f and h do not commutate, therefore we have to be very careful about 
the order. 


W Asa particular example of sums and products of linear operators one may think of sums and products 
of the corresponding real matrices. 
5.3.8.4 Subspaces, Dimension Formula 
1. Subspace: Let V be a vector space and U a subset of V. If U is also a vector space with respect to 
the operations of V, then U is called a subspace of V. 
A non-empty subset U of V is a subspace if and only if for every u1, uz € U and every k € F also u, + u2 
and k - u; are in U (subspace criterion). 
2. Kernel, Image: Let Vi, V2 be vector spaces over F. If f: V; > V3 is a linear mapping, then the 
linear subspaces kernel (notation: ker f) and image (notation: im f) are defined in the following way: 
ker f = {v E€ V|f(v) =0}, im f= {fwv EV}. (5.196) 
So, for example, the solution set of a homogeneous linear equation system Ax = 0 is the kernel of the 
linear mapping defined by the coefficient matrix A. 
3. Dimension: The dimension dim ker f and dim im f are called the defect f and rank f, respectively. 
For these dimensions the equality 
defect f + rank f = dim V, (5.197) 
is valid and is called the dimension formula. In particular, if the defect f = 0, i.e., ker f = {0}, then 
the linear mapping f is injective, and conversely. Injective linear mappings are called regular. 


5.3.8.5 Euclidean Vector Spaces, Euclidean Norm 


In order to be able to use notions such as length, angle, orthogonality in abstract vector spaces we 
introduce Euclidean vector spaces. 


1. Euclidean Vector Space 
Let V be a real vector space. If: V x V — R is a mapping with the following properties (instead of 
p(v, w) one writes v - w) for every u,v, w € V and for every r € R 

(S1) v-w=w:v, 

(S2) (u+v)-w=u:w+v:w, 
(S3) r(v-w)= (rv) w= v. (rw) 
(S4) v-v > Oifand only if v Z 0, 


then y is called a scalar product on V. If there is a scalar product defined on V, then V is called a 
Euclidean vector space. 


5.198 
5.199 
5.200 
5.201 


(5.198) 
(5.199) 
(5.200) 
(5.201) 


These properties are used to define a scalar product with similar properties on more general spaces, too 
(see 12.4.1.1, p. 673). 
2. Euclidean Norm 
The value 
lvl] = vv -v (5.202) 
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denotes the Euclidean norm (length) of v. The angle a between v, w from V is defined by the formula 


Miis (5.203 
TOTO. 5. 
llull- Iwll 
If v - w = 0 holds, then v and w are called orthogonal to each other. 


B Orthogonality of Trigonometric Functions: In the theory of Fourier series (see 7.4.1.1, p. 474), 
there are functions of the form sin kx and cos kx. Theses fubctions can be considered as elements o 
C[0, 27]. In the function space Ca, b] the formula 


1:27 [ fot dn (5.208 


defines a scalar product. Since 


cosa = 


2π 2π 
/ sinkz.sinlrdr — 0 (k#1), (5.905) / coskx-coslede=0 (41), (5.206 
0 0 


2π 
[ sin kx + cos lx dx = 0, (5.207 
Jo 


the functions sin kr € C[0, 27] and cosla € C[0, 27] for every k,l € N are pairwise orthogonal to each 
other. This orthogonality of trigonometric functions is used in the calculation of Fourier coefficients in 
harmonic analysis (see 7.4.1.1, p. 474). 
5.3.8.6 Bilinear Mappings, Bilinear Forms 
Bilinear mappings can be considered as generalizations of different products between vectors. In that 
case bilinearity uses the distributivity of the corresponding product with respect to vector addition. 
1. Definition 
Let U, V, W be vector spaces over the same field K. A mapping f: U x V —> W is called bilinear if 
for every u € U the mapping v > f(u, v) is a linear mapping of V into W and 
for every v € V the mapping u > f(u, v) is a linear mapping of U into W. (5.208) 
It means that a mapping f: U x V —>+ W is bilinear, if for every k € K,u,u’ € U, and v, v' € V holds: 
f(utu',v) = f(u,v) + f(u', v), f(ku,v) =kf(u,v) and 
f(u,v v) = f(u, v) }- f(u v), f(u, kv) = kf(u,v). (5.209) 
If f is replaced by the dot product or vector product or by a multiplication in a field, these relations 
describe the left sided and right sided distributivity of this multiplication with respect to vector addi- 
tion. 
Especially, if U — V, and W — K which is the underlying field, then f is called a bilinear form. In this 
book only the real (K = R) or complex (X = C) cases are considered. 
Examles of Bilinearforms 
E A: U =V = R”, W = R, f is the dot product in R^: f(u,v) = αυ = Y? , wv; , where u; and v; 
(i = 1,2,...,n) denote the Cartesian coordinates of u and v. 
BB:U-V-W-IHR?, f isthe cross product in R°: 
f(u,v) = u x v = (ου) — vet, V1U3 — u1U3 , τυ) — viu» 
2. Special Bilinear Forms 
A bilinear form f: V x V — R is called 
e symmetric, if f(v, v) = f(v',v) for every v, v € V, 
e skew-symmetric, if f(v, v) = — f (v', v) for every v, v’ € V and 
e positive definite, if f(v, v) > 0 for every v € V v Z 0. 
So an Euclidean dot product in V (see 5.3.8.5, p. 367) can be characterized as a symmetric, positive 


definite bilinear form. The canonical Euclidean dot product in R” is defined as f(u, 0) = utv. 


yr 
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In finite dimensional spaces V a bilinear form can be represented by a matrix: If f:= V x V — Ris 
a bilinear form, and B = (b1, be,..., bn) is a basis of V, then the matrix 


Aalf) = (ου b;)i;) (5.210) 
is the representation matrix of f with respect to basis B. The bilinear form then can be written in 
matrix product form: 

f(v,v) =vTAp(f)u'’, (5.211) 

where v and v' are given with respect to basis B. 
The representation matrix is symmetric, if the bilinear form is symmetric. In complex vector spaces 
(because 27 can be a negative number) symmetric, positive definite bilinear forms do not have much 
sense. To define an unitary dot product and also distances and angles with it instead of bilinear form 
the concept of the so called sesquilinear form is used [5.6], [5.12]. 


3. Sesquilinear Form 
A mapping f: V x V — C is called sesquilinear form if for every v, v' € V and k € C: 

flu+ 0) = f(u,v) + (αυ), f(ku,v) = kf(u,v) and 

f(u,v v) = αυ) + f(u v), f(u kv) = k* f(uv). (5.212) 
where k* denotes the complex conjugate of k. The function is linear in the first argument and ,,semi- 
linear" in the second argument. Analogously to the real case ,,symmetry" is defined in the following 
Way: 
A Salinas form f: V x V — Cis called hermitian if f(v, v’) = f(v’, v)* for every v,v' € V. 
In this way a (unitary) dot product is characterized by an hermitian, positive definite sesquilinear form. 
The canonical unitary dot product in C" is defined as f(u, v) = uTv*. 
If V is finite dimensional, then a sesquilinear form can be represented by a matrix (like in the real 
case): 
If c V x V — Cis a sesquilinear form, and B = (bı, b2,...,bn) is a basis of V, then the matrix 
Ap(f) = (f (bi, b;))i; is the representation matrix of f with respect to basis B. The sesquilinear form 
can be written in matrix product form: 

fw, v’) 2 v  Ap(f)v', (5.213) 
where v and v' are given with respect to basis B. A representation matrix is hermitian if and only if 
the sesquilinear form is hermitian. 
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5.4 Elementary Number Theory 
Elementary number theory investigates divisibility properties of integers. 
5.4.1 Divisibility 
5.4.1.1 Divisibility and Elementary Divisibility Rules 
1. Divisor 
An integer b € Z is divisible by an integer a without remainder iff * there is an integer q such that 

qa — b (5.214) 
holds. Here a is a divisor of b in Z, and q is the complementary divisor with respect to a; b is a multiple 
of a. For “a divides b” we write also a|b. For “a does not divide b” we can write a/b. The divisibility 
relation (5.214) is a binary relation in Z (see 5.2.3, 2., p. 331). Analogously, divisibility is defined in 
the set of natural numbers. 


2. Elementary Divisibility Rules 


(DR1) For every a € Z we have lja, aja and αἱθ. (5.215 
(DR2) Ifaļb, then (—a)|b and αἰ(--θ). (5.216 
(DR3)  a|b and b|a implies a = b or a = —b. (5.217 
(DR4)  a|l implies a = 1 or a = —1. (5.218 
(DR5)  a|b and b £ 0 imply |a| < |b]. (5.219 
(DRG) aļ|b implies a|zb for every z € Z. (5.220 
(DR?)  a|bimplies az|bz for every z € Z. (5.221 
(DRS)  az|bz and z £0 implies a|b for every z € Z. (5.222 
(DR9) αἰ and dc imply alc. (5.223 
(DR10) a|b and c|d imply ac|bd. (5.224 
(DR11) a|band alc imply aļ|(z1b + 2ος) for arbitrary 2, z2 € Z. (5.225 
(DR12) ab and a|(b +c) imply alc. (5.226 
5.4.1.2 Prime Numbers 

1. Definition and Properties of Prime Numbers 

A positive integer p (p > 1) is called a prime number iff 1 and p are its only divisors in the set IN o 


positive integers. Positive integers which are not prime numbers are called composite numbers. 

For every integer, the smallest positive divisor different from 1 is a prime number. There are infinitely 
many prime numbers. 

A positive integer p (p > 1) isa prime number iff for arbitrary positive integers a,b, p|(ab) implies pla 
or p|b. 

2. Sieve of Eratosthenes 

By the method of the *Sieve of Eratosthenes" , every prime number smaller than a given positive integer 
n can be determined: 

a) Write down the list of all positive integers from 2 to n. 

b) Underline 2 and delete every subsequent multiple of 2. 

c) If p is the first non-deleted and non-underlined number, then underline p and delete every p-th 
number (beginning with 2p and counting the numbers of the original list). 

d) Repeat step c) for every p (p € yn) and stop the algorithm. 


“if and only if 
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Every underlined and non-deleted number is a prime number. In this way, all prime numbers € n are 
obtained. 

'The prime numbers are called prime elements of the set of integers. 

3. Prime Pairs 

Prime numbers with a difference of 2 form prime pairs (twin primes). 

W (3,5), (5,7), (11, 13), (17, 19), (29, 31), (41, 43), (59, 61), (71, 73), (101, 103) are prime pairs. 

4. Prime Triplets 

Prime triplets consist of three prime numbers occuring among four consecutive odd numbers. 

E (5,7, 11), (7, 11, 13), (11, 13, 17), (13, 17, 19), (17, 19, 23), (37, 41, 43) are prime triplets. 

5. Prime Quadruplets 

If the first two and the last two of five consecutive odd numbers are prime pairs, then they are called a 
prime quadruplet. 

W (5,7, 11,13), (11, 13, 17, 19), (101, 103, 107, 109), (191, 193, 197, 199) are prime quadruplets. 

The conjecture that there exist infinitely many prime pairs, prime triplets, and prime quadruplets, is 
not proved still. 

6. Mersenne Primes 
If 2 — 1, ΚΕΝ, isa prime number, then k is also a prime number. The numbers 2? — 1 (p prime) 
are called Mersenne numbers. A Mersenne prime is a Mersenne number 2? — 1 which is itself a prime 
number. 
W 2’ — 1 isa prime number for the first ten values of p: 2, 3, 5, 7, 13,17, 19, 31, 61, 89, 107, etc. 
Remark: Since a few years the largest known prime is always a Mersenne prime, e.g. 219112609 — 1 in 
2008, 257885161 — 1 in 2013. In contrary to other natural numbers the numbers of the form 2* — 1 can 
be tested in a relatively simple way whether they are primes: Let p > 2 be a prime and a sequence of 
natural numbers is defined by s, = 4, 8:41 :— 52 — 2 (i > 1). The number 2? — 1 is a prime if and only 
if the term of the sequence 5». 1 is divisible by 2? — 1. 

'The prime test based on this statement is called Lucas-Lehmer test . 


7. Fermat Primes 

Ifa number 2*+1, k € IN, isan odd prime number, then k is a power of 2. The numbers 2^--1, k € IN, 
are called Fermat numbers. If a Fermat number is a prime number, then it is called a Fermat prime. 
W For - 0,1,2,3,4 the corresponding Fermat numbers 3, 5, 17, 257, 65537 are prime numbers. It is 
conjectured that there are no further Fermat primes. 

8. Fundamental Theorem of Elementary Number Theory 

Every positive integer n > 1 can be represented as a product of primes. This representation is unique 
except for the order of the factors. Therefore n is called to have exactly one prime factorization. 

B 360 =2-2-2-3-3-5=23.3?.5, 

Remark: Analogously, the integers (except —1,0, 1) can be represented as products of prime elements, 
unique apart from the order and the sign of the factors. 

9. Canonical Prime Factorization 

It is usual to arrange the factors of the prime factorization of a positive integer according to their size, 
and to combine equal factors to powers. If every non-occurring prime is assigned exponent 0, then every 
positive integer is uniquely determined by the sequence of the exponents of its prime factorization. 

E 101533312 = 27 - 3? - 11? belongs the sequence of exponents (7, 2,0,0,3,0,0,...). 

For a positive integer n, let pı, po, .. . Pm be the pairwise distinct primes divisors of n, and let αι. denote 
the exponent of a prime number p; in the prime factorization of n. Then 


m 


n= [[ ορ, (5.227a) 
k=1 


372 5. Algebra and Discrete Mathematics 


and this representation is called the canonical prime factorization of n. It is often denoted by 


n= Pee. a D 
ν΄») 5.99Τ] 
Ῥ 


where the product applies to all prime numbers p, and where νρ(τ) is the multiplicity of p as a divisor 
of n. It always means a finite product because only finitely many of the exponents vj (n) differ from 0. 
10. Positive Divisors 
If a positive integer n > 1 is given by its canonical prime factorization (5.227a), then every positive 
divisor t of n can be written in the form 

m 


t= [[ p with m% € {0,1,2,...,a,} fork = 1,2,...,m. (5.228a) 
k=1 
The number 7(n) of all positive divisors of n is 
r(n) = Tf (ax + Ὁ. (5.228b) 
k=1 
ΒΛ: 7(5040) = τ(2!. 32.5. 7) = (4+ 1)(2+1)(1 + 1)(1 + 1) = 60. 
W B: r(pipo::: pr) = 2", if pi, po, .. - , p, are pairwise distinct prime numbers. 


The product P(n) of all positive divisors of n is given by 
P(n) 2 n7, (5.228c) 
E A: P(20) = 20? = 8000. E B: P(p?) = p^, if pis a prime number. 
E C: P(pq) = p?q’, if p and q are different prime numbers. 
The sum c (n) of all positive divisors of n is 


m pakti — 1 


a(n) = ΠΠ Pr 


ku Pol 
E A: σ(120) = o(2? - 3-5) = 15-4- 6 = 360. E B: o(p) =p +1, ifp is a prime number. 


(5.228d) 


5.4.1.3 Criteria for Divisibility 
1. Notation 


Consider a positive integer given in decimal form: 


n = (ακαμ.ι’:' a9a109)10 = ap 10" + aj. 10571 +--+ + a910? + a110 + ag. (5.2292. 
Then 

Qi(n) = ag + ay + ag + +++ + ak (5.229b 
and 

Qi (ο) = ao — αι + ag — +++» + (—1)"ar (5.229c 


are called the sum of the digits (of first order) and the alternating sum of the digits (of first order) of n, 
respectively. Furthermore, 


Q»(n (a1a9)10 + (a3a2)19 + (a504)10 ++ °° and (5.229d 

Q5(n) = (a1a0)10 — (a3a2)10 + (a544)10 — +++: (5.229e 
are called the sum of the digits and the alternating sum of the digits, respectively, of second order and 

Q3(n) = (asa1a9)10 + (a5a403)10 + (ακαταρ)ιο +++: (5.229f 
and 

Qs(n) = (asa120)10 — (a5a4a3)10 + (aga7a6)10 — ++ °° (5.229g 


are called the sum of the digits and alternating sum of the digits, respectively, of third order . 
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E The number 123 456 789 has the following sum of the digits: Qı = 9+8+7+6+5+4+3+2+1 = 45, 
Qi = 9—8--7—64-5—44-3—24-1 = 5, Qs = 89+ 6744542341 = 225, Q5 = 89—67--45—23--1 = 45, 
Q5 = 789 + 456 + 123 = 1368 and Q} = 789 — 456 + 123 = 456. 

2. Criteria for Divisibility 

There are the following criteria for divisibility: 


DC-1: 3|n > 3|Qi (n), (5.230a) DC-2: 7|n e T|Q5(n), (5.230b 
DC-3: 9|n 5 9|Q: (n), (5.230c) DC-4: ll|n e 11|Q! (n), (5.230d 
DC-5: 13|n 5 13|Q5(n) (5.230e) DC-6: 37|n e 37|Q3(n), (5.230f 
DC-7: 101|[n & 101|Q5(n), (5.230g) DC-8: 2|n e 2|ao. (5.230h 
DC-9: 5|n e 5ļao, (5.2301) DC-10: 2"|n & 2ΐ|(αχ.ταχ.α--:αταρ}ιο,  (5.230j 


DC-11: 5*|n & 5"|(ay ταν ο. αιαρ)ιο. (5.230k) 

E A: a = 123456 789 is divisible by 9 since Qi(a) = 45 and 9/45. but it is not divisible by 7 since 
Q3(a) = 456 and 7/456. 

E B: 91619 is divisible by 11 since Q (91 619) = 22 and 11/22. 

li C: 99994096 is divisible by 24 since 2*|4 096. 

5.4.1.4 Greatest Common Divisor and Least Common Multiple 


1. Greatest Common Divisor 


For integers a1, α0,...., ἄμ, which are not all equal to zero, the largest number in the set of common 
divisors of a4,45,...,a, is called the greatest common divisor of a1, az, ...,an, and it is denoted by 
gcd(ai, a2, ..., απ). If ged(ai, a2, . . ., an) = 1, then the numbers aj, a2, ..., an are called coprimes. 


Το determine the greatest common divisor, it is sufficient to consider the positive common divisors. If 
the canonical prime factorizations 


a; = [z^ *? (5.2312) 
a 


of a1, a2, . . . , an are given, then 


{ min [v,(a;)] } 
gcd(ai, d5,..., an) = II» ‘ . (5.231b) 
p 

E For the numbers αι = 15 400 = 23 . 52 . 7-11, a2 = 7875 = 3? - 5° «τας 3850 = 2-5?- 7-11, the 
greatest common divisor is gcd(a;, a2, a3) = 5?- 7 = 175. 
2. Euclidean Algorithm 
The greatest common divisor of two integers a, b can be determined by the Euclidean algorithm without 
using their prime factorization. To do this, a sequence of divisions with remainder, according to the 
following scheme, is performed. For a > b let αρ = a,a, = b. Then: 

ao = 141 + ae, 0 < a2 < a, 

αι = Q202 + às, 0 < ag «αρ, 


l (5.232a) 
απο = η ιαπ ΙΓ An, 0 < Gn < an1, 


ün—1 = (nOn. 
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The division algorithm stops after a finite number of steps, since the sequence as, a3,... is a strictly 
monotone decreasing sequence of positive integers. The last remainder an, different from 0 is the great- 
est common divisor of αρ and αι. 


W gcd(38, 105) = 1, as can be seen by the help of the table to the right. 


By the recursion formula pu = : ` » T A 
gcd(a1, a2, ..., a4) = gcd(ged(a1, a2, .. . αμ. 1) απ), (5.232b) 20 =3-942 
the greatest common divisor of n positive integers with n > 2 can be de- 9=4-241 
termined by repeated use of the Euclidean algorithm. 2=2-1 
W ecd(150, 105,56) = ged(ged(150, 105), 56) = ged(15, 56) = 1. 
W The Euclidean algorithm to determine the ged (see also 1.1.1.4, 1., 55 =1-34421 
p. 3) of two numbers has especially many steps, if the numbers are adja- 34 =1-21413 
cent numbers in the sequence of Fibonacci numbers (see 5.4.1.5, p. 375). 21 =1.13+8 
The annexed calculation shows an example where all quotients are al- 13 =1-84+5 
ways equal to 1. 8=1-54+3 
3. Theorem for the Euclidean Algorithm 5=1-3+4+2 
For two natural numbers a,b with a > b > 0, let A(a, b) denote the 3=1-2+1 
number of divisions with remainder in the Euclidean algorithm, and let 2=1-1+1 
&(b) denote the number of digits of b in the decimal system. Then 121.1. 
A(a, b) € 5+ k(b). (5.233) 


4. Greatest Common Divisor as a Linear Combination 
It follows from the Euclidean algorithm that 

az = ao — NA, = codo + doar, 

a3 = d, — Φα = cao + diay, 


(5.234a) 
απ = An—2 — Qn-1ün-1 = Cn—249 + d, 30. 


Here c,» and d,» are integers. Thus the gcd(ao, αι) can be represented as a linear combination of αρ 
and a, with integer coefficients: 


gcd(ag, αι) = ον ορ + d 3a. (5.234b) 
Moreover ged(ai, a2,...,@n) can be represented as a linear combination of a1, a2, .. . , an, since: 
gcd(ai, a5,..., an) = ged(gcd(ay, ag,...,An—1), an) = c: ged(a1,a9,...,a4 1) + day. ο (5.234c) 


WI gcd(150, 105,56) = ged(ged(150,105),56) = ged(15,56) = 1 with 15 = (—2) - 150 4- 3. 105 and 
1 = 15-15 + (--4) - 56), thus ged(150, 105, 56) = (—30) - 150 + 45. 105 + (—4) - 56. 


5. Least Common Multiple 


For integers a1,05,...,a4, among which there is no zero, the smallest number in the set of positive 
common multiples of a1, a5, .. . ἂμ is called the least common multiple of a4, a5, ... , an, and it is denoted 
by lem(a;, a2,...,@n). 

If the canonical prime factorizations (5.231a) of a4, a2, . . . , an are given, then: 


ru [vy(a;)] } 

lem(a1,@2,...,an) = [[pt ' 3 (5.235) 
p 

E For the numbers a, = 15400 = 23 . 5? . 7- 11, a2 = 7875 = 32.55. ται = 3850— 2-5?-7-11 the 

least common multiple is lem(a;, a5, a3) = 2° - 32-55. 7-11 = 693 000. 
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6. Relation between gcd and lcm 
For arbitrary integers a, b: 

|ab| = gcd(a, b) : lem(a, b). (5.236) 
Therefore, the lcm(a, b) can be determined with the help of the Euclidean algorithm without using the 
prime factorizations of a and b. 


5.4.1.5 Fibonacci Numbers 


1. Recursion Formula 
The sequence 

(Fi)ueN with Fy = F;—1 and Frio = Fn + Επι (5.237) 
is called Fibonacci sequence. It starts with the elements 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 
233,971, ses 
W The consideration of this sequence goes back to the question posed by Fibonacci in 1202: How many 
pairs of descendants has a pair of rabbits at the end of a year, if every pair in every month produces a 
new pair, which beginning with the second month itself produces new descended pairs? The answer is 
Fa = 877. 
2. Explicit Formula 


Besides the recursive definition (5.237) there is an explicit formula for the Fibonacci numbers: 


pes (Hy [2:5]. (5.238 


ν5 2 2 

Some important properties of Fibonacci numbers are the followings. For m,n € IN: 

(1) Prin = Fm-1Fn + FF (πι 1). (5.239a (2). Επ] Prin: (5.239b 

(3) ged(m, n) = dimplies gcd(Fm, Fn) = Fu. (5.239c (4) ged( Fs, F,41) = 1. (5.239d 

(5) Εν) Fk holds iff m|k holds. (5.239e (6) 5 F? = Ε.Ε. (5.239f 
i=1 

(7) gcd(m, n) = 1 implies Fm Fh|Finn- (5.239g (8) XY F = Fap- 1. (5.239h 
i=1 

(9) F, Fio — F2, = (-1)"*. (5.239% (10) F? + F2, = Fong. (5.239j 

(11) Εὖ» — Fn = Fane. (5.239k 


5.4.2 Linear Diophantine Equations 
1. Diophantine Equations 


An equation f(r4,25,...,x4) = b is called a Diophantine equation in n unknowns iff f (zi, r9, ...,2,,) 
is a polynomial in z4,25,...,r, with coefficients in the set Z of integers, b is an integer constant and 
only integer solutions are of interest. The name “Diophantine” reminds of the Greek mathematician 
Diophantus, who lived around 250 AD. 

In practice, Diophantine equations occur for instance, if relations between quantities are described. 
Until now, only general solutions of Diophantine equations of at most second degree with two variables 
are known. Solutions of Diophantine equations of higher degrees are only known in special cases. 
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2. Linear Diophantine Equations in n Unknowns 
A linear Diophantine equation in n unknowns is an equation of the form 


απι + Q222 +++ Ont, =b (a; €Z, b € Z), (5.240) 
where only integer solutions are searched for. A solution method is described in the following. 


3. Conditions of Solvability 
If not all the coefficients a; are equal to zero, then the Diophantine equation (5.240) is solvable iff 
gcd(a, a2, ..., an) is a divisor of b. 
W 114% + 315y = 3 is solvable, since gcd(114, 315) = 3. 
If a linear Diophantine equation in n unknowns (n > 1) has a solution and Z is the domain of variables, 
then the equation has infinitely many solutions. Then in the set of solutions there are n—1 free variables. 
For subsets of Z, this statement is not true. 
4. Solution Method for n — 2 
Let 

azı +a2z2 =b (a4,a3) Æ (0,0) (5.241a) 
be a solvable Diophantine equation, i.e., gcd(a1,a2)|b. To find a special solution of the equation, the 
equation is divided by ged(a;, αὐ) and one obtains ajx} + aha = U' with ged(a,, αὐ) = 1. 
As described in 5.4.1, 4., p. 374, gcd(a}, a5) is determined to obtain finally a linear combination of αι 
and a5: ac, + ach = 1. 
Substitution in the given equation demonstrates that the ordered pair (c{b’, c4b’) of integers is a solution 
of the given Diophantine equation. 
WB 114r--315y = 6. The equation is divided by 3, since 3 = gcd(114, 315). That implies 38r 4-105y = 2 
and 38 - 47 + 105 - (—17) = 1 (see 5.4.1, 4., p. 374). The ordered pair (47 - 2, (C17) - 2) = (94, —34) is 
a special solution of the equation 114x + 315y = 6. 
The family of solutions of (5.241a) can be obtained as follows: If (2, x9) is an arbitrary special solution, 
which could also be obtained by trial and error, then 
{(α9 +t- αὐ, αϑ —t-a))|t € Z} (5.241b) 
is the set of all solutions. 
E The set of solutions of the equation 114x + 315y = 6 is {(94 + 3151, —34 — 114t)|t € Z}. 
5. Reduction Method for n > 2 
Suppose a solvable Diophantine equation 


αισι + απο + +++ + ann = b (5.242a) 
with (a1, a2,- . , an) Æ (0,0,...,0) and ged(ai, a2,...,@n) = Lis given. If ged(a1, a2, . . . , an) Æ 1, then 
the equation should be divided by ged(a1, a2, . . . αμ). After the transformation 

αισι + a22 + +++ + An ια 1 = b — Anin (5.242b) 
Tn is considered as an integer constant and a linear Diophantine equation in n — 1 unknowns is obtained, 
and it is solvable iff gcd(a1, a2, . . . , an—1) is a divisor of b — a,,,. 
The condition 

gcd(ai, az, ..., ἄπ. 1)|b — aux (5.242c) 
is satisfied iff there are integers c, c,, such that: 

gcd(a, a2, ..., Q5 1) € anc, = b. (5.242d) 


This is a linear Diophantine equation in two unknowns, and it can be solved as shown in 5.4.2,4., p. 376. 
If its solution is determined, then it remains to solve a Diophantine equation in only n — 1 unknowns. 
This procedure can be continued until a Diophantine equation in two unknowns is obtained, which can 
be solved with the method given in 5.4.2, 4., p. 376. 

Finally, the solution of the given equation is constructed from the set of solutions obtained in this way. 
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W Solve the Diophantine equation 

2a + 4y + 3z = 3. (5.243a 
This is solvable since gcd(2, 4, 3) is a divisor of 3. 
The Diophantine equation 

2x + 4y = 3 — 3z (5.243b 
in the unknowns z, y is solvable iff gcd(2, 4) is a divisor of 3 — 3z. The corresponding Diophantine 
equation 22' + 3z = 3 has the set of solutions ((—3 + 31, 3 — 2t)|t € Z}. This implies, z = 3 — 2t, and 
now the set of solutions of the solvable Diophantine equation 21 + 4y = 3 — 3(3 — 24) or 

a+ 2y = —-3+ δέ (5.243c 
is sought for every t € Z. 
The equation (5.243c) is solvable since gcd(1,2) = 1|(—3 + 3t). Now 1-(—1) +2- 1 = 1 and 1- (3 — 
3t) +2-(—3+3t) = —3 + 3t. The set of solution is {((3 — 3t) + 2s, (—3 + 3t) — s)|s € Z}. That implies 
x = (3 — 3t) + 2s, y = (—3 + 3t) — s, and {(3 — 3t + 25, —3 + 3t — s,3 — 2t)|s,t € Z} so obtained is 
the set of solutions of (5.2432). 


5.4.8 Congruences and Residue Classes 


1. Congruences 

Let m be a positive integer m, m > 1. If two integers a and b have the same remainder, when divided 
by m, then a and b are called congruent modulo m, denoted by a = b mod m or a = b(m). 
M3=13mo0d5, 38=13mod5, 3= —2 mod 5. 


Remark: Obviously, a = b mod m holds iff m is a divisor of the difference a — b. Congruence modulo 
m is an equivalence relation (see 5.2.4, 1., p. 334) in the set of integers. Note the following properties: 


a = a mod m for every a € Z, (5.244a 
αξ b mod m => b= a mod m, 5.244b 
a z b mod m ^b z c mod m => a= c mod m. 5.244c 


2. Calculating Rules 


αξ b mod m ^c z d mod m — ας b 4 d mod m, (5.245a 

a=bmodmAc=dmodm>a-c=b-dmodm, 5.245b 

a:cz b-cmod m ^ ged(c,m) —1 > az b mod m, 5.245c 

a-c=b-cmodmA cz 0 a = b mod — ———.. 5.245d 
gcd(c, m) 


3. Residue Classes, Residue Class Ring 
Since congruence modulo m is an equivalence relation in Z, this relation induces a partition of Z into 
residue classes modulo m: 

[a], = (x|x € Z^ x = a mod m]. (5.246 
The residue class “ a modulo m ” consists of all integers having equal remainder if divided by m. Now 
[a]. = [b] iff a = b mod m. 
There are exactly m residue classes modulo m, and normally they are represented by their smalles 
non-negative representatives: 

Olm, [1]; - .., [m — Ί]ηι. (5.247 
In the set Zm of residue classes modulo m, residue class addition and residue class multiplication are 


defined by 
[α]ηι Θ [Bln = [a + bl; (5.248 
[a]; © [b]; ο (5.249 
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These residue class operations are independent of the chosen representatives, i.e., 
la]m = [α]μι and [b],, = [b'|m imply 

[α]ν ® [b]. = [a]. © [P]. and [a] © [b]. = [a]. © [V]. (5.250) 

The residue classes modulo m form a ring with unit element, with respect to residue class addition 

and residue class multiplication (see 5.4.3, 1., p. 377), the residue class ring modulo m. 1f p is a prime 

number, then the residue class ring modulo p is a field (see 5.3.7, 2., p. 361). 

4. Residue Classes Relatively Prime to m 

A residue class [a]m with gcd(a, m) = 1 is called a residue class relatively prime to m. If p is a prime 

number, then all residue classes different from [0], are residue classes relatively prime to p. 

The residue classes relatively prime to m form an Abelian group (5.3.3.1,1., p. 336) with respect to 

residue class multiplication, the so-called group of residue classes relatively prime to m. The order of 

this group is (m), where φ is the Euler function (see 5.4.4, 1., p. 381). 

W A: [1]s, [3]s, [5]s, [7]s are residue classes relatively prime to 8. 

WB B: [1|-.[2]5.|3]5. [4]; are residue classes relatively prime to 5. 

MC: ϱ(8) = φ(5) = 4 is valid. 

5. Primitive Residue Classes 

A residue class [a] relatively prime to m is called a primitive residue class if it has order y(m) in the 

group of residue classes relatively prime to m. 

Bl A: [2]; is a primitive residue class modulo 5, since ([2]5)? = [4]5, ([2]5}} = [3]5, ([2]5)* = [1]s. 

E B: There is no primitive residue class modulo 8, since [1] has order 1, and [3]s, [5]s, [7]s have order 

2 in the group of residue classes relatively prime to m. 


Remark: There is a primitive residue class modulo m, iff m = 2, m = 4, m = pë or m = 2p", where p 
is an odd prime number and k is a positive integer. 

If there is a primitive residue class modulo m, then the group of residue classes relatively prime to m 
forms a cyclic group. 

6. Linear Congruences 

1. Definition If a, b and m > 0 are integers, then 


ax = b(m) (5.251) 
is called a linear congruence (in the unknown 2). 
2. Solutions An integer α” satisfying ax* = b(m) is a solution of this congruence. Every integer, 


which is congruent to z* modulo m, is also a solution. In finding all solutions of (5.251) it is sufficient 
to find the integers pairwise incongruent modulo m which satisfy the congruence. 

The congruence (5.251) is solvable iff ged(a, m) is a divisor of b. In this case, the number of solutions 
modulo m is equal to ged(a, m). 
In particular, if gcd(a, m) = 1 holds, the congruence modulo m has a unique solution. 

3. Solution Method There are different solution methods for linear congruences. It is possible to 
transform the congruence ax = b(m) into the Diophantine equation ax + my = b, and to determine 
a special solution (x°, y?) of the Diophantine equation a/z + m'y = b! with a’ = a/ged(a, m), m = 
m/gcd(a, m), b! = b/gcd(a, m) (see 5.4.2, 1., p. 375). 


The congruence a'z = b'(m’) has a unique solution since gcd(a', m") = 1 modulo τη’, and 


x = a? (m). (5.2522) 
The congruence ax b(m) has exactly gcd(a, m) solutions modulo m: 
a9, a9 +m, a? + 2m,...,2° + (ged(a, m) — 1)m. (5.252b) 


W 114r = 6 mod 315 is solvable, since gcd(114, 315) is a divisor of 6; there are three solutions modulo 
315. 

38r = 2 mod 105 has a unique solution: x = 94 mod 105 (see 5.4.2, 4., p. 376). 94, 199, and 304 are 
the solutions of 114r = 3 mod 315. 
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7. Simultaneous Linear Congruences 
If finitely many congruences 


x = (πι), £ = be(ma),..., £ = Ότι) (5.253 
are given, then (5.253) is called a system of simultaneous linear congruences. A result on the set ο 
solutions is the Chinese remainder theorem: Consider a given system v = by(mi), x = bo(ma),...,0 = 
b,(m,), where mi, ma, ..., m, are pairwise coprime numbers. If 
m m m 
M Ξ M1: Το" Mi, αι = «θα = E (5.254a 
mi m» mi 
and z; is chosen such that ajx; = b;(m;) for j = 1,2,....t, then 
a! = aya, + aot +++ + UT (5.254b 


is a solution of the system. The system has a unique solution modulo m, i.e., if z' is a solution, then x” 
is a solution, too, iff x” = a'(m). 

Wi Solve the system x = 1(2), x = 2(3), x = 4(5), where 2,3, 5 are pairwise coprime numbers. 
Then m = 30,a; = 15,a2 = 10,a3 = 6. The congruences 1511 = 1(2), 102» = 2(3), 643 = 4(5) 
have the special solutions xı = 1, x2 = 2, x3 = 4. The given system has a unique solution modulo m: 
x =15-1410-2+6-4(30), i.e., x = 29 (30). 

Remark: Systems of simultaneous linear congruences can be used to reduce the problem of solving 
non-linear congruences modulo m to the problem of solving congruences modulo prime number powers 


(see 5.4.3, 9., p. 380). 
8. Quadratic Congruences 


1. Quadratic Residues Modulo m One can solve every congruence ax? + br + c = 0(m) if one 
can solve every congruence 23 = a(m): 

aa? + bx +c = 0(m) & (2ax 4- b)? = b? — 4ac(m). (5.255) 
First quadratic residues modulo m are considered: Let m € IN, m > 1 and a € Z, gcd(a, m) = 1. The 
number a is called a quadratic residue modulo m iff there is an x € Z with x? = a(m). 


If the canonical prime factorization of m is given, i.e., 
oo 
m= II». (5.256) 
il 
then r is a quadratic residue modulo m iff r is a quadratic residue modulo p; for i = 1,2,3,.... 
: : : ς - a ae : 
If ais a quadratic residue modulo a prime number p, then this is denoted by (0) = 1; ifa is a quadratic 
P 
: T α 
non-residue modulo p, then it is denoted by (2) = —1 (Legendre symbol). 
p 
W The numbers 1,4, 7 are quadratic residues modulo 9. 


2. Properties of Quadratic Congruences 


(E1)  p[ab and a = b(p) imply (0) = C) ; (5.257a) 
p 


(E2) C) =1. (5.257b) 


(E3) e -(-ηΖ. (5.257c) 


380 5. Algebra and Discrete Mathematics 


(E4) (2) = (0) C) in particular (=) = (0) | (5.2574) 


(ES) (=) zu. (52876) 


(E6) Quadratic reciprocity law: If p and q are distinct odd prime numbers, 


then (2) l (3) spa. (5.2571) 


a wr) (απ) | (3η) C) as) (=) | (5) ( D^ (5) (5) ~ 
-(-1 8 =1 


In General: A congruence x? = a(2?), gcd(a, 2) = 1, is solvable iff a = 1(4) for a = 2 anda = 1(8) 
for a > 3. If these conditions are satisfied, then modulo 2° there is one solution for a = 1, there are 
two solutions for a = 2 and four solutions for a > 3. 
A necessary condition for solvability of congruences of the general form 
27 =a(m), m= 2 pt pg? p, ged(a,m) = 1, (5.258a) 
is the solvability of the congruences 
a a a 
a = 1(4) fora =2, a=1(8) fora>3, (<) =1, (<) =A, bes, (<) Ξ 1. (5.258b) 
Pı P2 Pt 
If all these conditions are satisfied, then the number of solutions is equal to 2! for a = 0 anda = 1, 
equal to 2/11 for a = 2 and equal to 2'*? for a > 3. 
9. Polynomial Congruences 


If m4, mo,..., m, are pairwise coprime numbers, then the congruence 

f(x) = αμα" + a, 4x" 3 + +++ + ag = ΟΠ m) (5.259a) 
is equivalent to the system 

f(x) = 0(mi), f(x) = 0(ms3), ..., f(x) = O(m). (5.259b) 
If kj is the number of solutions of f(x) = 0(m;) for j = 1,2,...,t, then kiko: ks is the number of 
solutions of f(x) = O(mqmoa--- m). This means that the solution of the congruence 

f(x) = 0 (pt pg? - - - pe), (5.259c) 
where pi, po, ..., p, are primes, can be reduced to the solution of congruences f(x) = 0(p*). Moreover, 


these congruences can be reduced to congruences f(x) = 0(p) modulo prime numbers in the following 
way: 

a) A solution of f(x) = 0(p?) is a solution of f(x) = 0(p), too. 

b) A solution z = zi(p) of f(x) = O(p) defines a unique solution modulo p? iff f'(x) is not divisible 
by p: 

Suppose f (x1) = 0(p). Let x = x, + pt; and determine the unique solution t} of the linear congruence 


e + f'(z1)ti = O(p). (5.260a) 


Substitute tı = ti + pty into £ = σι + pti, then £ = x + p?t is obtained. Now, the solution t5 of the 
linear congruence 


. --/ωυ)η = 0(p) η 
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has to be determined modulo p?. By substitution of tg = th + pt3 into £ = zo + p?t3 the result 
x = x3 + ρα is obtained. Continuing this process yields the solution of the congruence f(x) = 0 (p°). 
WI Solve the congruence f(x) = xt + Τα +4 = 0(27). f(x) = x! ἠ- Τα +4 = 0(3) implies x = 
1 (3), ie. x = 1 + 3tı. Because of f'(x) = 4a? + 7 and 3/f’(1) now the solution of the congruence 
f(1)/3 + f'(1) -tı = 4 + 11t, = 0 (3) is searched for: tı = 1 (3), i.e., tı = 1+ δὲ and x = 45965. 
Then consider f(4)/9 + f'(4) - t2 = 0 (3) and the solution tz = 2 (3) is obtained, i.e., tg = 2 + 349 and 
x = 22+ 2Tt3. Therefore, 22 is the solution of z* + 7x + 4 = 0 (27), uniquely determined modulo 27. 


5.4.4 Theorems of Fermat, Euler, and Wilson 
1. Euler Function 


For every positive integer m with m > 0 one can determine the number of coprimes x with respect to 
m for 1 < x € m. The corresponding function y is called the Euler function. The value of the function 
(m) is the number of residue classes relatively prime to m (s. 5.4.3, 4., p. 378). 

For instance, φ(1) = 1, φ(2) = 1, (8) = 2, φ(4) = 2, φ(5) = 4, (6) = 2, (7) = 6, ϱ(8) = 4, etc. 
In general, φ(ρ) = p — 1 holds for every prime number p and o(p*) = ρ΄ — p^^! for every prime number 
power p^. If m is an arbitrary positive integer, then y(m) can be determined in the following way: 


1 
φίπι) 5 m][ (: - :) ; (5.261a) 


pim 


where the product applies to all prime divisors p of m. 


E (360) = φ(25. 93. 5) = 360 - (1 — 1). (1 — Ὁ). (1-- τ) = 96. 


Furthermore 
» (6) =m (5.261b) 
d|m 

is valid. If ged(m, n) = 1 holds, then we get y(mn) = y(m)y(n). 

Β (000) = y(2° -3?- 5) = (25) - (3?) - ρ(5) = 4-6: 4 = 96. 

2. Fermat-Euler Theorem 

The Fermat-Euler theorem is one of the most important theorems of elementary number theory. If a 

and m are coprime positive numbers, then 


a? = 1(m). (5.262) 


E Determine the last three digits of 99" in decimal notation. This means, determine x with x = 
99" (1000) and 0 € x < 999. Now (1000) = 400, and according to Fermats theorem 94° = 1 (1000 
Furthermore 9° = (80 + 1)^ -9 = ((¢)80°- 14 + (580! - 13) -9 = (1 1-4: 80) -9 = —79 - 9 = 89 (400 
From that it follows that 9” = 999 = (10-- 15 = ed 10°. (—1)994 (5) 10! (—1)954 (2) 10?-(—1)*" = 
—1-- 89-10 — 3916. 100 = —1 — 110 + 400 = 289(1000). The decimal notation of 9% ends with the 
digits 289. 

Remark: The theorem above for m = p, i.e., φ(ρ) = p— 1 was proved by Fermat; the general form was 
proved by Euler. This theorem forms the basis for encoding schemes (see 5.4.6). It contains a necessary 
criterion for the prime number property of a positive integer: If p is a prime, then a?~' = 1(p) holds for 
every integer a with p fa. 

3. Wilson's Theorem 


There is a further prime number criterion, called the Wilson theorem: 
Every prime number p satisfies (p — 1)! = —1(p). 
The inverse proposition is also true; and therefore: 


). 
; 
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The number p is a prime number iff (p — 1)! = —1(p). 


5.4.5 Prime Number Tests 


In the followings two stochastic prime tests will be presented which are useful at large numbers to test 
the prime property with a sufficiently small probability of mistakes. With these tests it is possible to 
show that a number is not a prime, without knowing its prime factors. 

1. Fermat-Prime Number Test 

Let n be an odd natural number and a an integer such that gcd(a, n) = 1 and a^^! = 1 (mod n).. Then 
n is called a pseudoprime to base a. 

W A: 341 is a pseudo prime to basis 2; 341 is not a pseudo prime to basis 3. 

Test: Let an odd natural number n > 1 be given. Choose a € Z, \ {0}. 


e If the gcd (a, n) > 1, then n is not prime. 

e If the ged (a, n) = 1 and { a - (modh) } then n { Los pase } the test to base a. If n did 
a" z] (modnm) did not pass 

not pass the test, then n is not a prime. If n did pass the test, then it may be a prime, but more tests 

are needed with other base, i.e. tests with further values of a. 

E B: n = 15: The test with a = 4 gives 411 = 1(mod15). The test with a = 7 gives 71^ = 4 Æ 

1 (mod 15). Hence 15 is not a prime. 

Bl C: n = 561: The test with arbitrary a € Zsg; V {0} with ged(a,561) = 1 results in à?" = 

1 (mod 561). But 561 = 3- 11: 17 is not a prime. 

Remark: A composite number n for which a^^! = 1 (mod n) for all a € Z, V {0} with gcd(a, n) = 1 is 

called a Carmichael number. 


If n is not a prime and not a Carmichael number, then one can show that the level of error of the first 
kind to get a false result using k numbers with gcd(a, n) =1 is at most 1/28. At least for the half of the 
numbers in Zn \ {0} with βοά(α, n) —1 the relation a^! z 1 (mod n) holds. 


2. Rabin-Miller Prim Number Test 

The Rabin-Miller primality test is based on the following statement (*): 

Let n > 2 be a prime, n — 1 = 2'u (u is odd), g.c.d(a, n) =1. Then: 

a" = 1 (mod τι) or a?” = —1 (mod n) for some j € {0,1,...,t— 1}. (κ) 
Every odd natural number n > 1 can be tested about prime property in the following way: 

Test: Choose a € Z, V {0} and find the representation n — 1 = 2*u (u is odd). 

e If g.c.d(a, n) > 1, then n is not a prime. 


e If g.c.d(a, n) =1, then the sequence a" (mod n), a?" (modn), ..., αὖ 


; τα mod n) is calculated until a 
value is found which satisfies (κ). These elements are calculated by repeated squaring mod n. If there 
is no such value, then n is not a prime. Otherwise n did pass the test to basis a. 
W A: n — 561, and should be tested by different values of a: 
255 = 263 # +1 (mod 561), 
2° 21662 —1 (mod 561) 
yp 2941.95 m. ; 
n=1=2 35, a=2: yn 674-1 (mod 561), 
2280 = 421 5: —] (mod 561). 
If choosing k different values randomly and independently and n passes the test to basis a for each, 
then the error rate of the first kind that n is not a prime is < 1/4*. In the practice k = 25 is chosen. 
E B: There is only one number < 2,5- 1010 such that it passes the test to basis a = 2, 3, 5, 7 and it is 
not a prime. 


561 is not a prime. 
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3. AKS Prime Number Test 

The AKS primality test is based on a polynomial algorithm to determine whether a number is prime or 
composite. Published by Agrawal, Kayal, and Saxena, in 2002, meanwhile it is evident that the prime 
property can be tested efficiently for any natural number. 


The test is based on the following statements: 

Ifm > lisa natural number and r is a prime satisfying the assumtions 

e n is not divisible by primes < r, 

e ri Z 1 (mod n) for i = 1,2,..., (logo n)?|*, 

e (x +a)" = x" +a (mod r” — 1,n) for every 1 € a € yr logn, 

Then n is a power of a prime. 

Let n > 1 be an odd natural number whose prime characteristic is to be tested, and m :— | (log, n)?]. 
If n « 5690034, then it is tested by comparing it to a list of known prime numbers whether n is a prime. 
For n > 5690034 holds n > m: 

Test: 

ο Check, whether n can be divided by a natural number from the interval [3, m]. If yes, then n is not 
a prime. 

ο Otherwise take a prime r < m, such that r^ # 1( modn) for i = 1,2,...,|(logyn)?|. (It can be 
proven, that such a prime r exists.) 

ο Check, whether the congruence (x + a)" = z^ + a( mod a” — 1,0) for a = 1,2, Vr | (logs n) | holds. 
If not, then n is not a prime. If yes, then n is a power of a prime. In this case it is to be tested, whether 
natural numbers q and k > 1 exist, for which n = q^. If not, then n is a prime. 

Different to the known and efficient stochastic algorithms, the result of the test can be trusted without 
even a negligible small error probability of mistakes. However in cryptography the Rabin-Miller test is 
preferred. 


5.4.6 Codes 
5.4.6.1 Control Digits 


In the information theory methods are provided to recognize and to correct errors in data combinations. 
Some of the simplest methods are represented in the form of the following control digits. 


1. International Standard Book Number ISBN-10 
A simple application of the congruence of numbers is the use of control digits with the Internationa 
Standard Book Number ISBN. A combination of 10 digits of the form 

ISBN a — bcd — ef ghi — p. (5.263a 
is assigned to a book. The digits have the following meaning: a is the group number (for example, a = 3 
tells us that the book originates from Austria, Germany, or Switzerland), bcd is the publisher's number, 
and ef ghi is the title number of the book by this publisher. A control digit p will be added to detec 
erroneous book orders and thus help reduce expenses. The control digit p is the smallest non-negative 
digit that fulfils the following congruence: 

10a + 9b + 8c + 7d + 6e + 5f + 4g + 3h + 2i + p = 0(11). (5.263b 
If the control digit p is 10, a unary symbol such as X is used (see also 5.4.6, 3., p. 384). A presented 
ISBN can now be checked for a match of the control digit contained in the ISBN and the control digi 
determined from all the other digits. In case of no match an error is certain. The ISBN control digi 
method permits the detection of the following errors: 


1. Single digit error and 
2. interchange of two digits. 


Statistical investigations showed that by this method more than 9096 of all actual errors can be detected. 
All other observed error types have a relative frequency of less than 196. In the majority of the cases 


*|a| is symbol for “greatest integer € x”. 
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the described method will detect the interchange of two digits or the interchange of two complete digit 
blocks. 
2. Central Codes for Drugs and Medicines 


In pharmacy, a similar numerical system with control digits is employed for identifying medicaments. 
In Germany, each medicament is assigned a seven digit control code: 


abcde f p. (5.264a) 
'The last digit is the control digit p. It is the smallest, non-negative number that fulfils the congruence 
2a + 3b + 4c + 5d + 6e + 7f = (11). (5.264b) 


Here too, the single digit error or the interchange of two digits can always be detected. 
3. Account Numbers 


Banks and saving banks use a uniform account number system with a maximum of 10 digits (depending 
on the business volume). The first (at most four) digits serve the classification of the account. The 
remaining six digits represent the actual account number including a control digit in the last position. 
The individual banks and saving banks tend to apply different control digit methods, for example: 

a) The digits are multiplied alternately by 2 and by 1, beginning with the rightmost digit. A control 
digit p will then be added to the sum of these products such that the new total is the next number 
divisible by 10. Given the account number abcd ef ghi p with control digit p, then the congruence 


2i -- h -- 2g 1- f 1-26 1 d - 2e b -- 2a 4- p € 0 (mod 10). (5.265) 
holds. 


b) As in method a), however, any two-digit product is first replaced by the sum of its two digits and 
then the total sum will be calculated. 


In case a) all errors caused by the interchange of adjacent digits and almost all single-digit errors will 
be detected. 

In case b), however, all errors caused by the change of one digit and almost all errors caused by the 
interchange of two adjacent digits will be discovered. Errors due to the interchange of non-adjacent 
digits and the change of two digits will often not be detected. 


The reason for not using the more powerful control digit method modulo 11 is of a non-mathematical 
nature. The non-numerical sign X (instead of the control digit 10 (see 5.4.6, 1., p. 383)) would require 
an extension of the numerical keyboard. However, renouncing those account numbers whose control 
digit has the value of 10 would have barred the smooth extension of the original account number in à 
considerable number of cases. 


4. European Article Number EAN 


EAN stands for European Article Number. It can be found on most articles as a bar code or as a string 
of 13 or 8 digits. The bar code can be read by means of a scanner at the counter. 

In the case of 13-digit strings the first two digits identify the country of origin, e.g., 40, 41, 42 and 
43 stand for Germany. The next five digits identify the producer, the following five digits identify a 
particular product. The last digit is the control digit p. 

This control digit will be obtained by first multiplying all 12 digits of the string alternately by 1 and 
3 starting with the left-most digit, by then totalling all values, and by finally adding a p such that the 
next number divisible by 10 is obtained. Given the article number abcde f ghikmn p with control digit 
p, then the congruence 


a 4- 3b -- c 4- 3d - e - 3f }- g --3h }- d -- 3k 4 m 4 3n +p z 0 (mod 10). (5.266) 
holds. 


This control digit method always permits the detection of single digit errors in the EAN and often the 
detection of the interchange of two adjacent digits. The interchange of two non-adjacent digits and the 
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change of two digits will often not be detected. 
5.4.6.2 Error correcting codes 


1. Model of Data Transmission and Error Correction 

At transmission of messages through noisy channels the correction of errors is often possible. The mes- 
sage is coded first, then after transmission the usually biased codes are corrected into the right ones, 
so after decoding them the original message can be recovered. That case is considered now, when the 
length of the words of the message is k, and the length of the coded words is n, and both of them consist 
of only zeros and ones. Then k is the number of information positions and n — k is the number of redun- 
dant positions. Every word of the message is an element of GF(2)* (see 5.3.7.4 p. 363) and every word 
of the code is an element of GF(2)". To simplify the notation the words of the message are written in 


the form a1, d2,...,a,, and the words of the code in the form c,c5,...,c,. The words of the message 
are not transmitted, only the words of the code are. 
An often used idea of error correction is to convert the transmitted word di, d», .. . , dn first into a valid 


codeword c1, c, ... , c, which differs from it in the least number of digits (decoding MLD). It depends 
on the properties of coding and the transmission channels that how many errors can be detected and 
corrected in this way. 
W At digit repeating codes the message word 0 is represented by the codeword 0000. If after trans- 
mission the receiver gets the word 0010, then he assumes that the original codeword was 0000, and 
it is decoded as message word 0. But if the received word is 1010, then similar assumption can not 
be applied, since the message word 1 is coded as 1111, so the difference is similar. At least it can be 
recognized that there is some error in the received word. 
2. t-Error Correcting Codes 
The set of all codewords is called code C. The distance of two codewords is the number of digits (po- 
sitions) in which the two words differ from each other. The minimal distance dy, (C) of codes is the 
smallest distance which occurs between the codewords of C. 
W For Cı = (0000, 1111}, dmin(C): = 4. For C = (000,011, 101, 110}, duin (C2) = 2, since there are 
codewords which have distance 2. For C3 = (00000, 01101, 10111, 11010}, dmin(C3) = 3, there are code- 
words in C3 whose distance is 3. 
If the minimal distance dmin(C) of a code C is known, then it is easy to recognize how many transmis- 
sion errors can be corrected. Codes, correcting t errors, are called t-error correcting. A code C is t-error 
correcting if dmin(C) > 2t 4- 1. 
W (Continuation) Οι is 1-error correcting, C2 is 0-error correcting (it means, that no error can be cor- 
rected), C3 is l-error correcting. ! 
For every t-error correcting code C € GM(2)" holds > (3 :|C| € 2”. If equality holds, then C is called 
i=0 


t-perfect. 

Β The digit repeating code C = (00...0,11...1) C GF(2)*"*" is t-perfect. 

3. Linear Codes 

A non-empty subset C C GF(2)" is called (binary) linear code, if C is a sub-vector space of GF(2)"^. If 
a linear code C C GF(2)" has dimension k, then it is called an (n, k) linear code. 

W (Continuation) C; is a (4,1) linear code, C» is a (3,2) linear code, C3 is a (5,2) linear code. In the case 
of linear codes the minimal distance (and as a consequence the number of correctible errors) is easy 
to determine: The minimal distance of such a code is the smallest distance of a non-zero vector from 
the zero vector of the vector space. The minimal distance can be found if the minimal number of ones, 
except with all zeros, in the codewords is given. 

For every (n, k) linear code there is a generating matriz G for which C = {aG | a € GF(2)*}: 


fur +++ Jin σι 
Gum =|: |. (5.267) 


θα +++ Din κκ Gk 
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The code is uniquely defined by the generating matrix; the codeword of the message word a1a2 .. . ap is 
determined in the following way: 


Q102...Qj > Q1g1 + a292 +... + ange - (5.268) 
Ἔ-----------μ-------------» 
aG 


In the case of an (n, k) linear code C a check matrix is needed for decoding: 
Aha aJ 
H= $ : d à (5.269) 


ου wee ha kn (n—k)xn 


The (binary) linear code C is 1-error correcting, if the columns of H are pairwise different and non-zero 
vectors. If the result of the transmission is the word d = didə... dn, then Hd" is calculated. If the 
result is the zero vector, then d is a codeword. Otherwise if H dT is the i-th column of the check matrix 
H, then the corresponding codeword is d + e;, where e; = (0,0,...,0,1,0,...,0) and the 1 is on the 
i-th position. 
4. Cyclic Codes 
Cyclic codes are the most investigated linear codes. They provide efficient coding and decoding. 
A (binary) (n, k) linear code is called cyclic if for every codeword cic» .. . Cn the codeword obtained by 
a cyclic right shift of the components is also a codeword, i.e. cgc1...c4 1 € C = €, 46€901...04 δες 
W C = (000,110, 101,011} is a cyclic (3,2) linear code. 
'To have an efficient work with cyclic codes, the codewords are represented by polynomials of degree 
X n — 1 with coefficients from GF(2): C = (000, 110, 101, 011} is a cyclic (3, 2)-linear code. 
Α (binary) (n, k) linear code C is cyclic if and only if for every c(x) 

c(r) € C > c(x): x(mod z^ -1) € C (5.210) 
A cyclic (n,k) linear code can be described by a generating polynomial and a control polynomial as 
follows: The generating polynomial g(a) of degree n — k (k € (1,2,...,n — 1}) is a divisor of x” — 1. 
The polynomial h(x) of degree k for which g(x)h(x) = x" — 1 is called the control polynomial. Coding 
of ajay... a, in polynomial representation a(x) is given by 

a(x) + α(α): g(a). (5.271) 
Polynomial d(x) is an element of the code, if the generator polynomial g(x) is a divisor of d(x), or the 
control polynomial h(x) satisfies the relation d(z)h(x) = 0mod x” — 1. 


An important class of cyclic codes are the BCH-codes. Here a lower bound 6 of the minimal distance 
and with it a lower bound for the number of errors can be required for which code should be corrected. 
Here 6 is called the design distance of the code. 

A (binary) (n, k) linear code C is a BCH-code with design distance ὁ if for the generating polynomial 
g(a): 

g(x) = lem(ma (x), mava(x), ..., mascs-2(m)) , (5.272) 
where a is a primitive n-th unit root and b is an integer. The polynomials πιω (x) are minimal polyno- 
mials of a. 

For a BCH-code C with design distance ὁ the relation d, (C) > ὃ must hold. 


5.5 Cryptology 
5.5.1 Problem of Cryptology 


Cryptology is the science of hiding information by the transformation of data. 

The idea of protecting data from unauthorized access is rather old. During the 1970s together with the 
introduction of cryptosystems on the basis of public keys, cryptology became an independent branch of 
science. Today, the subject of cryptological research is how to protect data from unauthorized access 
and against tampering. 
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Beside the classical military applications, the needs of the information society gain more and more in 
importance. Examples are the guarantee of secure message transfer via email, electronic funds transfer 
(home-banking), the PIN of EC-cards, etc. 

Today, the fields of cryptography and cryptanalysis are subsumed under the notion of cryptology. Cryp- 
tography is concerned with the development of cryptosystems whose cryptographic strengths can be 
assessed by applying the methods of cryptanalysis for breaking cryptosystems. 


5.5.2 Cryptosystems 


An abstract cryptosystem consists of the following sets: a set M of messages, a set C of ciphertexts, 
sets K and K’ of keys, and sets E and D of functions. A message m € M will be encrypted into a 
ciphertext c € C by applying a function E € E together with a key k € K, and will be transmitted via 
a communication channel. The recipient can reproduce the original message m from c if he knows an 
appropriate function D € D and the corresponding key k’ € K’. There are two types of cryptosystems: 


1. Symmetric Cryptosystems: The conventional symmetric cryptosystem uses the same key k for 
encryption of the message and for decryption of the ciphertext. The user has complete freedom in 
setting up his conventional cryptosystem. Encryption and decryption should, however, not become 
too complex. In any case, a trustworthy transmission between the two communication partners is 
mandatory. 


2. Asymmetric Cryptosystems: The asymmetric cryptosystem (see 5.5.7.1, p. 391) uses two keys, 
one private key (to be kept secret) and a public key. The public key can be transmitted along the same 
path as the ciphertext. The security of the communication is warranted by the use of so-called one-way 
functions (see 5.5.7.2, p. 391), which makes it practically impossible for the unauthorized listener to 
deduce the plaintext from the ciphertext. 


5.5.3 Mathematical Foundation 


An alphabet A = (ao, a1,..., an-ı} is a finite non-empty totally ordered set, whose elements a; are 
called letters. |A| is the length of the alphabet. A sequence of letters w = aja, . . . al, of length n € IN 
and a; € A is called a word of length n over the alphabet A. A" denotes the set of all words of length 
n over A. Let n, m € N, let A, B be alphabets, and let 5 be a finite set. 

A cryptofunction is a mapping t: A" x S — B™ such that the mappings ts: A" — B™: w > t(w,s) 
are injective for all s € S. The functions ἐς and ἐς are called the encryption and decryption function, 
respectively. w is called plaintext, ts(w) is the ciphertext. 

Given a cryptofunction t, then the one-parameter family {ts }ses is a cryptosystem Ts. The term cryp- 
tosystem will be applied if in addition to the mapping t, the structure and the size of the set of keys is 
significant. The set S of all the keys belonging to a cryptosystem is called the key space. Then 


Ts = (t: A” > A”|s € S} (5.273) 
is called a cryptosystem on A". 
If Ts is a cryptosystem over A” and n = 1, then t, is called a stream cipher; otherwise ts is called a 
block cipher. 


no 


Cryptofunctions of a cryptosystem over are suited for the encryption of plaintext of any length. 
The plaintext will be split into blocks of length n prior to applying the function to each individual 
block. The last block may need padding with filler characters to obtain a block of length n. The filler 
characters must not distort the plaintext. 

There is a distinction between context-free encryption, where the ciphertext block is only a function of 
the corresponding plaintext block and the key, and context sensitive encryption, where the ciphertext 
block depends on other blocks of the message. Ideally, each ciphertext digit of a block depends on all 
digits of the corresponding plaintext block and all digits of the key. Small changes to the plaintext or 
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to the key cause extended changes to the ciphertext (avalanche effect). 


5.5.4 Security of Cryptosystems 

Cryptanalysis is concerned with the development of methods for deducing from the ciphertext as much 
information about the plaintext as possible without knowing the key. According to A. Kerkhoff the 
security of a cryptosystem rests solely in the difficulty of detecting the key or, more precisely, the de- 
cryption function. The security must not be based on the assumption that the encryption algorithm is 
kept secret. There are different approaches to assess the security of a cryptosystem: 

1. Absolutely Secure Cryptosystems: There is only one absolutely secure cryptosystem based on 
substitution ciphers, which is the one-time pad. This was proved by Shannon as part of his information 
theory. 

2. Analytically Secure Cryptosystems: No method exists to break a cryptosystem systematically. 
The proof of the non-existence of such a method follows from the proof of the non-computability of a 
decryption function. 

3. Secure Cryptosystems according to Criteria of Complexity Theory: There is no algorithm 
which can break a cryptosystem in polynomial time (with regard to the length of the text). 

4. Practically Secure Cryptosystems: No method is known which can break the cryptosystem 
with available resources and with justified costs. 
Cryptanalysis often applies statistical methods such as determining the frequency of letters and words. 
Other methods are an exhaustive search, the trial-and-error method and a structural analysis of the 
cryptosystem (solving of equation systems). 
In order to attack a cryptosystem one can benefit from frequent flaws in encryption such as using stereo- 
type phrases, repeated transmissions of slightly modified text, an improper and predictable selection 
of keys, and the use of filler characters. 


5.5.4.1 Methods of Conventional Cryptography 

In addition to the application of a cryptofunction it is possible to encrypt a plaintext by means of cryp- 
tological codes. A code is a bijective mapping of some subset A’ of the set of all words over an alphabet 
A onto the subset B’ of the set of all words over the alphabet B. The set of all source-target pairs of 
such a mapping is called a code book. 


today evening 0815 
tomorrow evening 1113 


'The advantage of replacing long plaintexts by short ciphertexts is contrasted with the disadvantage 
that the same plaintext will always be replaced by the same ciphertext. Another disadvantage of code 
books is the need for a complete and costly replacement of all books should the code be compromised 
even partially. 

In the following only encryption by means of cryptofunctions will be considered. Cryptofunctions have 
the additional advantage that they do not require any arrangement about the contents of the messages 
prior to their exchange. 
Transposition and substitution constitute conventional cryptoalgorithms. In cryptography, a transpo- 
sition is a special permutation defined over geometric patterns. The substitutions will now be discussed 
in detail. There is a distinction between monoalphabetic and polyalphabetic substitutions according to 
how many alphabets are used for presenting the ciphertext. Generally, a substitution is termed polyal- 
phabetic even if only one alphabet is used, but the encryption of the individual plaintext letter depends 
on its position within the plaintext. 


A further, useful classification is the distinction between monographic and polygraphic substitutions. 
In the first case, single letters will be substituted, in the latter case, strings of letters of a fixed length 
ail. 
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5.5.4.2 Linear Substitution Ciphers 

Let A = {ao,@1,-..,@n—1} be an alphabet and k,s € {0,1,...,n — 1} with ged(k,n) = 1. The 
permutation t^, which maps each letter a; to t*(a;) = axis, is called a linear substitution cipher. There 
exist n y(n) linear substitution ciphers on A. 

Shift ciphers are linear substituting ciphers with k = 1. The shift cipher with s = 3 was already used 
by Julius Caesar (100 to 44 BC) and, therefore, it is called the Caesar cipher. 


5.5.4.3 Vigenére Cipher 

An encryption called the Vigenére cipher is based on the periodic application of a key word whose 
letters are pairwise distinct. The encryption of a plaintext letter is determined by the key letter that 
has the same position in the key as the plaintext letter in the plaintext. This requires a key that is as 
long as the plaintext. Shorter keys are repeated to match the length of the plaintext. 


A version of the Vigenère cipher attributed to L. Carroll utilizes E BOE UB s 
the so-called Vigenére tableau (see picture) for encryption and A A B CD EF 
decryption. Each row represents the cipher for the key letter B B CD EFG 
to its very left. The alphabet for the plaintext runs across the C CD EF GH 
top. The encryption step is as follows: Given a key letter D and D DEF GHI 
a plaintext letter C, then the ciphertext letter is found at the E EF GH I J 
intersection of the row labeled D and the column labeled C; the EFE E G H L J κ 


ciphertext is F. Decryption is the inverse of this process. 


[| Let the key be “ HUT”. 


Plaintext: O N C EU PON AT I M E 
Key: HU THUTHUTHUTH 
Ciphertex: V H V LO I V HT AC F L 


Formally, the Vigenère cipher can be written in the following way: let a; be the plaintext letter and a; be 
the corresponding key letter, then k = i + determines the ciphertext letter αμ. In the above example, 
the first plaintext letter is O = a14. The 15-th position of the key is taken by the letter H = a7. Hence, 
k=i+j = 14 + 7 = 21 yields the ciphertext letter a; = V. 


5.5.4.4 Matrix Substitution 


Let A = {αοναι,..., αμ. 1} be an alphabet and S = (sij), sij € {0,1,..., m — 1}, be a non-singular 
matrix of type (m, πι) with gcd(detS,n)= 1. The mapping which maps the block of plaintext a,(1), 
a4(2),-- - » 4t(m) to the ciphertext determined by the vector (all arithmetic modulo n, vectors transposed 
as required) 


T 
Qu) 


αι. 
iu (5.274) 
Gt(m) 
is called the Hill cipher. This represents a monoalphabetic matrix substitution. 
1483 Let the letters of the alphabet be enumerated ag = A, a4 =B,...,a25 = 
Oo S=| 852]. Z. For πι = 3 and the plaintext AUTUMN, the strings AUT and UMN 
321 correspond to the vectors (0, 20, 19) and (20, 12, 13). 
Then S. (0,20,19)' = (217, 138,59)" = (9,8, 7)' (mod26) and S - (20,12, 13)" = (415, 246,97)" = 
(25, 12, 19}! (mod26). Thus, the plaintext AUTUMN is mapped to the ciphertext JIHZMT. 


5.5.5 Methods of Classical Cryptanalysis 


The purpose of cryptanalytical investigations is to deduce from the ciphertext an optimum of infor- 
mation about the corresponding plaintext without knowing the key. These analyses are of interest not 
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only to an unauthorized “eavesdropper” but also help assess the security of cryptosystems from the 
user's point of view. 
5.5.5.1 Statistical Analysis 


Each natural language shows a typical frequency distribution of the individual letters, two-letter com- 
binations, words, etc. For example, in English the letter e is used most frequently: 


Letter | Relative frequency 
E, 12.7% 

T, A, O,1,N,5, H, R 56.9 % 

D,L 8.3 96 

C, U, M, W, F, G, Y, P,B | 19.9% 

V,K, J, X, Q,Z 2.2 96 


Given sufficiently long ciphertexts it is possible to break a monoalphabetic, monographic substitution 
on the basis of the frequency distribution of letters. 


5.5.5.2 Kasiski-Friedman Test 


Combining the methods of Kasiski and Friedman it is possible to break the Vignére cipher. The attack 
benefits from the fact that the encryption algorithm applies the key periodically. If the same string 
of plaintext letters is encrypted with the same portion of the key then the same string of ciphertext 
letters will be produced. A length > 2 of the distance of such identical strings in the ciphertext must 
be a multiple of the key length. In the case of several reoccurring strings of ciphertext the key length 
is a divisor of the greatest common divisor of all distances. This reasoning is called the Kasiski test. 
One should, however, be aware of erroneous conclusions due to the possibility that matches may occur 
accidentally. 


The Kasiski test permits the determination of the key length at most as a multiple of the true key length. 

The Friedman test yields the magnitude of the key length. Let n be the length of the ciphertext of some 

English plaintext encrypted by means of the Vignére method. Then the key length {15 determined by 
- 0.027n 

(n — 1)IC — 0.038n + 0.065" 


Here IC denotes the coincidence index of the ciphertext. This index can be deduced from the number 
n; of occurrences of the letter a; (i € {0,1,...,25}) in the ciphertext: 


(5.275a) 


26 
Σ ni(n; — 1) 


— i=l K 
IC = aae em (5.275b) 


In order to determine the key, the ciphertext of length n is split into | columns. Since the Vignére cipher 
produces the contents of each column by means of a shift cipher, it suffices to determine the equivalence 
of E on a column base. Should V be the most frequent letter within a column, then the Vignére tableau 
points to the letter R 
E 
: (5.275c) 
ROV 

of the key. The methods described so far will not be successful if the Vignére cipher employs very long 
keys (e.g., as long as the plaintext). It is, however, possible to deduce whether the applied cipher is 
monoalphabetic, polyalphabetic with short period or polyalphabetic with long period. 


5.5.6 One-Time Pad 


The one-time pad is the only substitution cipher that is considered theoretically secure. The encryption 
adheres to the principle of the Vignére cipher, where the key is a random string of letters as long as the 
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plaintext. 

Usually, one-time pads are applied as binary Vignére ciphers: Plaintext and ciphertext are represented 
as binary numbers with addition modulo 2. In this particular case the cipher is involutory, which means 
that the twofold application of the cipher restores the original plaintext. A concrete implementation 
of the binary Vignére cipher is based on shift register circuits. These circuits combine switches and 
storage elements, whose states are 0 or 1, according to special rules. 


5.5.7 Public Key Methods 


Although the methods of conventional encryption can have efficient implementations with today's com- 
puters, and although only a single key is needed for bidirectional communication, there are a number 
of drawbacks: 

1. The security of encryption solely depends on keeping the next key secret. 

2. Prior to any communication, the key must be exchanged via a sufficiently secured channel; sponta- 
neous communication is ruled out. 


3. Furthermore, no means exist to prove to a third party that a specific message was sent by an iden- 
tified sender. 


5.5.7.1 Diffie-Hellman Key Exchange 

The concept of encryption with public keys was developed by Diffie and Hellman in 1976. Each partic- 
ipant owns two keys: a public key that is published in a generally accessible register, and a private key 
that is solely known to the participant and kept absolutely secret. Methods with these properties are 
called asymmetric ciphers (see 5.5.2, p. 387). 
The public key KP; of the i-th participant controls the encryption step E;, his private key KS; the 
decryption step D;. The following conditions must be fulfilled: 
1. Dj o E; constitutes the identity. 

2. Efficient implementations for E; and D; are known. 

3. The private key K S; cannot be deduced from the public key K P; with the means available in the 
foreseeable future. If in addition 

4. also E; o D; yields the identity, 

then the encryption algorithm qualifies as an electronic signature method with public keys. The elec- 
tronic signature method permits the sender to attach a tamperproof signature to a message. 


If A wants to send an encrypted message m to B, then A retrieves B's public key K Pg from the register, 
applies the encryption algorithm Ep, and calculates Eg(m) = c. A sends the ciphertext c via the public 
network to B who will regain the plaintext of the message by decrypting c using his private key K δ» in 
the decryption function Dg: Dp(c) = Dg(Epg(m)) = m. In order to prevent tampering of messages, A 
can electronically sign his message m to B by complying with an electronic signature method with the 
public key in the following way: A encrypts the message m with his private key: D4(m) = d. A attaches 
to d his signature “A” and encrypts the total using the public key of B: Eg(Da(m), “A”) = Ep(d, 
^A") = e. The text thus signed and encrypted is sent from A to B. 

The participant B decrypts the message with his private key and obtains Dg(e) = Dg(Eg(d, *A")) 
= (d, ^A"). Based on this text B can identify A as the sender and can now decrypt d using the public 
key of A: E4(d) = E4(Da(m)) = m. 


5.5.7.2 One-Way Function 


The encryption algorithms of a method with public key must constitute a one-way function with a “trap 
door”. A trap door in this context is some special, additional information that must be kept secret. An 
injective function f: X —> Y is called a one-way function with a trap door, if the following conditions 
hold: 


1. There is an efficient method to compute both f and f~t. 
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2. The calculation of f^! cannot be deduced from f without the knowledge of the secret additional 
information. 


The efficient method to get f^! from f cannot be made without the secret additional information. 
5.5.7.3 RSA Codes and RSA Method 
1. RSA Codes 


Rivest, Shamir and Adleman (see [5.16]) developed an encryption scheme for secret messages on the 
basis of the Euler-Fermat theorem (see 5.4.4, 2., p. 381). The scheme is called the RSA algorithm after 
the initials of their last names. Part of the key required for decryption can be made public without 
endangering the confidentiality of the message; for this reason, the term public key code is used in this 
context as well. 

In order to apply the RSA algorithm the recipient B chooses two very large prime numbers p and q, 
calculates m = pq and selects a number r relatively prime to y(m) = (p — 1)(q— 1) and 1 < r < y(m). 
B publishes the numbers m and r because they are needed for decryption. 

For transmitting a secret message from sender A to recipient B the text of the message must be con- 
verted first to a string of digits that will be split into N blocks of the same length of less than 100 decimal 
positions. Now A calculates the remainder R of N” divided by m. 


N" = R(m). (5.276a) 
Sender A calculates the number R for each of the blocks N that were derived from the original text 
and sends the number to B. The recipient can decipher the message R if he has a solution of the linear 
congruence rs = 1(y(m)). The number N is the remainder of R° divided by m: 

RS = (N°): = Nite = Ν. (ΝΟΕ = N(m). (5.276b) 


Here, the Euler-Fermat theorem (see 5.4.4, 2., p. 381) with N?™ = 1(m) has been applied. Eventu- 
ally, B converts the sequence of numbers into text. 


W A recipient B who expects a secret message from sender A chooses the prime numbers p = 29 and 
q = 37 (actually too small for practical purposes), calculates m = 29-37 = 1073 (and y(1073) = 
(29) : (37) = 1008)), and chooses r = 5 (it satisfies the requirement of gcd(1008, 5) = 1). B passes 
the values m = 1073 and r = 5 to A. 
A intends to send the secret message N = 8 to B. A encrypts N into R = 578 by calculating N” = 8° = 
578 (1073), and just sends the value R = 578 to B. B solves the congruence 5 - s = 1 (1008), arrives at 
the solution s = 605, and thus determines R° = 57890? = 8 = N (1073). 
Remark: The security of the RSA code correlates with the time needed by an unauthorized listener 
to factorize m. Assuming the speed of today's computers, a user of the RSA algorithm should choose 
the two prime numbers p and q with at least a length of 100 decimal positions in order to impose a 
decryption effort of approximately 74 years on the unauthorized listener. The effort for the authorized 
user, however, to determine an r relatively prime to φ(ρη) = (p — 1)(q — 1) is comparatively small. 
2. RSA Method 
'The RSA method is the most popular asymmetric encryption method. 
1. Assumptions Let p and q be two large prime numbers with pq zz 10294 and n = pq. The number 
of decimal positions of p and q should differ by a small number; yet, the difference between p and q 
should not be too large. Furthermore, the numbers p — 1 and q — 1 should contain rather big prime 
factors, while the greatest common divisor of p — 1 and q — 1 should be rather small. Let e > 1 be 
relatively prime to (p — 1)(q — 1) and let d satisfy d: e = 1 (mod(p — 1)(q — 1)). Now n and e represent 
the public key and d the private key. 
2. Encryption Algorithm 

E: (0,1,...,n 1} > (0,1,...,n — 1} E(x) :— z^ modnm. (5.277a) 
3. Decyphering Operations 

D: {0,1,...,n — 1} > {0,1,...,n— 1} D(x) := x^ modn. (5.277b) 
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Thus D(E(m)) = E(D(m) = m for message m. 

The function in this encryption method with n > 1020 constitutes a candidate for a one-way function 
with trap door (see 5.5.7.2, p. 391). The required additional information is the knowledge of how to 
factor n. Without this knowledge it is infeasible to solve the congruence d-e = 1 (modulo (p— 1)(q— 1)). 
The RSA method is considered practically secure as long as the above conditions are met. A disadvan- 
tage in comparison with other methods is the relatively large key size and the fact that RSA is 1000 
times slower than DES. 


5.5.8 DES Algorithm (Data Encryption Standard) 


The DES method was adopted in 1976 by the National Bureau of Standards (now NIST) as the official 
US encryption standard. The algorithm belongs to the class of symmetric encryption methods (see 
5.5.2, p. 387) and still plays a predominant role among cryptographic methods. The method is, however, 
no longer suited for the encryption of top secret information because today's technical means permit 
an attack by an exhaustive test trying all keys. 


The DES algorithm combines permutations and non-linear substitutions. The algorithm requires a 
56-bit key. Actually, a 64-bit key is used, however, only 56 bits can freely be chosen; the remaining 
eight bits serve as parity bits, one for each of the seven-bit blocks to yield odd parity. 

The plaintext is split into blocks of 64 bits each. DES transforms each 64-bit plaintext block into a 
ciphertext block of 64 bits. First, the plaintext block will be subject to an initial permutation and is 
then encrypted in 16 rounds, each operating with a different subkey Ay, K5,..., Aig. The encryption 
completes with a final permutation that is the inverse of the initial permutation. 


Decryption uses the same algorithm with the difference that the subkeys are employed in reverse order 
Kig, Kis, — Kı. 
The strength of the cipher rests on the nature of the mappings that are part of each round. It can be 
shown that each bit of the ciphertext block depends on each bit of the corresponding plaintext and on 
each bit of the key. 
Although the DES algorithm has been disclosed in full detail, no attack has been published so far that 
can break the algorithm without an exhaustive test of all 256 keys. 


5.5.9 IDEA Algorithm 

(International Data Encryption Algorithm) 
The IDEA algorithm was developed by LAI and MASSAY and patented 1991. It is a symmetric en- 
cryption method similar to the DES algorithm and constitutes a potential successor to DES. IDEA 
became known as part of the reputed software package PGP (Pretty Good Privacy) for the encryption 
of emails. In contrast to DES not only was the algorithm published but even its basic design criteria. 
The objective was the use of particularly simple operations (addition modulo 2, addition modulo 219. 
multiplication modulo 21611). 
IDEA works with keys of 128 bits length. IDEA encrypts plaintext blocks of 64 bits each. The algo- 
rithm splits a block into four subblocks of 16 bits each. From the 128-bit key 52 subkeys are derived, 
each 16 bits long. Each of the eight encryption rounds employs six subkeys; the remaining four subkeys 
are used in the final transformation which constructs the resulting 64-bit ciphertext. Decryption uses 
the same algorithm with the subkeys in reverse order. 


IDEA is twice as fast as DES, its implementation in hardware, however, is more difficult. No successful 
attack against IDEA is known. Exhaustive attacks trying all 256 keys are infeasible considering the 
length of the keys. 
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5.6 Universal Algebra 


A universal algebra consists of a set, the underlying set, and operations on this set. Simple examples are 
semigroups, groups, rings, and fields discussed in sections 5.3.2, p. 336; 5.3.3, p. 336 and 5.3.7, p. 361. 
Universal algebras (mostly many-sorted, i.e., with several underlying sets) are handled especially in 
theoretical informatics. There they form the basis of algebraic specifications of abstract data types 
and systems and of term-rewriting systems. 


5.6.1 Definition 


Let Q be a set of operation symbols divided into pairwise disjoint subsets Ωμ, n € IN. Ωρ contains the 
constants, Ωμ, n > 0, contain the n-ary operation symbols. The family (Qn)nen is called the type or 
signature. If A is a set, and if to every n-ary operation symbol w € Ω, an n-ary operation w^ in A is 
assigned, then A = (A, {w“|w € Q}) is called an Q algebra or algebra of type (or of signature) Q. 

If Q is finite, Q = {w,...,w,}, then one also writes A = (A,w7',...,w7) for A. 

If a ring (see 5.3.7, p. 361) is considered as an Q algebra, then 2 is partitioned Ωρ = {w1}, Q1 = {wo}, 
Qə = (wa, w4}, where to the operation symbols w1, w2, w3, w4 the constant 0, taking the inverse with 
respect to addition, addition and multiplication are assigned. 

Let A and B be Q algebras. B is called an € subalgebra of A, if B C A holds and the operations w? are 
the restrictions of the operations w^ (ω € Ω) to the subset B. 


5.6.2 Congruence Relations, Factor Algebras 


In constructing factor structures for universal algebras, the notion of congruence relation is needed. A 
congruence relation is an equivalence relation compatible with the structure: Let A = (A, {w4|w € Q}) 
be an €? algebra and R be an equivalence relation in A. R is called a congruence relation in A, if for all 
w E Qn (n € N) and all a;,b; € A with a; Rb; (i = 1,...,n): 


w^ (a1, ..., as) Rw (bi, .. . bs). (5.278) 
The set of equivalence classes (factor set) with respect to a congruence relation also form an Q algebra 
with respect to representative-wise calculations: Let A = (A, {w4|w € Q}) be an Q algebra and R be 
a congruence relation in A. The factor set A/R (see 5.2.4, 2., p. 334) is an Q algebra A/R with the 
following operations u^/* (w € Qn, n € N) with 


w^ P (12,], .. . , [as] g) = [ω (αι... απ)]η (5.279) 
and it is called the factor algebra of A with respect to R. 
'The congruence relations of groups and rings can be defined by special substructures — normal sub- 
groups (see 5.3.3.2, 2. p. 338) and ideals (see 5.3.7.2, p. 362), respectively. In general, e.g., in semi- 
groups, such a characterization of congruence relations is not possible. 


5.6.3 Homomorphism 


Just as with classical algebraic structures, the homomorphism theorem gives a connection between the 
homomorphisms and congruence relations. 
Let A and B be Q algebras. A mapping h: A — B is called a homomorphism, if for every w € Q, and 
all ay,...,@n € A: 

h(w^ (ai, ...,a4)) = w?” (h(a), . .. , h(a,)). (5.280) 
If, in addition, h is bijective, then h is called an isomorphism; the algebras A and B are called isomor- 
phic. The homomorphic image h(A) of an Q algebra A is an Ω subalgebra of B. Under a homomorphism 


h, the decomposition of A into subsets of elements with the same image corresponds to a congruence 
relation which is called the kernel of h: 


ker h = ((a,b) € A x Alh(a) = h(b)}. (5.281) 
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5.6.4 Homomorphism Theorem 


Let A and B be Q algebras and h: A + B a homomorphism. h defines a congruence relation ker h in 
A. The factor algebra A/ ker h is isomorphic to the homomorphic image h(A). 

Conversely, every congruence relation R defines a homomorphic mapping natg: A > A/R with natg(a) 
= [a]g. Fig. 5.19 illustrates the homomorphism theorem. 


5.6.5 Varieties 

A variety V is a class of Ω algebras, which is closed 

ο ο under forming direct products, subalgebras, and 
homomorphic images, i.e., these formations do not 
lead out of V. Here the direct products are defined 
in the following way: 
Considering the operations corresponding to Q 
componentwise on the Cartesian product of the un- 


derlying sets of Q algebras, an Q algebra, the direct 
[η [| [ | product of these algebras is obtained. The theorem 
A/kerh of Birkhoff (see 5.6.6, p. 395) characterizes the va- 
rieties as those classes of Ω algebras, which can be 

Figure 5.19 equationally defined. 


5.6.6 Term Algebras, Free Algebras 


Let (Qn)nen be a type (signature) and X a countable set of variables. The set To(.X) of Q terms over 
X is defined inductively in the following way: 
1. X U Qo C To(X). 
2. If ti,...,tn € To(X) and w € Ωμ hold, then also wt; . . . tn € To(X) holds. 
The set To(X) defined in this way is an underlying set of an € algebra, the term algebra To(.X ) of type 
Q over X, with the following operations: Ift,...,t, € To(X) and w € Ωμ hold, then wT) is defined 
by 

WO (4... ta) = why... tn. (5.282) 
Term algebras are the “most general” algebras in the class of all Ω algebras, i.e., no “identities” are 
valid in term algebras. These algebras are called free algebras. 
An identity is a pair (s(zi,..., En), t(@1,-..,@p)) of Q terms in the variables z1,..., £n. An Q algebra 
A satisfies such an equation, if for every a4,...,a,, € A holds: 

s^ (a4,..., a4) = O ., a4). (5.283) 
A class of 2 algebras defined by identities is a class of Q algebras satisfying a given set of identities. 
Theorem of Birkhoff: The classes defined by identities are exactly the varieties. 
W Varieties are for example the classes of all semigroups, groups, Abelian groups, and rings. But, e.g., 
the direct product of cyclic groups is not a cyclic group, and the direct product of fields is not a field. 
"Therefore cyclic groups or fields do not form a variety, and cannot be defined by equations. 


5.7 Boolean Algebras and Switch Algebra 


Calculating rules, similar to the rules established in 5.2.2, 3., p. 329 for set algebra and propositional 
calculus (5.1.1, 6., p. 324), can be found for other objects in mathematics too. The investigation of 
these rules yields the notion of Boolean algebra. 


5.7.1 Definition 
A set B, together with two binary operations M (“conjunction”) and U (“disjunction”), and a unary 
operation (“negation”), and two distinguished (neutral) elements 0 and 1 from B, is called a Boolean 
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algebra B = (B, ΓΙ, U, , 0, 1) if the following properties are valid: 
(1) Associative Laws: 


(anb) re 2 an (bn c), (5.284) (aub) Le au (buc). (5.285 


(2) Commutative Laws: 


alib —bra, (5.286) alib — bLla. (5.287 
(3) Absorption Laws: 

an (aUlb) =a, (5.288) aU (ab) — a. (5.289 
(4) Distributive Laws: 

(aUlb)Ne=(aNe)U(bNe), (5.290) (aNb)Uc=(aUc)A(bUc). (5.291 
(5) Neutral Elements: 

anl=a, (5.292) all0=a, (5.293 

an0- 0, (5.294) alil-1, (5.295 


(6) Complement: 


ana - 0, (5.296) allá -- 1. (5.297 


A structure with the associative laws, commutative laws, and absorption laws is called a lattice. If the 
distributive laws also hold, then the lattice is called a distributive lattice. So a Boolean algebra is a 
special distributive lattice. 

Remark: The notation used for Boolean algebras is not necessarily identical to the notation for the 
operations in propositional calculus. 


5.7.2 Duality Principle 


1. Dualizing 

In the “axioms” of a Boolean algebra is included the following duality: Replacing N by U, LI by ΠΠ, 0 
by 1, and 1 by 0 in an axiom gives always the other axiom in the same row. The axioms in a row are 
dual to each other, and the substitution process is called dualization. The dual statement follows from 
a statement of the Boolean algebra by dualization. 

2. Duality Principle for Boolean Algebras 

The dual statement of a true statement for a Boolean algebra is also a true statement for the Boolean 
algebra, i.e., with every proved proposition, the dual proposition is also proved. 

3. Properties 

One gets, e.g., the following properties for Boolean algebras from the axioms. 

(E1) The Operations M and U are Idempotent: 


αΠαα, (5.298) alla — a. (5.299) 


(E2) De Morgan Rules: 


anb=aub, (5.300) allb=anb, (5.301) 


(E3) A further Property: 
G@=a. (5.302) 
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It is enough to prove only one of the two properties in any line above, because the other one is the dual 
property. The last property is self-dual. 


5.7.3 Finite Boolean Algebras 
All finite Boolean algebras can be described easily up to “isomorphism”. Let Bı, B5 be two Boolean 
algebras and f: Bı — B» a bijective mapping. f is called an isomorphism if 

Flam) = f(a)n f(), f(aub)- f(a)U f(b) and f(@) = Fla) (5.303) 
hold. Every finite Boolean algebra is isomorphic to the Boolean algebra of the power set of a finite set. 
In particular every finite Boolean algebra has 2" elements, and every two finite Boolean algebras with 
the same number of elements are isomorphic. 


Hereafter B denotes the Boolean algebra with two elements (0, 1} and with the operations 


njo 1 Uulo 1 — 
0/0 0 010 1 0/1 
T0. Ίνα T 1190 


Defining the operations ΓΙ, Ll, and ~ componentwise on the n-times Cartesian product B" = {0,1} x 

- x {0,1}, then B" will be a Boolean algebra with 0 = (0,...,0) and 1 = (1,...,1). B" is called 
the n times direct product of B. Because B” contains 2" elements, this way one gets all finite Boolean 
algebras (out of isomorphism). 


5.7.4 Boolean Algebras as Orderings 
An order relation can be assigned to every Boolean algebra B: Here a € b holds if aM b = a is valid (or 


equivalently, if a L1 b — b holds). 
So every finite Boolean algebra can be represented by a Hasse diagram (see 5.2.4, 4., p. 334). 


W Suppose B is the set {1,2,3,5,6, 10, 15, 30] of the divisors of 30. Then, the . 
least common multiple and the greatest common divisor can be defined as bi- 
nary operations and the complement as unary operation. The numbers 1 and 


Pd ES 
30 correspond to the distinguished elements 0 and 1. The corresponding Hasse * 10 *15 
diagram is shown in Fig. 5.20. 
EN P4 ' 
1 


5.7.5 Boolean Functions, Boolean Expressions 


1. Boolean Functions 


Denoting by B the Boolean algebra with two elements as in 5.7.3, p. 397, then D 

an n-ary Boolean function f is a mapping from B" into B. There are 2?" n-ary i 

Boolean functions. The set of all n-ary Boolean functions with the operations Figure 5.20 
(FD g)(b) = f(b) T g0), (5.304) (F U g)(b) = f(b) Ug(®), (5.305 
70) πο (5.306 


is a Boolean algebra. Here b always means an n tuple of the elements of B = {0,1}, and on the right- 
hand side of the equations the operations are performed in B. The distinguished elements 0 and 1 
correspond to the functions fo and fı with 


fo(b)=0, fi(b)=1 forall be B”. (5.307 
ΒΑ: Inthe case n = 1, i.e., for only one Boolean variable b, there are four Boolean functions: 
Identity f(b) =b, Negation f(b) — b, (5.308 


Tautology f(b) =1, Contradiction f(b) — 0. 


W B: Inthecasen = 2. i.e., for two Boolean variables a and b, there are 16 different Boolean functions, 
among which the most important ones have their own names and notation. They are shown in Table 
5.6. 
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Table 5.6 Some Boolean functions with two variables a and b 


Value table for 


Name of the | Different Different : 0 0 1 1 

function notation symbols () = (Ὁ) , 0) ; (ο) 3 (i) 

Sheffer a-b 

or a| b &— D- 1, 1. l5 0 

NAND NAND (a,b) 

Peirce a+b ες 

or alb > = 1, 0, 0, 0 

NOR NOR a,b 

ΜῊΝ z ab + ab 

Antivalence aXORb EX " 

Or zb 0 1 1 0 
α 

XOR αφ b 
ab+ab L14 e 

Equivalence a= 1. 0 0 1 
a € b 

Implication | 2+ 1, 1, 0, 1 
a —> b 


2. Boolean Expressions 
Boolean expressions are defined in an inductive way: Let X = {x,y,z,...} be a (countable) set of 
Boolean variables (which can take values only from {0, 11): 


1. The constants 0 and 1 just as the Boolean variables from X are 


Boolean expressions. (5.309) 
2. If S and T are Boolean expressions, so are T, (S N T), and (S U T), as well. (5.310) 
If a Boolean expression contains the variables £1, ... , &n, then it represents an n-ary Boolean function 
fr: 
Let b be a “valuation” of the Boolean variables x1,..., ap, i.e., b = (b,..., 04) € B". 


Assigning a Boolean function to the expression T in the following way gives: 


1. fT =0, then fr = fo; ifT — 1, then fr = fi. (5.311a) 
2. If T = αι, then fr(b) = bi; if T = 5, then fr(b) = fs(b). (5.311b) 
3. IET = RN S, then fr(b) = fg(b) N fs(b). (5.311c) 
4. IT = RU S, then fr(b) = fr(b) U fs(b). (5.311d) 


On the other hand, every Boolean function f can be represented by a Boolean expression T' (see 5.7.6, 
p. 399). 

3. Concurrent or Semantically Equivalent Boolean Expressions 

The Boolean expressions S and T are called concurrent or semantically equivalent if they represent the 
same Boolean function. Boolean expressions are equal if and only if they can be transformed into each 
other according to the axioms of a Boolean algebra. 

Under transformations of a Boolean expression here are considered especially two aspects: 

e Transformation in a possible “simple” form (see 5.7.7, p. 399). 

e Transformation in a “normal form". 
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5.7.6 Normal Forms 


1. Elementary Conjunction, Elementary Disjunction 

Let B = (B,T,U, ,0,1) be a Boolean algebra and {21,..., £n} a set of Boolean variables. Every 
conjunction or disjunction in which every variable or its negation occurs exactly once is called an ele- 
mentary conjunction or an elementary disjunction respectively (in the variables z4,... 2). 

Let T(xi,...,24) be a Boolean expression. A disjunction D of elementary conjunctions with D = T is 
called a principal disjunctive normal form (PDNF) of T. A conjunction C of elementary disjunctions 
with C = T is called a principal conjunctive normal form ( PCNF) of T. 


W Part 1: In order to show that every Boolean function f can be represented as a Boolean expression, 


the PDNF form of the function f given in the annexed table is to be constructed: 
x y z|f(x,y,z) The PDNF of the Boolean function f contains the elementary conjunc- 
tions TN YN z,2@Nynz,«nyMz. These elementary conjunctions belong 
0 0 0 0 to the valuations b of the variables where the function f has the value 1. 
0 0 1 If a variable v has the value 1 in b, then v is to put in the elementary con- 
0 1 0 0 junction, otherwise v. ` 
: 7 ᾽ ^ W Part 2: The PDNF for the example of Part 1 is: 
1 0 1 (zrghz)u(zngns)u(zngrns). (5.312) 
T. a d The “dual” form for PDNF is the PCNF: The elementary disjunctions be- 
1 11 0 long to the valuations b of the variables for which f has the value 0. 
If a variable v has the value 0 in b, then v is to put in the elementary disjunction, otherwise v. So the 
PCNF is: 
(νι z)n(zugtus)n(ezugusz)n(zugtszrn(ugLtsz). (5.313) 


The PDNF and the ΡΟΝΕ of f are uniquely determined, if the ordering of the variables and the ordering 
of the valuations is given, e.g., if considering the valuations as binary numbers and arranging them in 
increasing order. 

2. Principal Normal Forms 
The principal normal form of a Boolean function fr is considered as the principal normal form of the 
corresponding Boolean expression T. 

Checking the equivalence of two Boolean expressions by transformations is often difficult. The prin- 
cipal normal forms are useful: Two Boolean expressions are semantically equivalent exactly if their 
corresponding uniquely determined principal normal forms are identical letter by letter. 

W Part 3: In the considered example (see Part 1 and 2) the expressions (y N z) U (£ N y N Z) and 
(x LU ((yLl z) (gl z)n(gusz))n (FU (yz) (GU z))) are semantically equivalent because the 
principal disjunctive (or conjunctive) normal forms of both are the same. 


5.7.7 Switch Algebra 

A typical application of Boolean algebra is the simplification of series-parallel connections (SPC). 
Therefore a Boolean expression is to be assigned to a SPC (transformation). This expression will be 
"simplified" with the transformation rules of the Boolean algebra. Finally a SPC is to be assigned to 
this expression (inverse transformation). The result is a simplified SPC which produces the same be- 
havior as the initial connection system (Fig. 5.21). 


A SPC has two types of contact points: the so-called “make contacts" and “break contacts”, and both 
types have two states; namely open or closed. The usual symbolism is: When the equipment is put on, 
the make contacts close and the break contacts open. With Boolean variables assigned to the contacts 
of the switch equipment follows: 

The position “off” or “on” of the equipment corresponds to the value 0 or 1 of the Boolean variables. 
'The contacts being switched by the same equipment are denoted by the same symbol, the Boolean 
variable belonging to this equipment. The contact value of a SPC is 0 or 1, according to whether the 
switch is electrically non-conducting or conducting. The contact value depends on the position of the 
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contacts, so it is a Boolean function S (switch function) of the variables assigned to the switch equip- 
ment. Contacts, connections, symbols, and the corresponding Boolean expressions are represented in 


Fig. 5.22. 


electrically equivalent 


transformation 
(modelling) 


inverse 

simplification by 

Boolean Boolean algebra 
expression 


Figure 5.21 


simplified 
Boolean 
expression 


make contact series connection 
(symbol: i ) (symbol: b L—) 
S=a S=anb 
break contact parallel connection 
(symbol: Ex πα ) (symbol: p τη. ) 
b 
S=a S-aub 
Figure 5.22 
a c 
c 
LL—. 
τε 
Figure 5.23 Figure 5.24 


transformation 


The Boolean expressions, which represent switch functions of SPC, have the special property that the 


negation sign can occur only above variables (never over subexpressions). 


W Simplification of the SPC Fig. 5.23. This connection corresponds to the Boolean expression 


S = (@Nb)U(anbnz)U (an (bu c)) 
as switch function. According to the transformation formulas of Boolean algebra holds: 
S = (bN (@U (ar1e))) u (an (bUc)) 
(br1(aLlc)) u (am (bUc)) 
= (απο)! (bric)u (arie) 
(ar1bre)u (anbrne)u(bne)u (anb 
= (anc) u (bnz). 


cu(ane)u(anbne) 


(5.314) 


(5.315) 


Here one gets aN c from (@NbNc)U (anc) u (an bn c), and bnz from (aribrie) u (bre) ui (an brio). 


The finally simplified result SPC is shown in Fig. 5.24. 


This example shows that usually it is not so easy to get the simplest Boolean expression by transfor- 


mations. In the literature one can find different methods for this procedure. 
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5.8 Algorithms of Graph Theory 


Graph theory is a field in discrete mathematics having special importance for informatics, e.g., for rep- 
resenting data structures, finite automata, communication networks, derivatives in formal languages, 
etc. There are also applications in physics, chemistry, electrotechnics, biology and psychology. More- 
over, flows can be applied in transport networks and in network analysis in operations research and in 
combinatorial optimization. 


5.8.1 Basic Notions and Notation 


1. Undirected and Directed Graphs 

A graph G is an ordered pair (V, E) of a set V of vertices and a set E of edges. There is a mapping, 
defined on E, the incidence function, which uniquely assigns to every element of E an ordered or non- 
ordered pair of (not necessarily distinct) elements of V. If a non-ordered pair is assigned then G is 
called an undirected graph (Fig. 5.25). If an ordered pair is assigned to every element of E, then the 
graph is called a directed graph (Fig. 5.26), and the elements of E are called arcs or directed edges. All 
other graphs are called mixed graphs. 

In the graphical representation, the vertices of a graph are denoted by points, the directed edges by 
arrows, and undirected edges by non-directed lines. 


" Va Vs νι V5 
ο : Cj 
u f ] i 
v. e; ei νι 6: e, 
€ WV Εἰ eV, 
a M 
Figure 5.25 Figure 5.26 Figure 5.27 


W A: For the graph G in Fig. 5.27: V = (vi, vo, v3, v4, Us}, E = (61,62, 63, 61, ἐς. 66, C7}, 
filer) = (vi v2}, h(e») = {υι, v2}, files) = (vo, va), filea) = (v3, va), files) = ( 
fi ερ) = (v4, v2), filer) = (vs, vs). 

E B: For the graph G in Fig. 5.26: V = (vi, v2, v3, v4, v5], E' = (ese €b, e1} 

falei) = (va, va), falez) = (va; va), falez) = (va, v2), foleg) = (vs, vs). 

E C: For the graph G in Fig. 5.25: V = (vi, vo, v3, v4, U5}, E" = (et e3, e3, ef}, 

felet) = (vs, us}, fa(e2) = (va, us}, falez) = (va v2}, alel) = (os, vs}. 

2. Adjacency 

If (v, w) € E, then the vertex v is called adjacent to the vertex w. Vertex v is called the initial point of 
(v, w), w is called the terminal point of (v, w), and v and w are called the endpoints of (v, w). 
Adjacency in undirected graphs and the endpoints of undirected edges are defined analogously. 


3. Simple Graphs 
If several edges or arcs are assigned to the same ordered or non-ordered pairs of vertices, then they 
are called multiple edges. An edge with identical endpoints is called a loop. Graphs without loops and 
multiple edges and multiple arcs, respectively, are called simple graphs. 
4. Degrees of Vertices 
The number of edges or arcs incident to a vertex v is called the degree dc (v) of the vertex v. Loops are 
counted twice. Vertices of degree zero are called isolated vertices. 
For every vertex v of a directed graph G, the out-degree dé; (v) and in-degree dc; (v) of v are distinguished 
as follows: 

dá(v) = |(w|(vw) € E], — (5.3162) dg(v) = 


(w|(w,v) € ΕΗ. (5.316b) 
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5. Special Classes of Graphs 

Finite graphs have a finite set of vertices and a finite set of edges. Otherwise the graph is called infinite. 
In regular graphs of degree r every vertex has degree r. 

An undirected simple graph with vertex set V is called a complete graph if any two different vertices in 
V are connected by an edge. A complete graph with an n element set of vertices is denoted by Kn. 

If the set of vertices of an undirected simple graph G can be partitioned into two disjoint classes X and 
Y such that every edge of G joins a vertex of X and a vertex of Y , then G is called a bipartite graph. 
A bipartite graph is called a complete bipartite graph, if every vertex of X is joined by an edge with 
every vertex of Y. If X has n elements and Y has m elements, then the graph is denoted by Ky, m. 

W Fig. 5.28 shows a complete graph with five vertices. 

W Fig. 5.29 shows a complete bipartite graph with a two-element set X and a three-element set Y. 


ii 
5 2 Xi m 
K; -- X, s 
4 3 Ys 
Figure 5.28 Figure 5.29 


Further special classes of graphs are plane graphs, trees and transport networks. Their properties will 
be discussed in later paragraphs. 

6. Representation of Graphs 

Finite graphs can be visualized by assigning to every vertex a point in the plane and connecting two 
points by a directed or undirected curve, if the graph has the corresponding edge. There are examples 
in Fig. 5.30-5.33. Fig. 5.33 shows the Petersen graph, which is a well-known counterexample for 
several graph-theoretic conjectures, which could not be proved in general. 


+ d. ορ 


Figure 5.30 Figure 5.31 Figure 5.32 Figure 5.33 


7. Isomorphism of Graphs 

A graph Gi = (Vi, Ei) is called isomorphic to a graph G2 = (V2, E») iff there are bijective mappings 
ᾧ from Vj onto V5 and ψ from E, onto Ε2 being compatible with the incidence function, i.e., if u,v 
are the endpoints of an edge or u is the initial point of an arc and v is its terminal point, then y(u) 
and φ(υ) are the endpoints of an edge and y(u) is the initial point and (v) the terminal point of 
an arc, respectively. Fig. 5.34 and Fig. 5.35 show two isomorphic graphs. The mapping y with 
p(1) =a, (2) = b, (3) = c, (4) = d is an isomorphism. In this case, every bijective mapping of 
(1,2, 3, 4} onto {a, b, c, d) is an isomorphism, since both graphs are complete graphs with equal number 
of vertices. 


8. Subgraphs, Factors 

If G = (V, E) is a graph, then the graph G' = (V’, E") is called a subgraph of G, if V' C V and E' C E. 
If E' contains exactly those edges of E which connect vertices of V’, then G' is called the subgraph of G 
induced by V' (induced subgraph). 
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1 2 a b 
Figure 5.34 Figure 5.35 


A subgraph G' = (V', E") of G = (V, E) with V’ = V is called a partial graph of G. 
A factor F of a graph G is a regular subgraph of G containing all vertices of G. 
9. Adjacency Matrix 
Finite graphs can be described by matrices: Let G = (V, E) be a graph with V = (vi, v2,...,Un} and 
E = {e1,€2,...,em}. Let m(vi, vj) denote the number of edges from v; to vj. For undirected graphs, 
loops are counted twice; for directed graphs loops are counted once. The matrix A of type (n,n) with 
A = (m(vi, v;)) is called an adjacency matrix. If in addition the graph is simple, then the adjacency 
matrix has the following form: 
1, for(wu,v;)e E, 

Acne te or n g E; 
i.e., in the matrix A there is a 1 in the i-th row and j-th column iff there is an edge from v; to vj. 
The adjacency matrix of undirected graphs is symmetric. 
W A: Beside Fig. 5.36 there is the adjacency matrix A; = A(G) of the directed graph G4. 
Β D: Beside Fig. 5.37 there is the adjacency matrix Ay = A(G3) of the undirected simple graph Gy. 


(5.317) 


γι 
V3 νι 010101 
0100 Y; v; 1910 
i | [0000 010101 
" Ai=|0103 v v, A2— 10101 
νι 0100 010101 
v, 10101 


Fieure 5. 
igure 5.36 Figure 5.37 


10. Incidence Matrix 
For an undirected graph G = (V, E) with V = (vi, v2,...,Un} and E = {e1, e2,...,em}, the matrix I 
of type (n,m) given by 


T= (bi; 1, wis incident with e; and e; is not a loop, (5.318) 


0, viis not incident with ερ, 
with bi = 
2, viis incident with e; and ej is a loop 


is called the incidence matriz. 
For a directed graph G = (V, E) with V = (vi, v5,...,v,] and E = (e1,€s,..., επι}, the incidence 
matrix I is the matrix of type (n, m), defined by 


. vis not incident with ej, 

> wis the initial point of e; and e; is not a loop, 

> vis the terminal point of e; and ej is not a loop, 
, viis incident to e; and e; is a loop. 


0 
T= (by) with by -ἑ _] (5.319) 
-0 


11. Weighted Graphs 
If G = (V, E) is a graph and f is a mapping assigning a real number to every edge, then (V, E, f) is 
called a weighted graph, and f(e) is the weight or length of the edge e. 
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In applications, these weights of the edges represent costs resulting from the construction, maintenance 
or use of the connections. 


5.8.2 Traverse of Undirected Graphs 


5.8.2.1 Edge Sequences or Paths 
1. Edge Sequences or Paths 


In an undirected graph G = (V, E) every sequence F = ({v1, vo], (v9, V3}, ..., {Us, Use }) of the ele- 
ments of E is called an edge sequence of length s. 1 

If v; = vs41, then the sequence is called a cycle, otherwise it is an open edge 

sequence. An edge sequence F is called a path iff v4, vo, ..., v, are pairwise 

distinct vertices. A closed pathis a circuit. A trailis a sequence of edges with- 5 2 


out repeated edges. 

E In the graphs in Fig. 5.38, Fı = ({1,2}, {2,3}, {3, 5}, { 

an edge sequence of length 5, Fh = ({1,2}, {2,3}, {3, 4}, {4,2}, (2. 1]) is 

a cycle of length 5, F3 = ({2,3}, {3,5}, (5,21, (2,1]) is a path, Fy = 4 8 
({1, 2}, {2,3}, (3, 4]) is a path. An elementary cycle is given by F; = . 
({1, 2}, {2,5}, (5.1). Ee oe 


, (5,21, (2, 4}) is 


2. Connected Graphs, Components 

If there is at least one path between every pair of distinct vertices v, w in a graph G, then G is called 
connected. If a graph G is not connected, it can be decomposed into components, i.e., into induced 
connected subgraphs with maximal number of vertices. 


3. Distance Between Vertices 
The distance δ(υ, w) between two vertices v, w of an undirected graph is the length of a path with 
minimum number of edges connecting v and w. If such a path does not exist, then let ó(v, w) = oc. 


4. Problem of Shortest Paths 
Let G = (V, E, f) be a weighted simple graph with f(e) > 0 for every e € E. Determine the shortest 
path from v to w for two vertices v, w of G, i.e., a path from v to w having minimum sum of weights of 
edges and arcs, respectively. 
There is an efficient algorithm of Dantzig to solve this problem, which is formulated for directed graphs 
and can be used for undirected graphs (see 5.8.6, p. 410) in a similar way. 
Every graph G = (V, E, f) with V = (vi, vs, ... , Un} has a distance matriz D of type (n, n): 

D = (αμ) with dij = (vi, vj) (4,3 1,2,...,n). (5.320 
In the case that every edge has weight 1, i.e., the distance between v and w is equal to the minimum 
number of edges which have to be traversed in the graph to get from v to w, then the distance between 
two vertices can be determined using the adjacency matrix: Let v1, vo, ..., v, be the vertices of G. The 
adjacency matrix of G is A = (a;;), and the powers of the adjacency matrix with respect to the usua 
multiplication of matrices (see 4.1.4, 5., p. 272) are denoted by A" = (ai), m € N. 
There is a shortest path of length k from the vertex v; to the vertex v; (i Z 1) iff: 


aj Z0 and aj —0 (s—1,2,..,k— 1). (5.321 


W The weighted graph represented in Fig. 5.39 has the distance matrix D beside it. 

W Tho graph represented in Fig. 5.40 has the adjacency matrix A beside it, and for m — 2 or m — 3 
the matrices A? and A? are obtained. Shortest paths of length 2 connect the vertices 1 and 3, 1 and 4, 
1 and 5, 2 and 6, 3 and 4, 3 and 5, 4 and 5. Furthermore the shortest paths between the vertices 1 and 
6, 3 and 6, and finally 4 and 6 are of length 3. 
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Figure 5.40 
5.8.2.2 Euler Trails 
1. Euler Trail, Euler Graph 


A trail containing every edge of a graph G is called an open or closed Euler trail of G. 


A connected graph containing a closed Euler trail is an Euler graph. 


W The graph (σι (Fig. 5.41) has no Euler trail. The graph G3 (Fig. 5.42) has an Euler trail, but it 
is not an Euler graph. The graph σα (Fig. 5.43) has a closed Euler trail, but it is not an Euler graph. 


The graph G4 (Fig. 5.44) is an Euler graph. 


x SGA BW 


Figure 5.41 Figure 5.42 Figure 5.43 


2. Theorem of Euler-Hierholzer 


Figure 5.44 


A finite connected graph is an Euler graph iff all vertices have positive even degrees. 


3. Construction of a Closed Euler Trail 


If G is an Euler graph, then one chooses an arbitrary vertex v; of G and constructs a trail F by traversing 
a path, starting at v; and proceeding until it cannot be continued. If F} does not yet contain all edges of 
G, then one constructs another path F containing the edges not in F}, but starting at a vertex v € Fi 
and proceeds until it cannot be continued. Then one composes a closed trail in G using F} and F>: 
Starting to traverse F} at v, until v9 is reached, then continuing to traverse Fy, and finishing at the 
edges of ΡῚ not used before. Repeating this method a closed Euler trail is obtained in finitely many 


steps. 
4. Open Euler Trails 


There is an open Euler trail in a graph G iff there are exactly two vertices in G with odd degrees. 


Fig. 5.45 shows a graph which has no closed Euler trail, but it has an open 


Euler trail. The edges are 


consecutively enumerated with respect to an Euler trail. In Fig. 5.46 there is a graph with a closed 


Euler trail. 


406 5. Algebra and Discrete Mathematics 


Figure 5.45 Figure 5.46 Figure 5.47 


5. Chinese Postman Problem 
The problem, that a postman should pass through all streets in his service area at least once and return 
to the initial point and use a trail as short as possible, can be formulated in graph theoretical terms as 
follows: Let G = (V, E, f) be a weighted graph with f(e) > 0 for every edge e € E. Determine an edge 
sequence F with minimum total length 

L- M f(e). (5.322) 

ecF 

'The name of the problem refers to the Chinese mathematician Kuan, who studied this problem first. 
Το solve it two cases are distinguished: 
1. G is an Euler graph - then every closed Euler trail is optimal — and 
2. G has no closed Euler trail. 
An effective algorithm solving this problem is given by Edmonds and Johnson (see [5.25]). 


5.8.2.3 Hamiltonian Cycles 


1. Hamiltonian Cycle 

A Hamiltonian cycle is an elementary cycle in a graph covering all of the vertices. 

W In Fig. 5.47, lines in bold face show a Hamiltonian cycle. 

The idea of a game to construct Hamiltonian cycles in the graph of a pentagondodecaeder, goes back 
to Sir W. Hamilton. 

Remark: The problem of characterizing graphs with Hamiltonian cycles leads to one of the classical 
NP-complete problems. Therefore, an efficient algorithm to determine the Hamilton cycles cannot be 
given here. 

2. Theorem of Dirac 

If a simple graph G = (V, E) has at least three vertices, and dg(v) > |V|/2 holds for every vertex v of 
G, then G has a Hamiltonian cycle. This is a sufficient but not a necessary condition for the existence 
of Hamiltonian cycles. The following theorems with more general assumptions give only sufficient but 
not necessary conditions for the existence of Hamilton cycles, too. 


W Fig. 5.48 shows a graph which has a Hamiltonian cycle, but does 
not satisfy the assumptions of the following theorem of Ore. 
3. Theorem of Ore 
If a simple graph G = (V, E) has at least three vertices, and dc (v) + 
ἄσ(ω) > |V| holds for every pair of non-adjacent vertices v, w, then G 
contains a Hamiltonian cycle. 
4. Theorem of Posa 
Let G = (V, E) bea simple graph with at least three vertices. There is 
Figure 5.48 a Hamiltonian cycle in G if the following conditions are satisfied: 
1. For 1 € k < (|V| — 1)/2, the number of vertices of degree not exceeding k is less than k. 
2. If |V] is odd, then the number of vertices of degree not exceeding (|V| — 1)/2 is less than or equal to 
(V| = 9/2. 
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Figure 5.49 Figure 5.50 Figure 5.51 


5.8.3 Trees and Spanning Trees 


5.8.3.1 Trees 
1. Trees 


An undirected connected graph without cycles is called a tree. Every tree with at least two vertices has 
at least two vertices of degree 1. Every tree with n vertices has exactly n — 1 edges. 

A directed graph is called a tree if G is connected and does not contain any circuit (see 5.8.6, p. 410). 
W Fig. 5.49 and Fig. 5.50 represent two non-isomorphic trees with 14 vertices. They demonstrate 
the chemical structure of butane and iso-butane. 


2. Rooted Trees 

A tree with a distinguished vertex is called a rooted tree, and the distinguished vertex is called the 
root. In diagrams, the root is usually on the top, and the edges are directed downwards from the root 
(see Fig. 5.51). Rooted trees are used to represent hierarchic structures, as for instance hierarchies in 
factories, family trees, grammatical structures. 

W Fig. 5.51 shows the genealogy of a family in the form of a rooted tree. The root is the vertex assigned 
to the father. 


3. Regular Binary Trees 

If a tree has exactly one vertex of degree 2 and otherwise only vertices of degree 1 or 3, then it is called 
a regular binary tree. 

The number of vertices of a regular binary tree is odd. Regular trees with n vertices have (n + 1)/2 
vertices of degree 1. The level of a vertex is its distance from the root. The maximal level occurring 
in a tree is the height of the tree. There are several applications of regular binary rooted trees, e.g., in 
informatics. 


4. Ordered Binary Trees 


Arithmetical expressions can be represented by binary trees. Here, the numbers and variables are as- 
signed vertices of degree 1, the operations “+” ,“—”", “.” correspond to vertices of degree > 1, and the 
left and right subtree, respectively, represents the first and second operand, respectively, which i is, in 
general, also an expression. These trees are called ordered binary trees. 
The traverse of an ordered binary tree can be performed in three different ways, which are defined in a 
recursive way (see also Fig. 5.52): 
Inorder traverse: Traverse the left subtree of the root (in inorder traverse), 
visit the root, 
traverse the right subtree of the root (in inorder traverse). 
Preorder traverse: Visit the root, 
traverse the left subtree (in preorder traverse), 
traverse the right subtree of the root (in preorder traverse). 
Postorder traverse: Traverse the left subtree of the root (in postorder traverse), 
traverse the right subtree of the root (in postorder traverse), 
visit the root. 
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Using inorder traverse the order of the terms does not change in comparison with the given expression. 
The term obtained by postorder traverse is called postfix notation PN or Polish notation. Analogously, 
the term obtained by preorder traverse is called prefiz notation or reversed Polish notation. 

Prefix and postfix expressions uniquely describe the tree. This fact can 

be used for the implementation of trees. (+) 

E In Fig. 5.52 the term a-(b—c) +d is represented by a graph. Inorder 

traverse yields a - b — c + d, preorder traverse yields + - —bcad, and o (d) 
postorder traversal yields abc — -d+. 


5.8.3.2 Spanning Trees @) ο 


1. Spanning Trees E © 
A tree, being a subgraph of an undirected graph G, and containing all 
vertices of G, is called a spanning tree of G. Every finite connected graph 


: à Figure 5.52 
G contains a spanning tree H: 


If G contains a cycle, then delete an edge of this cycle. 


The remaining graph (αι is still connected and can be i 1 

transformed into a connected graph G» by deleting a fur- 

ther edge of a cycle of G4, if there exists such an edge. Af- 2 2 3 1 
ter finitely many steps a spanning tree of G is obtained. 3 4 

W Fig. 5.54 shows a spanning tree H of the graph G Figure 5.53 Figure 5.54 


shown in Fig. 5.53. 
2. Theorem of Cayley 
Every complete graph with n vertices (n > 1) has exactly n"^? spanning trees. 
3. Matrix Spanning Tree Theorem 
Let G = (V, E) bea graph with V = {v1, v2,...,Un} (n > 1) and E = {e1,€2,...,em}. Define a matrix 
D = (dij) of type (n, n): 
0 for i Æj, 
dg(u;) for i = j, 
which is called the degree matrix. The difference between the degree matrix and the adjacency matrix 
is the admittance matrix L of G: 
L-D-A. (5.323b) 
Deleting the i-th row and the i-th column of L the matrix L; is obtained. The determinant of L; is 
equal to the number of spanning trees of the graph G. 


dij = (5.323a) 


W The adjacency matrix, the degree matrix and the admittance matrix of the graph in Fig. 5.53 are: 


2110 4000 2-1-1 0 
1020 0300 -1 3-2 0 
A=l1201]? D=| 9040] πε 3-3 E 
0010 0001 0 0-1 1 


Since detL3 = 5, the graph has five spanning trees. 
4. Minimal Spanning Trees 
Let G = (V, E, f) beaconnected weighted graph. A spanning tree H of G is called a minimum spanning 
tree if its total length f(H) is minimum: 
ΚΗ) - X fle). (5.324) 
ecH 
Minimum spanning trees are searched for, e.g., if the edge weights represent costs, and one is interested 
in minimum costs. A method to find a minimum spanning tree is the Kruskal algorithm: 
a) Choose an edge with the least weight. 
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b) Continue, as long as it is possible, choosing a further edge having least weight and not forming a 
cycle with the edges already chosen, and add such an edge to the tree. 

In step b) the choice of the admissible edges can be made easier by the following labeling algorithm: 
e Let the vertices of the graph be labeled pairwise differently. 

e At every step, an edge can be added only in the case that it connects vertices with different labels. 

e After adding an edge, the label of the endpoint with the larger label is changed to the value of the 
smaller endpoint label. 


5.8.4 Matchings 


1. Matchings 


A set M of edges of a graph G is called a matching in G, iff M contains no loop and two different edges 

of M do not have common endpoints. 

A matching M* of G is called a saturated matching, if there is no matching M in G such that M* C M. 

A matching M** of G is called a maximum matching, if there is no matching M in G such that |M| > 

|M**]. 

If M is a matching of G such that every vertex of G is an endpoint of an edge of M, then M is called a 
1 perfect matching. 


6 " 5 Β In the graph in Fig. 5.55 Mı = {{2,3},{5,6}} is a saturated 
matching and M» = {{1,2}, {3,4}, {5, 6}} 15 a maximum matching 
5 3 which is also perfect. 


Remark: In graphs with an odd number of edges there is no perfect 
Figure 5.55 matching. 


2. Theorem of Tutte 

Let q(G — S) denote the number of the components of G -- with an odd number of vertices. A graph 
G = (V, E) has a perfect matching iff |V | is even and for every subset S of the vertex set q(G — S) < |S]. 
Here G — S denotes the graph obtained from G by deleting the vertices of S and the edges incident with 
these vertices. 

Perfect machings exist for example in complete graphs with an even number of vertices, in complete 
bipartite graphs πι and in arbitrary regular bipartite graphs of degree r > 0. 

3. Alternating Paths 

Let G be a graph with a matching M. A path W in G is called an alternating path 18 in W every edge 
e with e € M (or e ¢ M) is followed by an edge e' with e' ¢ M (or e € M). 

An open alternating path is called an increasing path iff none of the endpoints of the path is incident 
with an edge of M. 


4. Theorem of Berge 

Α matching M in a graph G is maximum iff there is no increasing alternating path in G. 

If W is an increasing alternating path in G with corresponding set E(W) of traversed edges, then 
M' = (MN E(W)) U (E(W)N M) forms a matching in G with |M’| = |M]| + 1. 

WB In the graph of Fig. 5.55 ({1, 2}, {2,3}, (3, 4}) is an increasing alternating path with respect to 
matching Mı. Matching M» with |λ49| = |Mi| + 1 is obtained as described above. 

5. Determination of Maximum Matchings 

Let G be a graph with a matching M. 

a) First form a saturated matching M* with M C M*. 

b) Chose a vertex v in G, which is not incident with an edge of M*, and determine an increasing 
alternating path in G starting at v. 

c) If such a path exists, then the method described above results in a matching M’ with |M’| > | M*]. 
If there is no such path, then delete vertex v and all edges incident with v in G, and repeat step b). 
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There is an algorithm of Edmonds, which is an effective method to search for maximum matchings, but 
it is rather complicated to describe (see [5.24]). 


5.8.5 Planar Graphs 
Here, the considerations are restricted to undirected graphs, since a directed graph is planar iff the 
corresponding undirected graph is a planar one. 


1. Planar Graph 
A graph is called a plane graph iff G can be drawn in the plane 
with its edges intersecting only in vertices of G. A graph iso- 
morphic with a plane graph is called a planar graph. 
Fig. 5.56 shows a plane graph σι. The graph G in Fig. 5.57 
is isomorphic to G4, it is not a plane graph but a planar graph, 
since it is isomorphic with G4. 
2. Non-Planar Graphs 

Figure 5.56 Figure 5.57 The complete graph K; and the complete bipartite graph 3.3 
are non-planar graphs (see 5.8.1, 5., p. 402). 


3. Subdivisions 

A subdivision of a graph G is obtained if vertices 
of degree 2 are inserted into edges of G. Every 
graph is a subdivision of itself. Certain subdivi- 
sions of K; and K3, are represented in Fig. 5.58 
and Fig. 5.59. 

4. Kuratowski's Theorem 

A graph is non-planar iff it contains a subgraph 
which is a subdivision either of the complete bipar- Figure 5.58 Figure 5.59 
tite graph K; 3 or of the complete graph ΑΣ. 


5.8.6 Pathsin Directed Graphs 


1. ArcSequences 

A sequence F = (e1, €2,.. . , €s) of arcs in a directed graph is called a chain of length s, iff F does not 
contain any arc twice and one of the endpoints of every arc e; for i = 2,3,...,s — 1 is an endpoint of 
the arc 6 1 and the other one an endpoint of e;,,. 


A chain is called a directed chain iff for i = 1,2,...,s— 1 the terminal point of the arc e; coincides with 
the initial point of e;,4. 

Chains or directed chains traversing every vertex at most once are called elementary chains and ele- 
mentary directed chains, respectively. 

A closed chain is called a cycle. A closed directed path, with every vertex being the endpoint of exactly 
two arcs, is called a circuit. 

W Fig. 5.60 contains examples for various kinds of arc sequences. 

2. Connected and Strongly Connected Graphs 

A directed graph G is called connected iff for any two vertices there is a chain connecting these vertices. 
The graph G is called strongly connected iff to every two vertices v, w there is is assigned a directed 
chain connecting these vertices. 

3. Algorithm of Dantzig 

Let G = (V, E, f) be a weighted simple directed graph with f(e) > 0 for every arc e. The following 
algorithm yields all vertices of G, which are connected with a fixed vertex v, by a directed chain, together 
with their distances from vj: 

a) Vertex vı gets the label t(v1) = 0. Let δι = {v1}. 
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g^ qu LI 


chain directed elementary elementary directed cycle circuit 
chain chain chain 


Figure 5.60 
b) The set of the labeled vertices is Sm. 
c) If Um = (ele = (vi, vj) € E, vi € Sm, vj Z Sm} = 0, then one finishes the algorithm. 
d) Otherwise one chooses an arc e* = (x*, y*) with minimum t(x*) + f(e*). One labels e* and y* and 
puts t(y*) = t(a*) + f(e*) and also Sm+1 = Sm U (y*) and repeats b) with m :— m + 1. 
(If all arcs have weight 1, then the length of a shortest directed chain from a vertex v to a vertex w can 
be found using the adjacency matrix (see 5.8.2.1, 4., p. 404)). 


If a vertex v of G is not labeled, then there is no di- 
rected path from v4 to v. 

If v has label t(v), then t(v) is the length of such a 
directed chain. A shortest directed path from v, to 
v can be found in the tree given by the labeled arcs 
and vertices, the distance tree with respect to v1. 
W In Fig. 5.61, the labeled arcs and vertices rep- 
resent the distance tree with respect to v, in the 
graph. The lengths of the shortest directed chains 
are: 


from v to vs : 2 from tovg: 7 
from v4 to v; : 3 from ti to vg : 7 
from v4 to ug : 3 from v tovg: 8 
from v, to v» : 4 from v, tovs : 8 
from v, to vio: 5 from w to 0] : 9 
from v4 to v, : 6 from, tovg: 10 
from v; tov: 6. 


Remark: There is also a modified algorithm to 
Figure 5.61 find the shortest directed chains in the case that 
G = (V,E, f) has arcs with negative weights. 


5.8.7 Transport Networks 


1. Transport Network 

A connected directed graph is called a transport network if it has two labeled vertices, called the source 
Q and sink S which have the following properties: 

a) There is an arc u; from δ to Q, where "η is the only arc with initial point S and the only are with 
terminal point Q. 

b) Every arc u; different from u; is assigned a real number c(u;) > 0. This number is called its capacity. 
The arc u, has capacity oo. 

A function y, which assigns a real number to every arc, is called a flow on G, if the equality 


35 wv(wv)- M. e(vw) (5.325a) 


(u,v)eG (v,w)eG 
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holds for every vertex v. The sum 


D e(Q.v) (5.325b) 
(Q)eG 

is called the intensity of the flow. A flow y is called compatible to the capacities, if for every arc u; of G 
0 οι) € c(u;) holds. 
W For an example of a transport network see p. 412. 
2. Maximum Flow Algorithm of Ford and Fulkerson 
Using the maximum flow algorithm one can recognize whether a given flow y is maximal. 
Let G bea transport network and o a flow of intensity v; compatible with the capacities. The algorithm 
given below contains the following steps for labeling the vertices, and after finishing this procedure one 
can realize how much the intensity of the flow could be improved depending on the chosen labeling 
steps. 
a) One labels the source Q and sets ε(ϱ) = oc. 
b) If there is an arc u; = (x, y) with labeled x and unlabeled y and g(u;) < c(u;), then one labels y and 
(x,y), and sets e(y) = min{e(x), c(u;) — y(u;)}, then one repeats step b), otherwise follows step c). 
c) If there is an arc u; = (x, y) with unlabeled x and labeled y, p(u;) > 0 and u; # u1, then one labels 
x and (x,y), substitutes e(x) = min{e(y), p(u;)} and returns to continue step b) if it is possible. 
Otherwise one finishs the algorithm. 
If the sink S of G is labeled, then the flow in G can be improved by an amount of e(S). If the sink is 
not labeled, then the flow is maximal. 


W Maximum flow: For the graph in Fig. 5.62 the weights are written next to the edges. A flow with 
intensity 13, compatible to these capacities, is represented in the weighted graph in Fig. 5.63. It isa 
maximum flow. 


Figure 5.62 


Wi Transport network: A product is produced 
by p firms Fi,PF5...,F,. There are q users 
Vi, V5, .., Vj. During a certain period there will be 
s; units produced by F; and t; units required by Ν᾽. 
cij units can be transported from F; to V; during 
the given period. Is it possible to satisfy all the re- 
quirements during this period? The corresponding 
graph is shown in Fig. 5.64. 


P, 
Figure 5.64 
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5.9 Fuzzy Logic 


5.9.1 Basic Notions of Fuzzy Logic 
5.9.1.1 Interpretation of Fuzzy Sets 


Real situations are very often uncertain or vague in a number of ways. The word “fuzzy” also means 
some uncertainty, and the name of fuzzy logic is based on this meaning. Basically there are to distin- 
guish two types of fuzziness: vagueness and uncertainty. There are two concepts belonging here: The 
theory of fuzzy sets and the theory of fuzzy measure. In the following practice-oriented introduction 
the notions, methods, and concepts of fuzzy sets are discussed, which are the basic mathematical tools 
of multi-valued logic. 


1. Notions of Classical and Fuzzy Sets 

The classical notion of (crisp) set is two-valued, and the classical Boolean set algebra is isomorphic to 
two-valued propositionallogic. Let X be a fundamental set named the universe. Then for every A C X 
there exists a function 


fa: X > {0,1}, (5.326) 
such that it says for every x € X whether this element x belongs to the set A or not: 
fa(r)21er€ A and fa(r)-0 e vd A. (5.326b) 


The concept of fuzzy sets is based on the idea of considering the membership of an element of the set as a 
statement, the truth value of which is characterized by a value from the interval [0, 1]. For mathematical 
modeling of a fuzzy set A a function is necessary whose range is the interval [0, 1] instead of {0,1}, i.e.: 
μα: X > 0,1]. (5.327) 
In other words: To every element x € X is to assign a number j14(x) from the interval [0, 1], which 
represents the grade of membership of x in A. The mapping p4 is called the membership function. The 
value of the function j,4(x) at the point x is called the grade of membership. The fuzzy sets A, B,C, 
etc. over X are also called fuzzy subsets of X. The set of all fuzzy sets over X is denoted by F(X). 
2. Properties of Fuzzy Sets and Further Definitions 
The properties below follow directly from the definition: 
(E1) Crisp sets can be interpreted as fuzzy sets with grade of membership 0 and 1. 
(E2) The set of the arguments x, whose grade of membership is greater than zero, i.e., jja(x) > 0, is 
called the support of the fuzzy set A: 
supp(A) = {x € X | na(x) > 0}. (5.328) 
The set ker(A) = (x € X : na(x) = 1} is called the kernel or core of A. 
(E3) Two fuzzy sets A and B over the universe X are equal if the values of their membership functions 
are equal: 
A — B, if a(x) = p(x) holds for every x € X. (5.329) 
(E4) Discrete representation or ordered pair representation: If the universe X is finite, i.e., 
X = {x1,%2,..., En} it is reasonable to define the membership Table 5 


function of the fuzzy set with a table of values. The tabular 
representation of the fuzzy set Α is seen in Table 5.7. 


.7 Tabular representation of a 
fuzzy set 


ee A ET 2 mer | 
Also it is possible to write [ falar) | ala) |- | naQ) | 


A :— na(zi)/21 +-+ + pa(tn)/tn = 2 μασ) /2;. (5.950) 


In (5.330) the fraction bars and addition signs have only symbolic meaning. 
(E5) Ultra-fuzzy set: A fuzzy set, whose membership function itself is a fuzzy set, is called, after Zadeh, 
an ultra-fuzzy set. 
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3. Fuzzy Linguistics 

Assigning linguistic values, e.g., “small”, “medium” or “big”, to a quantity then it is called a linguistic 
quantity or linguistic variable. Every linguistic value can be described by a fuzzy set, for example, by 
the graph of a membership function (5.9.1.2) with a given support (5.328). The number of fuzzy sets 
(in the case of “small”, “medium”, “big” they are three) depends on the problem. 

In 5.9.1.2 the linguistic variable is denoted by x. For example, x can have linguistic values for temper- 
ature, pressure, volume, frequency, velocity, brightness, age, wearing, etc., and also medical, electrical, 
chemical, ecological, etc. variables. 

W By the membership function j/4(x) of a linguistic variable, the membership degree of a fixed (crisp) 
value can be determined in the fuzzy set represented by µα(α). Namely, the modeling of a “high” 
quantity, e.g., the temperature, as a linguistic variable given by a trapezoidal membership function 
(Fig. 5.65) means that the given temperature α belongs to the fuzzy set ^high temperature" with the 
degree of membership £ (also degree of compatibility or degree of truth). 


5.9.1.2. Membership Functions on the Real Line 


'The membership functions can be modeled by functions with values between 0 and 1. They represent 
the different grade of membership for the points of the universe being in the given set. 

1. Trapezoidal Membership Functions 

Trapezoidal membership functions are widespread. Piecewise (continuously differentiable) member- 
ship functions and their special cases, e.g., the triangle shape membership functions described in the 
following examples, are very often used. Connecting fuzzy quantities gives smoother output functions 


if the fuzzy quantities were represented by continuous or piecewise continuous membership functions. 
W A: Trapezoidal function (Fig. 5.65) corresponding to (5.331). The graph of this function 
turns into a triangle function 
Neri if ao = a3 = a and αι < 
0 T <a, a < a4. Choosing differ- 
1 » ent values for a1, ..., a4 gives 
: pecu dj € T € à2, symmetrical or asymmetri- 
05 ! a2 — 4 cal trapezoidal functions, a 
Bras ! =41 az € x < az , (5.331) symmetrical triangle func- 
: tion (a2 = ag = a and |a — 
- i αι - X rT Te αι] = |as — al) or asymmet- 
0 a®a, à, a,x à4 — ag rical triangle function (a — 

0 zl, az = aand |a — αι] Z |as 

Figure 5.65 al). 


BW B: Membership function bounded to the left and to the right (Fig. 5.66) corresponding to (5.332): 


μισο 1 2 <, 
τμ az --0 
-------- 4 <<a, 
az — ay, 
μα(α) -- 4 0 a2 X X < da, (5.332) 
X — as 
-i 03 < T < a4, 
0 a a, a8, ay Q4 — 43 
1 αι € m. 


Figure 5.66 
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WB C: Generalized trapezoidal function (Fig. 5.67) corresponding to (5.333). 


0 z <a, 
bo(x — 
Li a Qı < T < d», 
az — ay 
oa bs — by) (x — a; 
i ! (bs — bo) (a= a2) |, az € £ € as, 
EE az — αρ 
| μα(α) = b =by = 1 a3 < T < 4, (5.333) 
E b, — bs)(a4— x 
(a= bsa =a) ae a<r < as, 
ἂν a, as a5 — αι 
bs (a6 — x) " 
Figure 5.67 Cc a5 < t «αρ, 
0 ag στ. 
2. Bell-Shaped Membership Functions 0 d 


W A: A class of bell-shaped, differentiable membership 

functions is given by the function f(x) from (5.334) by — f(z)— ελ ασ x <b, (5.334) 
choosing an appropriate p(x): 

For p(x) = k(x — a)(b — x) and, e.g., k = 10 or k = 1 or 0 xb. 

k — 0.1, there is a family of symmetrical curves of different 


b b 
width with the membership function alx) = f(x) i f ( t } where 1 / f (2) is the normal- 


2 

izing factor (Fig. 5.68). The exterior curve follows with the value k — 10 and the interior one with 
k — 04. 

Asymmetrical membership functions in [0,1] follow e.g. for p(w) = z(1— x)(2 — x) or for p(x) = 
z(1— x)(x 4-1) (Fig. 5.69), using appropriate normalizing factors. The factor (2 — x) in the first poly- 
nomial results in the shifting of the maximum to the left and it yields an asymmetrical curve shape. 
Similarly, the factor (x + 1) in the second polynomial results in a shifting to the right and in an asym- 
metric form. 


ul Tos) 
0.5 0.5 
0 ' » 
? uh bx 0 05 Tx 
Figure 5.68 Figure 5.69 
W B: A more flexible class of membership functions can be got by the formula 
f (t(u)) du 
F,(z) = 72,—— —— —, (5.335) 
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where f is defined by (5.334) with p(x) = (x — a)(b — x) and t is a transformation on [a,b]. If t is 
à smooth transformation on [a,b], i.e., if t is differentiable infinitely many times in the interval [a, b], 
then F; is also smooth, since f is smooth. Requiring t to be either increasing or decreasing and to be 
smooth, then the transformation t allows to change the shape of the curve of the membership function. 
In practice, polynomials are especially suitable for transformations. The simplest polynomial is the 
identity t(x) = x on the interval [a,b] = [0, 1]. 


The next simplest polynomial with the given properties is t(x) = —: 


3 

constant c € [—6, 3]. The choice c = —6 results in the polynomial of maximum curvature, its equation 
is q(x) = 4r? — 6x? + 3x. Choosing for qo the identity function, i.e., go(x) = x, then can be got 
recursively further polynomials q by the formula q; = q0 qi—ı for i € IN. Substituting the corresponding 
polynomial transformations qo, qı, ... into (5.335) for t, gives a sequence of smooth functions Fjo, Fy, 


"A (1 = s) x with a 


and F,, (Fig. 5.70), which can be considered as membership functions p4 (x), where F}, converges to 
a line. The trapezoidal membership function can be approximated by differentiable functions using the 
function Fy», its reflection and a horizontal line (Fig. 5.71). 
πα ^u) 
Aik 
0.5 0.5F 
0 1 : T 0 : η : Li» 
0 025 05 075 1x 0 025 05 075 1x 
Figure 5.70 Figure 5.71 


Summary: Imprecise and non-crisp information can be described by fuzzy sets and represented by 
membership functions u(x). 
5.9.1.3 Fuzzy Sets 
1. Empty and Universal Fuzzy Sets 
a) Empty fuzzy set: A set A over X is called empty if uaA(v) = 0 Va € X holds. 
b) Universal fuzzy set: A set is called universal if p A(r) = 1 Vx € X holds. 
2. Fuzzy Subset 
If p(x) € ua(x) V x € X, then B is called a fuzzy subset of A (one writes: B C A). 
3. Tolerance Interval and Spread of a Fuzzy Set on the Real Line 
If A is a fuzzy set on the real line, then the interval 

[a,b] = (x € X|ua(x) =1} (a,b const, a < b) (5.336) 
is called the tolerance interval of the fuzzy set A, and the interval [c, d] = cl(suppA) (c, d const,c < d) 
is called the spread of A, where cl denotes the closure of the set. (The tolerance interval is sometimes 
also called the peak of set A.) The tolerance interval and the kernel coincide only if the kernel contains 
more then one point. 


W A: In Fig. 5.65 [a5, a3] is the tolerance interval, and [a4, a4] is the spread. 

WB D: a = az = a (Fig. 5.65), gives a triangle-shaped membership function μ. In that case the 
triangular fuzzy set has no tolerance, but its kernel is the set {a}. If additionally a, = a = αι holds, 
too, then a crisp value follows; it is called a singleton. A singleton A has no tolerance, but ker(A) = 
supp(A) — (a]. 

4. Conversion of Fuzzy Sets on a Continuous and Discrete Universe 

Let the universe be continuous, and let a fuzzy set be given on it by its membership function. Discretiz- 
ing the universe, every discrete point together with its membership value determines a fuzzy singleton. 
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Conversely, a fuzzy set given on a discrete universe can be converted into a fuzzy set on the continuous 
universe by interpolating the membership value between the discrete points of the universe. 
5. Normal and Subnormal Fuzzy Sets 
If A is a fuzzy subset of X, then its height is defined by 

H(A) := max {pa(x)|a € X}. (5.337) 
A is called a normal fuzzy set if H(A) = 1, otherwise it is subnormal. 
The notions and methods represented in this paragraph are limited to normal fuzzy sets, but it easy to 
extend them also to subnormal fuzzy sets. 


6. Cut of a Fuzzy Set 
The a cut A?? or the strong a cut A?® of a fuzzy set A are the subsets of X defined by 


A?^* = {x € X|ua(x) > a}, A?* = {σε X|ua(x) >a}, o € (0,1). (5.338) 
and A?9 = cl (479). The α cut and strong a cut are also called a-level set and strong a-level set, 
respectively. 


1. Properties 

a) The a cuts of fuzzy sets are crisp sets. 

b) The support supp(A) is a special a cut: supp(.A) = A??. 

c) The crisp 1 cut A?! = (x € Χ]μα(α) = 1} is called the kernel of A. 

2. Representation Theorem 

To every fuzzy subset A of X can be assigned uniquely the families of its a cuts (A7?), 2; and its 


strong a cuts (427) The a cuts and strong a cuts are monotone families of subsets from X, 


a€(0,1]* 
since: 

a«-A^*2A4^? and A?* D A”. (5.3392) 
Conversely, if there exist the monotone families (Ua)aejo,1) or (Va)aea,1) of subsets from X, then there 
are uniquely defined fuzzy sets U and V such that U^^ = Ua and V2? = V, and moreover 

µυ(α) = sup{a € [0, 1))|z € Ua}, Lv (x) = sup(a € (0,1]|x € Va}. (5.339b) 
7. Similarity of the Fuzzy Sets Aand B 
1. The fuzzy sets A, B with membership functions 44, 4g: X — [0,1] are called fuzzy similar if for 
every a € (0, 1] there exist numbers o; with a; € (0, 1]; (i = 1, 2) such that: 

supp(@i/t4)a C supp(up)a, supp(2/1z)a C supp(ua)a. (5.340 


ο. " : re : — f olx) if uc(r) > a 
(µο)α represents a fuzzy set with the membership function (µο)α = { 0 otherwise 


B if pole) > B 


0 otherwise. 


and (Buc 
represents a fuzzy set with the membership function (buc) = { 


2. Theorem: Two fuzzy sets A, B with membership functions 44, yg: X — [0,1] are fuzzy-similar i 
they have the same kernel: 


οιρρ(μα}ι = supp(up). (5.341a 
since the kernel is equal to the 1 cut, i.e. 
supp(ua) = {x € Χ|μα(α) = 1). (5.341b 


3. A, B with ua, ug: X — [0,1] are called strongly fuzzy-similar if they have the same support and 
the same kernel: 
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supp(ua): = supp(up)i. (5.342a) supp(ua)o = supp(ug)o. (5.342b) 


5.9.2 Connections (Aggregations) of Fuzzy Sets 
Fuzzy sets can be aggregated by operators. There are several different suggestions of how to generalize 
the usual set operations, such as union, intersection, and complement of fuzzy sets. 


5.9.2.1 Concepts for Aggregations of Fuzzy Sets 


1. Fuzzy Set Union, Fuzzy Set Intersection 
The grade of membership of an arbitrary element x € X in the sets AU B and AN B should depend 
only on the grades of membership µα(α) and μρ(α) of the element in the two fuzzy sets A and B. The 
union and intersection of fuzzy sets is defined with the help of two functions 

s,t: [0,1] x [0,1] > [0, 1] 
and they are defined in the following way: 


pavele) = 5(μα(α).μρία)). (55344) μληρ(α) = t (uala), un(m)). (6.345) 


The grades of membership j(4(x) and jug(x) are mapped in a new grade of membership. The functions 
t and s are called the { norm and t conorm; this last one is also called the s norm. 


(5.343) 


Interpretation: The functions μαι)» and pang represent the truth values of membership, which is 
resulted by the aggregation of the truth values of memberships μα(α) and μμ(α). 
2. Definition of the tNorm: 
The t norm is a binary operation t in [0, 1]: 

t: [0,1] x [0,1] > [0,1]. (5.346 
It is symmetric, associative, monotone increasing, it has 0 as the zero element and 1 as the neutra 
element. For x, y, z, v, w € [0,1] the following properties are valid: 


(E1) Commutativity: t(r,y) = t(y, x). (5.347a 
(E2) Associativity: t(r,t(y. z)) = t(t(x, y), z). (5.34Tb 
(E3) Special Operations with Neutral and Zero Elements: 

t(x,1) = rand because of (E1): t(1,r) — zx; t(x,0)= t(0, x)= 0. (5.347¢ 
(E4) Monotony: lfr € vandy € w, then t(x,y) < t(v, w) is valid. (5.347d 


3. Definition of the sNorm: 


The s norm is a binary function in {0, 1]: 


s: [0,1] x [0,1] > [0, 1]. (5.348 
It has the following properties: 
(E1) Commutativity: s(x, y) = s(y, £). (5.349a 
(E2) Associativity: s(x, s(y, z)) = s(s(x, y), 2). (5.349b 
(E3) Special Operations with Zero and Neutral Elements: 

s(x,0) = ο(0,α) = v; s(x, 1) = s(1,2) — 1. (5.349c 
(E4) Monotony: Ifv X vand y € w, then s(x,y) € s(v,w) is valid. (5.349d 


With the help of these properties a class T' of t norms and a class S of s norms can be introduced. 
Detailed investigations proved that the following relations hold: 
min(z,y) > t(x,y) Vt € T, Vr,y € [0,1] and (5.349e 


max(z,y) € s(z.y) Vs € S, Va,y € [0,1]. (5.349f 


5.9 Fuzzy Logic 419 


5.9.2.2 Practical Aggregation Operations of Fuzzy Sets 
1. Intersection of Two Fuzzy Sets 


The intersection A N B of two fuzzy sets A and B is defined by the minimum operation min(.,.) on 
their membership functions j4(2) and μμ(α). Based on the previous requirements there is: 


C :— AN B and po(x) := min(ua(x),upg(x) | Vx € X, where: (5.3502) 
: . fa, ifa<b, 
min(a, b) :— ie ifa b. (5.350b) 


The intersection operation corresponds to the AND operation of two membership functions (Fig.5.72). 
The membership function uo (x) is defined as the minimum value of p4 (x) and μμ(α). 
2. Union of Two Fuzzy Sets 
The union AU B of two fuzzy sets is defined by the maximum operation max(., .) on their membership 
functions µα(α) and μμ(α): 

C :— AU B and pic(x) := max(ua(r),ug(v) | Voc € X, where: 


max(a, b) :— i gon d (5.351b) 


The union corresponds to the logical OR operation. Fig.5.73 illustrates jjc(x) as the maximum value 
of the membership functions µα() and g(x). 

WB Thet norm t(z, y) = min(z, y] and the s norm s(x, y) = max(z, y) define the intersection and the 
union of two fuzzy sets, respectively (see (Fig.5.74) and (Fig.5.75)). 


Loot uO) 
1 wx) 


1 


μείχ) 


Figure 5.74 Figure 5.75 


3. Further Aggregations 
Further aggregations are the bounded, the algebraic, and the drastic sumand also the bounded difference, 
the algebraic and the drastic product (see Table 5.8). 
The algebraic sum, e.g., is defined by 

C :— A+ B and uc(x) := a(x) + (2) — a(x): up(x) for every x € X. (5.352a) 
Similarly to the union (5.351a,b), this sum also belongs to the class of s norms. They are included in 
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Table 5.8 t and s norms, p € IR 


Author t norm s norm 
Zadeh intersection: t(x, y) = min(z, y} union: s(x, y) = max{x, y} 
Lukasiewicz bounded difference bounded sum 
f(x, y) = max(0, z + y — 1} s(x, y) = min(1, x + y) 
algebraic product algebraic sum 
ta(z,y) = xy Sa(X,y) =£ +y — xy 
drastic product drastic sum 
min(z, y}, whether x = 1 max(z, y), whether x = 0 
tay(v, y) = ory =1 Sas(@,Y) = ory =0 
0 otherwise 1 otherwise 
Hamacher 
vy zd y-cy-(1-p)y 
p>0 talz, y) = salz, y) = 
(p 2 0) Μαη) p+ (1— p)x-y- xy) eur. y) 1—(1— p)zy 
Σ . TY rcty 
Einste [98 SrA = — 7 e(z, y) = 
instein (x,y) ιτ ο τν] Se(x, y) (ga 
Frank tg(m,y) = ϑρ(α, Ὁ) — 1— 
z—] Hl pi-* 1 pi-¥ — 1 
(p > 0,p #1) | log, 14 μμ ο. log, [1 +4 (p Xp 
pl p-1 
Yager ty (2, y) = 1— Sys (1, y) = min (1, (x? + y)'/?) 
(p > 0) min (1. ((1— z)? +(1- yy)"7) 
Schweizer t,(r,y) = max(0, x^? + y^? — 1)-U» ss(r,y) = 1— 
(p > 0) max (0, (1 — x)? + (1 — y)? — 1) 
Dombi tao(@,y) = Sdo(X, y) = 1— 
/p) Γ᾽ mp) 7} 
1— r4? l-y ΗΝ x Ῥ y 
p>0 1+|(—*) 4 1 (5) 
(p ) | | x ( y ) t l-r + l-y 
Weber tu(x,y) = max(0, (1 + p) Sw(@,y) = min(l, x + y+ pry) 
(pz -1) (z +y- 1) — pry) 
Dubois tau(@,y) = url Sau(@,y) = 


max(z, y, p) 
r-cy-—zy-— min(z,y,(1— p)) 
max((1 — x), (1 — y). p) 
Remark: For the values of the { and s norms listed in the table, the following ordering is valid: 
tap < ty Ste Sta Sth X t X s € Sh € Sa € Se € Sy X Sas. 


(0€ p € 1) 


the right-hand column of Table 5.8. In Table 5.9 is given a comparision of operations in Boolean logic 
and fuzzy logic. 
Analogously to the notion of the extended sum as a union operation, the intersection can also be ex- 
tended for example by the bounded, the algebraic, and the drastic product. So, e.g., the algebraic 
product is defined in the following way: 

C :— A- B and µρ(α) := a(x) + a(x) for every x € X. (5.352b) 
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It also belongs to the class of t norms, similarly to the intersection (5.350a,b), and it can be found in 
the middle column of Table 5.8. 


5.9.2.3. Compensatory Operators 
Sometimes operators are necessary lying between the { and the s norms; they are called compensatory 
operators. Examples for compensatory operators are the lambda and the gamma operator. 
1. Lambda Operator 

μλλη(α) = A ar) )] + (1.--- A) [ia (x) + uele) — μα(α)μα(α)]) with λε [0,1]. (5.353) 
Case à = 0: Equation (5.353) results in a form known as the algebraic sum (Table 5.8, s norms); it 
belongs to the OR operators. 


Case \ = 1: Equation (5.353) results in the form known as the algebraic product (Table 5.8, t norms); 

it belongs to the AND operators. 

2. Gamma Operator 
Iano) = ἱμα(α)μμ(α)) ^ 1 — (1 — μα(ῳ)) 1 uele) with y € [0,1]. (5.854) 

Case y= 1: Equation (5.354) results in the representation of the algebraic sum. 


Case y= 0: Equation (5.354) results in the representation of the algebraic product. 
The application of the gamma operator on fuzzy sets of any numbers is given by 


μα) = n T h The 22) P (5.355) 


and with weights δι: 


μία) = n uic) i h - Πα - Td with r € X, P» —1, γε[θ,1]. (5.356) 


i=1 


5.9.2.4 Extension Principle 


In the previous paragraph there are discussed the possibilities of generalizing the basic set operations 
for fuzzy sets. Now, the notion of mapping is extended on fuzzy domains. The basis of the concept is the 
acceptance grade of vague statements. The classical mapping Φ: X" — Y assigns a crisp function value 
P(a1,..., En) € Y tothe point (z1,...,7,) € X". This mapping can be extended for fuzzy variables as 


follows: The fuzzy mapping is Φ: F(X)" + F(Y), which assigns a fuzzy function value d(u,, . . . , μι) 
to the fuzzy vector variables (11...) given by the membership functions (ji1,.... Hn) € F(X)”. 


5.9.2.5 Fuzzy Complement 


A function c: [0,1] — [0,1] is called a complement function if the following properties are fulfilled for 
V z,y € [0,1]: 


(EK1) Boundary Conditions: c(0) = 1 and c(1) = 0. (5.357a 
(EK2) Monotony: x< y => c(r) > ε(υ). (5.357b 
(EK3) Involutivity: c(c(x)) = x. (5.357€ 
(EK4) Continuity: c(x) should be continuous for every x € [0,1]. (5.357d 
W A: The most often used complement function is (continuous and involutive): 

ε(α) = 1-2. (5.358 


W B: Other continuous and involutive complements are the Sugeno complement cy(x) := (1—2)(1+ 
Ar) with À € (—1,00) and the Yager complement c,(x) := (1 — a?)"? with p € (0, oo). 
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Table 5.9 Comparison of operations in Boolean logic and in fuzzy logic 


Operator | Boolean logic | Fuzzy logic (ja, up € [0,1]) 
AND C=AAB Hane = min({04, ug) 

OR C=AVB Haug = Max(fla, μη) 

NOT C=-A uS =1-— ua (uS as complement of μα) 


5.9.3 Fuzzy-Valued Relations 
5.9.3.1 Fuzzy Relations 


1. Modeling Fuzzy-Valued Relations 
Uncertain or fuzzy-valued relations, as e.g. “approximately equal”, “practically larger than", or “prac- 
tically smaller than" , etc., have an important role in practical applications. A relation between numbers 


» 


is interpreted as a subsets of R2. So, the equality “=” is defined as the set 
A= (y) € R?|z =y}, (5.359) 


i.e., by a straight line y = x in R°. 

Modeling the relation “approximately equal” denoted by Γι, can be used a fuzzy subset on R?, the 

kernel of which is A. Furthermore it is to require that the membership function should decrease and 

tend to zero getting far from the line .A. A linear decreasing membership function can be modeled by 
Hg, (x,y)- max(0,1— a|r —y]] with a€ R, a» 0. (5.360) 

For modeling the relation Rə “practically larger than”, it is useful to start with the crisp relation “>”. 

The corresponding set of values is given by 


(ων) € ΒΡ]: < y}. (5.361) 
It describes the crisp domain above the line z — y. 
The modifier “practically” means that a thin zone under the half-space in (5.361) is still acceptable 
with some grade. So, the model of Rə is 


Lr, (η, y) = { Merto; 1 — αμ — yl} a ᾽ ` 2 with ae R,a>0. (5.362) 


If the value of one of the variables is fixed, e.g., y = yo, then Rə can be interpreted as a region with 

uncertain boundaries for the other variable. 

Handling the uncertain boundaries by fuzzy relations has practical importance in fuzzy optimization, 

qualitative data analysis and pattern classification. 

The foregoing discussion shows that the concept of fuzzy relations, i.e., fuzzy relations between several 

objects, can be described by fuzzy sets. In the following section the basic properties of binary relations 

are discussed over a universe which consists of ordered pairs. 

2. Cartesian Product 

Let X and Y be two universes. Their “cross product” X x Y, or Cartesian product, is a universe G: 
G=XxY={(x,y)\teXAYEY}. (5.363) 

Then, a fuzzy set on G is a fuzzy relation, analogously to classical set theory, if it consists of the valued 

pair of universes X and Y. A fuzzy relation R in G is a fuzzy subset R € F(G), where F(G) denotes the 

set of all the fuzzy sets over X x Y. R can be given by a membership function jig(x, y) which assigns 

a membership degree μµ(α, y) from [0, 1] to every element of (x, y) € G. 

3. Properties of Fuzzy- Valued Relations 

(E1) Since the fuzzy relations are special fuzzy sets, all propositions stated for fuzzy sets will also be 

valid for fuzzy relations. 

(E2) All aggregations defined for fuzzy sets can be defined also for fuzzy relations; they yield a fuzzy 
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relation again. 

(E3) The notion of a cut defined above can be transmitted without difficulties to fuzzy relations. 
(E4) The 0 cut (the closure of the support) of a fuzzy relation R € F(G) is a usual relation on G. 
(E5) Denoting the membership value by μπ(α, y), i.e., the degree by which the relation R between the 
pair (x,y) holds. The value pug(x, y) = 1 means that R holds perfectly for the pair (x, y), and the value 
µηία, y) = 0 means that R does not at all hold for the pair (x, y). 

(E6) Let R € F(G) be a fuzzy relation. Then the fuzzy relation S := R7', the inverse of R, is defined 
by 


Hs(x.y) = un(y. x) for every (x,y) € G. (5.364) 
E The inverse relation R5 ! means “practically smaller than” (see 5.9.3.1, 1., p. 422); the union ΓΙ Πο! 
can be determined as “practically smaller or approximately equal". 
4. n-Fold Cartesian Product 
Let n be the number of universal sets. Their cross product is an n-fold Cartesian product. A fuzzy set 
on an n-fold Cartesian product represents an n-fold fuzzy relation. 


Consequences: The fuzzy sets, considered until now, are unary fuzzy relations, i.e., in the sense of the 
analysis they are curves above a universal set. A binary fuzzy relation can be considered as a surface 
over the universal set G. A binary fuzzy relation on a finite discrete support can be represented by a 
fuzzy relation matrix. 

W Colour-ripe grade relation: The well-known correspondence between the colour x and the ripe grade 
y of a friut is modeled in the form of a binary relation matrix with elements (0, 1). The possible colours 
are X = {green, yellow, red} and the ripe grades are Y = {unripe, half-ripe, ripe}. The relation 
matrix (5.365) belongs to the table: 


| unripe halfripe ripe 


green |1 0 0 100 
yellow | 0 1 0 R= : ᾽ : (5.365) 
red 0 0 1 


Interpretation of this relation matrix: IF a fruit is green, THEN it is unripe. IF a fruit is yellow, 
THEN it is half-ripe. IF a fruit is red, THEN it is ripe. Green is uniquely assigned to unripe, yellow to 
half-ripe and red to ripe. If beyond it should be formalized that a green fruit can be considered half-ripe 
in a certain percentage, then the following table with discrete membership values can be arranged: 


Hr (green, unripe) — — 1.0, µη (green, half-ripe) = 0.5, The relation matrix with µη € [0,1] 
μμ (green, ripe) — 0.0, µη (yellow, unripe) =0.25, is: 

Lr (yellow, halfripe) = 1.0, µη (yellow, ripe) = 0.25, 1.0 0.5 0.0 

ur (red, unripe) — 0.0, µη (red, halfripe) =0.5, R= 0.25 1.0 0.25 |. (5.366) 
Lr (red, ripe) = 1.0. 0.0 0.5 1.0 


5. Rules of Calculations 
The AND-type aggregation of fuzzy sets, e.g. μι: X — [0,1] and μι: Y — [0,1] given on different 
universes is formulated by the min operation as follows: 
ur(x, y) = min(ja (x), uo(y)) or (pi x pao)(x, y) = min(ja(x), uo(y)) with (5.3672) 
Ia X po: G > [0,1], where G= X x Y. (5.367b) 
The result of this aggregation is a fuzzy relation R on the cross product set (Cartesian product universe 
of fuzzy sets) G with (x, y) € G. If X and Y are discrete finite sets and so µι (x), p2(y) can be represented 
as vectors, then holds: 


μι X pa = µι o ul and  ugai(cr,y):— ug(y,v) σαι) EG. (5.368) 


The aggregation operator o does not denote here the usual matrix product. The product is calculated 
here by the componentwise min operation and addition by the componentwise max operation. 
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The validity grade of an inverse relation R^ for the pair (x, y) is always equal to the validity grade of 
R for the pair (y, x). 

If the fuzzy relations are given on the same Cartesian product universe, then the rules of their aggre- 
gations can be given as follows: Let Ri, Ho: X x Y — [0,1] be binary fuzzy relations. The evaluation 
rule of their AND-type aggregation uses the min operator, namely for V(x, y) € G: 


HR nRa (x, y) = min(/R, (x, y). HRS (x, y))- (5.369 
A corresponding evaluation rule for the OR-type aggregation is given by the max operation: 
UR UR (2, y) = max(4r, (x, y), HR2 (x, y))- (5.370 


5.9.3.2 Fuzzy Product Relation Ro S 


1. Composition or Product Relation 
Suppose R € F(X x Y) and S € F(Y x Z) are two relations, and it is additionally assumed tha 
R,S € F(G) with G C X x Z. Then the composition or the fuzzy product relation R o S is: 

µποσία, 2) := supyey (min(un(m, y), us(y, z))) V (a, 2) € X x Z. (5.371 
If a matrix representation is used for a finite universal set analogously to (5.366), then the composition 
Ro S is motivated as follows: Let X = (zi,...,z4, Y = {m,---,Ym},Z = {a,...,a} and R € 
F(X x Y), S € F(Y x Z) and let the matrix representations R, S be in the form R = (rij) and 
S = (si) fori = 1,... n; j =1,...,m;k =1,...,1, where 


rj = un(vi,yj) and sin = ns(yj, zi). (5.372) 
If the composition T = Ro S has the matrix representation t;,, then 
tik = sup min(rij, μμ}. (5.373) 
J ja 8; 


The final result is not a usual matrix product, since instead of the summation operation there is the 

least upper bound (supremum) operation and instead of the product there is the minimum operator. 

Bl With the representations for r;; and sj, and with (5.371), the inverse relation R^! (r;;;)*, can also 

be computed taking into consideration that R7! can be represented by the transpose matrix, i.e., 
CY D fa AT 

R= = (ry): 

Interpretation: Let R be a relation from X to Y and S be a relation from Y to Z. Then the following 

compositions are possible: 

a) Ifthe composition Ro S of R and S is defined as a max-min product, then the resulted fuzzy compo- 

sition is called a max-min composition. The symbol sup stands for supremum and denotes the largest 

value, if no maximum exists. 


b) If the product composition is defined as the usual matrix multiplication, then the max-prod com- 
position is obtained. 

c) For max-average composition, “multiplication” is replaced by the average. 

2. Rules of Composition 


The following rules are valid for the composition of fuzzy relations R, S, T € F(G): 
(E1) Associative Law: 


(RoS)oT — Ro(SoT). (5.374 
(E2) Distributive Law for Composition with Respect to the Union: 

Ro(SUT)=(RoS)U(ROT). (5.375 
(E3) Distributive Law in a Weaker Form for Composition with Respect to Intersection: 

Ro(SnT)C(RoS)n(RoT). (5.376 
(E4) Inverse Operations: 

(RoS)!—-S'!om, (RUS)!-R'!US' and (RNS) =R NnS, (5.377 
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(E5) Complement and Inverse: 
(απ, (m)'-(my. (5.378) 


(E6) Monotonic Properties: 
RCS-RoTCSoT und TORCTOS. (5.379) 


W A: Equation (5.371) for the product relation Ro S is defined by the min operation as we have done 
for intersection formation. In general, any t norm can be used instead of the min operation. 

E D: Theo cuts with respect to the union, intersection, and complement are: (AU B)^^ = A**UB^^, 
(An Β}75 -A?*nBg?*, (ΑΌτα- AS? = (x € X|ua(x) € 1— aj. Corresponding statements 
are valid for strong a cuts. 

3. Fuzzy Logical Inferences 

It is possible to make a fuzzy inference, e.g., with the IF THEN rule by the composition rule fg = p0 R. 
The detailed formulation for the conclusion μο is given by 


μα(υ) = maxs;ex (min(ui (2), ur(x,y))) (5.380) 
with y € Y, μι: X > [0,1], wo: Y — [0,1], R: G— [0,1] und G = X x Y. 


5.9.4 Fuzzy Inference (Approximate Reasoning) 
Fuzzy inference is an application of fuzzy relations with the goal of getting fuzzy logical conclusions with 
respect to vague information (see 5.9.6.3, p. 428). Vague information means here fuzzy information 
but not uncertain information. Fuzzy inference, also called implication, contains one or more rules, a 
fact and a consequence. Fuzzy inference, which is called by Zadeh, approximate reasoning, cannot be 
described by classical logic. 
1. Fuzzy Implication, IF THEN Rule 
The fuzzy implication contains one IF THEN rule in the simplest case. The IF part is called the premise 
and it represents the condition. The THEN part is the conclusion. Evaluation happens by jug µιο R 
and (5.380). 
Interpretation: μο is the fuzzy inference image of μι under the fuzzy relation R, i.e., a calculation 
prescription for the IF THEN rule or for a group of rules. 
2. Generalized Fuzzy Inference Scheme 
The rule IF A; AND Ay AND Αχ... AND A, THEN B with A;: μι: X; > [0,1] (i = 1,2,..., n) and 
the membership function of the conclusion B: µ: Y — [0,1] is described by an (n + 1)-valued relation 
R: Xı x Xo x X4 x Y > [0,1]. (5.381a) 
For the actual input with crisp values 21,25, ..., 27, the rule (5.381a) defines the actual fuzzy output 
by 
uB (y) = nna(zi x5,..., a, y) = min(ji(z,), uo(25),..., s (m5), up(y)) where y € Y. (5.381b) 
Remark: The quantity min(yi (x1), Mo(25), ... 15 (27,)) is called the degree of fulfillment, and the 
quantities (4a (1), ua(25), -..., s (27,)) represent the fuzzy-valued input quantities. 
W Forming the fuzzy relations for a connection between the quantities “medium” pressure and “high” 
temperature (Fig. 5.76): (p, T) = pu(p) VT € Xə with μι: X4 > [0,1] is a cylindrical extension 
(Fig. 5.76c) of the fuzzy set medium pressure (Fig. 5.76a). Analogously, ña(p, T) = ματ) Vp € 
X; with μι: Χο — [0,1] is a cylindrical extension (Fig. 5.76d) of the fuzzy set high temperature 
(Fig. 5.76b), where jii, fiz: G = X1 x X» > [0,1]. 
Fig. 5.77a shows the graphic result of the formation of fuzzy relations: In Fig. 5.77b the result of the 
composition medium pressure AND high temperature with the min operator ug(p, T) = min(ja(p), 
μα(Τ)) is represented, and (Fig. 5.77b) shows the result of the composition OR with the max operator 
µην, T) = max(pus (p), μα(Τ)). 
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Figure 5.77 
5.9.5 Defuzzification Methods 
One has to get a crisp set from a fuzz-valued set in many cases. This process is called defuzzification. 
There are different methods available for this task. 


1. Maximum-Criterion Method 


membership degree. 


2. Mean-of-Maximum Method (MOM) 
'The output value is the mean value of the maximal membership values: 


T Output —_ 
sup (ugh ) = 


{y € Y lta (Y) Σ Haren (Y) Vy" ΕΥ}; (5.382) 


cn 


y di 
Le Output | YOY 


Baayen 


MOM = .(5.383) 
i.e., the set Y is an interval, which should not be empty and |, esup(ustest. ) dy 
it is characterized by (5.382), from which follows (5.383). | 
3. Center of Gravity Method (COG) MR u(y)y dy 
In the center of gravity method, one takes the abscissa ncog = Lt . (5.384) 


value of the center of gravity of a surface with a fictitious 
homogeneous density of value 1. 


Ysup 
/ u(y) dy 
Y Yinf 
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4. Parametrized Center of Gravity Method Ysup i 
(PCOG) / μ(υ)ν dy 
The parametrized method works with the exponent y € R. TPCOG = E . (5.385) 
From (5.385) it follows for y = 1, "coa = rjcoc and for f : u(y)? dy 
1» 0, "coc = mom- Ἢ 
5. Generalized Center of Gravity Method 
(GCOG) 
The exponent y is considered as a function of y in the Ysup 
PCOG method. Then (5.386) follows obviously. The / μι) y dy 
GCOG method is a generalization of the PCOG method, "Gcoa = mL (5.386) 
where u(y) can be changed by the special weight y depend- / πο ® dy 
Yinf 


ing itself on y. 

6. Center of Area (COA) Method f 
One calculates a line parallel to the ordinate axis so that / { μυ) dy = / i μη) dy. (5.387) 
the area under the membership function is the same on the : © dn pose 

left- and on the right-hand side of it. 


7.Parametrized Center of Area (PCOA) "PB, n AE LR ic dis 
Method i u(y)’ dy / u(y)? dy. (5.388) 


Ying 
8. Method of the Largest Area (LA) 
The significant subset is selected and one of the methods defined above, e.g., the method of center of 
gravity (COG) or center of area (COA) is used for this subset. 


Vint 


NPF 


5.9.6 Knowledge-Based Fuzzy Systems 

There are several application possibilities of multi-valued fuzzy logic, based on the unit interval, both 
in technical and non-technical life. The general concept consists in the fuzzification of quantities and 
characteristic numbers, in the aggregation them in an appropriate knowledge base with operators, and 
if necessary, in the defuzzification of the possibly fuzzy result set. 


5.9.6.1 Method of Mamdani 
The following steps are applied for a fuzzy control process: 
1. Rule Base Suppose, for example, for the i-th rule 

R’ : If eis E! AND éis AE! THEN u is U’. (5.389) 
Here e characterizes the error, é the change of the error and u the change of the (not fuzzy valued) output 
value. Every quantity is defined on its domain E, AE and U. Let the entire domain be E x AE x U. 
The error and the change of the error will be fuzzified on this domain, i.e., they will be represented by 
fuzzy sets, where linguistic description is used. 
2. Fuzzifying Algorithm In general, the error e and its change é are not fuzzy-valued, so they must 
be fuzzified by a linguistic description. The fuzzy values will be compared with the premisses of the 
IF THEN rule from the rule base. From this it follows, which rules are active and how large are their 
weights. 
3. Aggregation Module The active rules with their different weights will be combined with an 
algebraic operation and applied to the defuzzification. 
4. Decision Module In the defuzzification process a crisp value should be given for the control 
quantity. With a defuzzification operation, a non-fuzzy-valued quantity is determined from the set of 
possible values, i.e., a crisp quantity. This quantity expresses how the control parameters of the system 
should be set up to keep the deviation minimal. 
Fuzzy control means that the steps from 1. to 4. are repeated until the goal, the smallest deviation e 
and its change €, is reached. 
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5.9.6.2 Method of Sugeno 


The Sugeno method is also used for planning of a fuzzy control process. It differs from the Mamdani 
concept in the rule base and in the defuzzification method. It has the following steps: 
1. Rule Base: The rule base consists of rules of the following form: 
RÈ: IF αι is ΑἹ AND ... AND ay is AL, THEN u; = pi + pixi + poxo +e + pix. (5.390) 
The notations mean: 
Aj: fuzzy sets, which can be determined by membership functions; 
xj: crisp input values as, e.g., the error e and the change of the error é, which tell us something about 
. the dynamics of the system; 
pi: weights ofa; (j =1,2,...,h); 


τη: the output value belonging to the i-th rule (i = 1,2,...,n). 

2. Fuzzifying Algorithm: A ji; € [0, 1] is calculated for every rule R'. 

3. Decision Module: A non-fuzzy-valued quantity is calculated from the weighted mean of u;, where 
the weights are u; from the fuzzification: 


n n = 
E uM (Σ 2 l (5.391) 
i=1 i=1 


Here u is a crisp value. 

The defuzzification of the Mamdani method does not work here. The problem is to get the weight 
parameters p} available. These parameters can be determined by a mechanical learning method, e.g., 
by an artificial neuronetwork (ANN). 


5.9.6.3 Cognitive Systems 
To clarify the method, the following known example will be investigated with the Mamdami method: 
The regulation of a pendulum that is perpendicular to its moving base (Fig. 5.78). The aim of the 
control process is to keep a pendulum in balance so that the pendulum rod should stand vertical, i.e., 
the angular displacement from the vertical direction and the angular velocity should be zero. It must 
be done by a force F acting at the lower end of the pendulum. This force is the control quantity. The 
model is based on the activity of a human “control expert" (cognitive problem). The expert formulates 
its knowledge in linguistic rules. Linguistic rules consist, in general, of a premise, i.e., a specification of 
the measured values, and a conclusion which gives the appropriate control value. 

For every set of values X1, X2,..., Xn for the measured values and Y for the control quantity the appro- 
priate linguistic terms are defined as “approximately zero", “small positive", etc. Here “approximately 
zero" with respect to the measured value ξι can have a different meaning as for the measured value ξο. 
E Inverse Pendulum on a Moving Base (Fig. 5.78) 

1. Modeling For the set X, (values of angle) and analogously for the input mv! 

quantity X» (values of the angular velocity) the seven linguistic terms, negative 
large (nl), negative medium (nm), negative small (ns), zero (z), positive small 
(ps), positive medium (pm) and positive large (pl) are chosen. 


F 
For the mathematical modeling, a fuzzy set must be assigned by graphs to every ———924-2——— 
one of these linguistic terms (Fig. 5.77), as was shown for fuzzy inference (see 


5.9.4, p. 425). Figure 5.78 


2. Choice of the Domain of Values 

ο Values of angles: O(—90° < © < 90°): X, := [—90°, 90°]. 

e Values of angular velocity: O(—45?s7! < È < 45?s7!): X := [-45? s71, 45? s71]. 
ο Values of force F: (C10N € F € 10N): Y := [-10N, 10 N]. 
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The partitioning of the input quantities X; and X» and the output quantity Y is represented graphically 
in Fig. 5.79. Usually, the initial values are actual measured values, e.g., © = 36°, Ò = —2.25^ s^. 
μίθ) μ(θ) μία 
1 1 1 
> > > 
nl nm 45 pmpl 0 nl 8 ΡΙ ὁ nl 5 pl Ε 


Figure 5.79 
3. Choice of Rules Considering the following table, there are 49 possible rules (7 x 7) but there are T 
only 19 important in practice, so the following two are to be discussed: R1 and R2. 


R1: If O is positive small (ps) and © zero (z), then F is positive small (ps). For the degree of fulfillment 
(also called the weight of the rules) of the premise with a = min {μῶ(ο); μο(ϐ)) = min{0.4;0.8} = 


0.4 one gets the output set (5.392) by an a cut, hence the output fuzzy set is positive small (ps) in the 
height a = 0.4 (Fig. 5.80c). 
Table: Rule base with 19 practically meaning- 
ful rules " 

ONO | nl | nm | ns | z ps | pm | pl 57 0S yc 

nl ps | pl 0.4 1<y<4, 

nm pm ο R Eb d 

ns nm ns | ps Ls, 7228 P (y) = 2 (5.392) 

Ζ nl | nm | ns z ps | pm | pl 2— 5! 4<yS5, 

ps ns | ps pm . 

pm na 0 otherwise. 

pl nl | ns 

: z 1 25< 4 
R2: If O is positive medium (pm) and O is 5457 a = 
zero (z), then F is positive medium (pm). 0.6 4<y<6 
For the performance score of the premise fol- Output (R2) ` παπί 
ni 2) (QE i ; H3s;:=223 (Y) = 2 (5.393) 

lows a = min [uf (O); u 6) = min{0.6; 3-fy 6«y&T5, 
0.8) — 0.6, the output set (5.393) analogously 9 
to rule R1 (Fig. 5.80f). 0 otherwise. 


4. Decision Logic The evaluation of rule R, with the min operation results in the fuzzy set in 
Figs. 5.80a-c. The corresponding evaluation for the rule Rg is shown in Figs. 5.80d-—f. The control 
quantity is calculated finally by a defuzzification method from the fuzzy proposition set (Fig. 5.80g). 
The result is the fuzzy set (Fig. 5.80g) by using the max operation and taking into account the fuzzy 
sets (Fig. 5.80c) and (Fig. 5.80f). 
a) Evaluation of the fuzzy set obtained in this way, which is aggregated by operators (see max-min 
composition 5.9.3.2, 1., p. 424). 'The decision logic yields: 
r : 1 D 
ad :Y 2 [0,1];y > maxre{i,...x} {min (uf ) (x1), -- „uÉ? (En), ui(y)}} : (5.394) 


ve n 
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b) After taking the maximum (5.395) is ob- ο or O<y<1, 
tained for the function graph of the fuzzy set. 5 ι 
ο) For the other 17 rules results a degree of ful- 4 or 1<y<3.5, 
fillment equal to zero for the premise, i.e., it re- 9 
sults in fuzzy sets, which are zeros themselves. Cy—-1lfo 35<y<4 
å " por PER Output ( ) = 5° oe , ( 395) 
5. Defuzzification The decision logic yields #36;—2.25 Y 9.959 
no crisp value for the control quantity, but 6 or 4<y<6, 
a fuzzy set. That means, by this method, " 
| ds : jpapping, d assigns a πῶς set 3-Cyfor 6<y<75, 
peep of Y to every tuple (z1,..., 24) € X1x £ 
Xa X ++ X Xn of the measured values. ) or otherwise. 


Defuzzification means that there is to determine a control quantity using defuzzification methods. 


The center of gravity method and the maximum criterion method result in the value for control quantity 
F-395or F —5.0. 
μίθ) 


μίθ) i» με) 


10 


min operation 


— 
0.4 


> » > 
22536 9 1550515 2.5 F 
a) positive small b) C) positive small 
Ὁ (a) F u(F) 
102 μίθ) 1012 0 


0.6 
min operation 


max operation 
— 


> 
"36 45 6 225 ὁ 50 F É 
d) positive medium €) zero f) positive medium g) control quantity 
6. Remarks Figure 5.80 


1. The “knowledge-based” trajectories should lie in the rule base so that the endpoint is in the center 
of the smallest rule deviation. 


2. By defuzzification an iteration process is introduced, which leads finally to the center of the partition 
space, i.e., which results in a zero control quantity. 

3. Every non-linear domain of characteristics can be approximated with arbitrary accuracy by the 
choice of appropriate parameters on a compact domain. 

5.9.6.4 Knowledge-Based Interpolation Systems 


1. Interpolation Mechanism 
Interpolation mechanisms can be built up with the help of fuzzy logic. Fuzzy systems are systems 
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to process fuzzy information. With them it is possible to approximate and interpolate functions. A 
simple fuzzy system, by which this property can be investigated , is the Sugeno controller. It has n 
input variables £, . . . , En and defines the value of the output variable y by rules R,,...,R, in the form 


Βι: IF & is A and --- and &, is AM, THEN is y = f,(&,...,€:) (i =1,2,...,n). (5.396) 


The fuzzy sets AM, - , APP always partition the input sets X;. The conclusions f;(&,...,&,) of the 
rules are singletons, which can depend on the input variables £1, .. ., n. 


By a simple choice of the conclusions the expensive defuzzification can be omitted and the output value 
y will be calculated as a weighted sum. Το do this, the controller calculates a degree of fulfillment a; 
for every rule R; with a t norm from the membership grades of the single inputs and determines the 
output value 

N 

iz αἱ fi(E1,---,&n 

ý Eu dien En) (5.397) 
PD 195 

2. Restriction to the One-Dimensional Case 
For fuzzy systems with only one input x = ξι, fuzzy sets represented by triangular functions are often 
used which are cut at the height 0.5. Such fuzzy sets satisfy the following three conditions: 


1. For every rule R; there is an input z;, for which only one rule is fulfilled. For this input z;, the output 
is calculated by f;. By this, the output of the fuzzy system is fixed at N nodes z,,...,xyw. Actually, 
the fuzzy system interpolates the nodes z;,...,xw. The requirement that at the node x; only one rule 
R; holds, is sufficient for an exact interpolation, but it is not necessary. For two rules Rı and Ra, as 
they will be considered below, this requirement means that a)(%2) = a»(x1) = 0 holds. To fulfill the 
first condition, o4 (ο) = as(z1) = 0 must hold. This is a sufficient condition for an exact interpolation 
of the nodes. 
2. There are at most two rules fulfilled between two consecutive nodes. If x; and xo are two such nodes 
with rules R; and Rs, then for inputs x € [x1, x2] the output y is 
a(x) fi (c) + a2(x) fo(x) ith o:— a(x) 
fil) + οί) [f2(x) — fi(x)] with g = —— — —. 
alx) + a(x) αι(α) + a»(x) 
The actual shape of the interpolation curve between x; and x» is determined by the function g. The 


shape depends only on the satisfaction grades αι and az, which are the values of the membership func- 
tions Haw and Hye) at the point x, i.e., ay = pao (x) and ag = μαω(α) are valid, or in short form 


(5.398) 


αι µια) and ay = u(x). The shape of the curve depends only on the relation 44/1 of the mem- 
bership functions. 

3. The membership functions are positive, so the output y is a convex combination of the conclusions 
fi. For the given and for the general case hold (5.399) and (5.400), respectively: 


min(fi f2) < y < max(fi.fo, — (5399 — MIA A SY S max, fi (9400) 


For constant conclusions, the terms fı and f» cause only a translation and stretching of the shape of 
the curve g. If the conclusions are dependent on the input variables, then the shape of the curve is 
differently perturbed in different sections. Consequently, another output function can be found. 


Applying linearly sependent conclusions and membership functions with constant sum for the input 
x, then the output is y — c 2 o (x) f(x) with a; depending on x and a constant c, so that the interpo- 


lation functions are polynomial of second degree. These polynomials can be used for the construction 
of an interpolation method with polynomials of second degree. 

In general, choosing polynomials of n-th degree, an interpolation polynomial of (n + 1)-th degree is 
obtained as a conclusion. In this sense fuzzy systems are rule-based interpolation systems besides con- 
ventional interpolation methods interpolating locally by polynomials, e.g., with splines. 


6 Differentiation 


6.1 Differentiation of Functions of One Variable 
6.1.1 Differential Quotient 


1. Differential Quotient or Derivative of a Function 


f (zo + Δα) — f (z0) 


The differential quotient of a function y = f(x) at πρ is equal to Jim, if this limit 
d» 


g 

exists and is finite. The derivative function of a function y = f(x) with respect to the variable x is 
d df (x 

another function of x denoted by the symbols y', y, Dy, oe f'(x), Df (x), or ^ ) , and its value for 
dx dx 


every x is equal to the limit of the quotient of the increment of the function Ay and the corresponding 
increment Ax for Ax — 0, if this limit exists: 
A f 2 + Δα) -- f(a 
X f'(x)-— lim f ) - f(z) : 


Δα-»0 Ax 


(6.1) 


2. Geometric Representation of the Derivative 
Ify — f(x) is represented as a curve in a Cartesian coordinate 
system as in Fig. 6.1, and if the z-axis and the y-axis have the 
same unit, then 

f(x) = tana (6.2) 
is valid. The angle a between the z-axis and the tangent line of 
the curve at the considered point defines the angular coefficient or 
slope of the tangent (see 3.6.1.2, 2., p. 245). The angle is measured 
from the positive x-axis to the tangent in a counterclockwise di- 
Figure 6.1 rection, and it is called the angle of slope or angle of inclination. 


3. Differentiability 
From the definition of the derivative it obviously follows that f(x) is differentiable with respect to x 
for the values of x where the differential quotient (6.1) has a finite value. The domain of the derivative 
function is a subset (proper or trivial) of the domain of the original function. If the function is contin- 
uous at z but the derivative does not exist, then perhaps there is no determined tangent line at that 
point, or the tangent line is perpendicular to the x-axis. In this last case the limit in (6.1) is infinity. 
For this case is used the notation f'(x) = +00 or —oo. 


y 


a) 


Figure 6.2 
'(0) — oo. At the point 0 the limit (6.1) tends to infinity, so the 


MA: f(x) = ὑπ: f) = m f 


derivative does not exist at the point 0 (Fig. 6.2a). 


1 

E B: f(x) = vsin— fora z 0. At the point x = 0 the function f(x) is not defined, but it has zero 
t 

limit, so one writes f (0) = 0. However the limit (6.1) does not exist at x = 0 (Fig. 6.2b). 
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4. Left-Hand and Right-Hand Differentiability 
If the limit (6.1) does not exist for a value x = a, but the left-hand 
y limit or the right-hand limit exists, this limit is called the left-hand 
΄ derivative or right-hand derivative respectively. If both exist, the 
curve has two tangents here: 
f'(a — 0) = tanos, f'(a 4- 0) = tan as. (6.3) 
ὴ Geometrically this means that the curve has a knee (Fig. 6.26, 


A Fig. 6.3). 


[1 A 


7 e T 
0 x Β /(ι) -- τ for x z 0. For x = 0 the function is not defined, 
+ez 
Figure 6.3 but it has zero limit at 2; = 0, so one writes f(x) = 0. At the point 


x = 0 there is no limit of type (6.1), but there is a left-hand and a right-hand limit f'(—0) = 1 and 
f'(+0) = 0, i.e., the curve has a knee here (Fig. 6.2c). 


6.1.2 Rules of Differentiation for Functions of One Variable 


6.1.2.1 Derivatives of the Elementary Functions 
'The elementary functions have a derivative on all their domains except perhaps some points, as repre- 
sented in Fig. 6.2. 


A summary of the derivatives of elementary functions can be found in Table 6.1. Further derivatives 
of elementary functions can be found by reversing the results of the indefinite integrals in Table 8.1. 


Remark: In practice, it is often useful to transform the function into a more convenient form to per- 
form differentiation, e.g., to transform it into a sum where parentheses are removed (see 1.1.6.1, p. 11) 
or to separate the integral rational part of the expression (see 1.1.7, p. 14) or to take the logarithm of 
the expression (see 1.1.4.3, p. 9). 
; 1 2 
2-3/e+47/r+2? 2 .. Er d 
MA: y= UR = ο πο -- 


ax 
x 


T HT; 
ae dx 


α3-1 1 2 1 2 dy 1 2x 
BB: y= ID) a =i ο Inte + 1) g n(x 1); mE (3 x η) 


6.1.2.2 Basic Rules of Differentiation 

Assume u, v, w, and y are functions of the independent variable x, and wu’, v/, w', and y’ are the deriva- 
tives with respect to x. The differential is denoted by du, dv, dw, and dy (see 6.2.1.3, p. 446). The basic 
rules of differentiation, which are explained separately, are summarized in Table 6.2, p. 439. 


1. Derivative of a Constant Function 
The derivative of a constant function c is the zero function: 
6-0. (6.4) 
2. Derivative of a Scalar Multiple 
A constant factor c can be factored out from the differential sign: 
(cu) =cu', d(cu) = cdu. (6.5) 
3. Derivative of a Sum 
If the functions u, v, w, etc. are differentiable one by one, their sum and difference is also differentiable, 
and equal to the sum or difference of the derivatives: 
(αυ wy =u tv — w, (6.6a) 
d(u +v — w) = du + dv — dw. (6.6b) 
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6. Differentiation 


It is possible that the summands are not differentiable separately, but their sum or difference is. 
the derivative must be calculated by definition formula (6.1). 


Tab 


Then 


e 6.1 Derivatives of elementary functions in the intervals on which they are defined and the 
occurring numerators are not equal to zero 
Function Derivative Function Derivative 
C (constant) 0 secx = 
cos? x 
— cos x 
T 1 cosec x —35 
sin? 2; 
r” (nc R) fip arcsinx (|a| < 1) Ξ 
1 1 i 
T (x £0) -z (x #0) arccosx (|r]| « 1) | — > 
1 1 
— (x #0) 2: arctan 2; rz: 
1 1 
ντ (x > 0) να arccot x προ 
Vi (ne R, n0, z» 0) ! (251) : 
x (n ,n #0, 2 —— arcsec z (x 
f | Vn ανα i 1 
e” e? arccosec x (r1 = 
( ) ryz? — 1 
e" (bE R) beh sinh x cosh x 
a” (a> 0) a” lna cosh x sinh x 
1 
a* (ΦΕΒ, a>0) ba Ina tanh x —;- 
cosh” x 
1 
Ina (x > 0) = cotha (x 40) BM 
T sinh^r 
l (a>0,a41, «>0) ; ] 1 Arsinl : 
og,r (a>0,a#1, a — log, € = —— rsinh x ——— 
Ba ᾿ ᾿ g ρα sla V14+ 2? 
1 0.4343 1 
Iga (x0) —lge x ——— Arcoshx (x > 1) : 
x x αὲ-1 
1 
sin x COS x Artanhz (|v| « 1) - 
l= 
1 
sg — sint Arcothz (|z| » 1) | — 
COS x sin x rcothz (|x| > 1) TE 
1 , 
tang (rz (2-17. keZ)| —,— = sec? x (f(x) (n € R) n[f (z)]" f(E) 
cos?zr 
stotku k zi —À yk ix f(x) 
cotz (rz km, k € Z) —,— = —cosec^z | In f(x) (f(x) > 0) 
sin“ gx f(x) 


4. Derivative of a Product 
If two, three, or n functions are differentiable one by one, then their product is differentiable, and can 
be calculated as follows: 


a) Derivative of the Product of Two Functions: 


(uv) 2 uv uv', 


d(uv) = vdu + u dv. 


(6.7a) 


6.1 Differentiation of Functions of One Variable 435 


It is possible that the terms are not differentiable separately, but their product is. Then the derivative 
must be calculated by definition formula (6.1). 


b) Derivative of the Product of Three Functions: 
(uvw)' =u vw+uvw+uvw, d(uvw)-vwdu- uw dv + uvdw. (6.7b) 


c) Derivative of the Product of n Functions: 


n 
(ty us = tug ht (Οτο) 
i—l 


MA: y=x%cosx, y —3a?cosz — ai? sin x. 
BB: y —a?)c^cosz, y'= 32e" cosx + z?e* cosa — xe" sin x. 
5. Derivative of a Quotient 


If both u and v are differentiable, and v(x) Æ 0, their ratio is also differentiable: 


uN vu —uwv u v du — u dv 
T . d E . 
υ υ υ v? 


sing — ,  (cosz)(sinz)' — (sina)(cosa)! _ cos? x + sin? x 1 


E -= tanzt = ; : 
cosa” cos? x cos? a cos? a 


6. Chain Rule 
The composite function (see 2.1.5.5, 2., p. 61) y = u(v(x)) has the derivative 
dy j ; du dv 
= =u(v) u(x“) = —— 
da: VERA) dv dx’ 
where the functions u — u(v) and v — v(x) must be differentiable functions with respect to their own 
variables. u(v) is called the exterior function, and v(x) is called the interior function. According to 


(6.9) 


u dv 
this, E is the exterior derivative and dn is the interior derivative. It is possible that the functions u 

υ η 
and v are not differentiable separately, but the composite function is. Then one gets the derivative by 
the definition formula (6.1). 
Similarly one has to proceed if there is a longer “chain”, i.e., in the case of a composite function of 
several intermediate variables. For example for y = u(v(w(x))): 

, dy du dv dw 

jj m -— — E M 

y dr dv dw dx 

ΠΡ 
d (eis à, "m . 
MA: y= E x dy d (sin a) d (sin x) 
^o dg d (sin? r) d (sin x) da: 


„tan γα 
ay (A) aem am wi ἡ 
dx d(tan/z) | d(\/x) dx ! cos? γα ον. 
7. Logarithmic Differentiation 
If y(x) > 0 holds, one can calculate the derivative y' starting with the function In y(x), whose derivative 
(considering the chain rule) is 


d(ln y(x)) l. j 


(6.10) 


und. 
eN T3 sin y cos x. 


E B: y=ctan ντ. 


Ξ ; 11 
da yz)" 16: 
From this rule 
d(In y(x 
y = yx) (In y(1)) (6.12) 


dx 
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follows. 

Remark 1: With the help of logarithmic differentiation it is possible to simplify some differentiation 
problems, and there are functions such that this is the only way to calculate the derivative, for instance, 
when the function has the form 


y- u(z y") with u(x) > 0. (6.13) 
The logarithmic differentiation of this equality follows from the formula (6.12) 
1 (1n u* αυ] vu' 
y =y dinu”) y (wau) w |v mnu — |. (6.14) 
da dx u 
y' 3a +2. 


B y = (2x - 1)", my Πρ + 1), 4 3ln(2z + 1) 4 
y 


ς 2x 
oy = 3(2” 3r : 
y —3(2x + 1) (ins +1) 1). 


Remark 2: Logarithmic differentiation is often used in the case to differentiate a product of several 
functions. 


: 1 
MA: y= vre singz, lny= 38 Ina + 4a + Insin z), 
y! ME Eds m) y-2 


| : 
y Dyg sin x 


vet sina 


ESI 
a 


y 1 1 
BB: y=uv, ny=mu+inv, % = =u + =v". From this identity it follows that y’ = (uv) = 
y u v 


v u’ +u v, so one gets the formula for the derivative of a product (6.7a) (under the assumption u, v > 0). 


ics g-— ll Μπ. aes ; uA 
B C: y , lny = lnu- Inv, = —u v'. From this identity it follows that γ' = (|—] = 
υ y u v υ 
u w  vu-uwv T" ey . ; 

--- = — — —— , which is the formula for the derivative of a quotient (6.8) (under the assumption 


v vw? v? 


8. Derivative of the Inverse Function 

If y = (x) is the inverse function of the original function y = f(x), then both forms y = f(a) and 
x = p(y) are equivalent. For every corresponding value of x and y such that f is differentiable with 
respect to x, and q is differentiable with respect to y, e.g., none of the derivatives is equal to zero, 
between the derivatives of f and its inverse function ¢ is valid the following relation: 


1 dy 1 
πα) = | IL. i 
F(x) e) ΟΥ de dr (6 5) 
dy 


Β The function y = f(x) = arcsin x for -1 < x < 1 is equivalent to the function x = y(y) = sin y for 
-π/2 < y < 7/2. From (6.15) it follows that 
1 1 1 1 


(siny) cosy γι η  vl-a?' 


9. Derivative of an Implicit Function 

Suppose the function y = f(x) is given in implicit form by the equation F(x, y) = 0. Considering the 
rules of differentiation for functions of several variables (see 6.2, p. 445) calculating the derivative with 
respect to x gives 


(arcsin £)” because cos y Z 0 for =r /2 < y < 1/2. 


—+—y'=0 andso y =-=, (6.16) 
Or Oy Fy 


if the partial derivative F, differs from zero. 
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Β The equation = EDT E = 1 of an ellipse with semi-axes a and b can be written in the form F(a, y) = 


b2 
2 + 7 — 1 = 0. For the slope of the tangent line at the point of the ellipse (x, y) one gets according 
io (6.16) 
ES 2 


10. Derivative of a Function Given in Parametric Form 
If a function y = f(x) is given in parametric form x = απ), y = y(t), then the derivative y’ can be 
calculated by the formula 


d E 
d = fr) - ὃ (6.17) 
dx i 
ly dx 
with the help of the derivatives y(t) = E and «(t) = T with respect to the variable t, if of course 


a(t) 7 0 holds. 
E Polar Coordinate Representation: If a function is given with polar coordinates (see 3.5.2.2, 3., 
p. 192) p = p(y), then the parametric form is 
r=p(p)cosy, y= p(p)sin g (6.18) 
with the angle i» as a parameter. For the slope y' of the tangent of the curve (see 3.6.1.2, 2., p. 245 or 
6.1.1, 2., p. 432) one gets from (6.17) 
dp 


y = μες = PSOR where p = —. (6.19) 
pcosq — psiny dp 
Remarks: 


1. The derivatives t, ἡ are the components of the tangent vector at the point (x(t), y(t)) of the curve. 
2. It is often useful to consider the complex relation: 
a(t) +iy(t) =2(t), t(t)+ 19) = z(t). (6.20) 
wt ; . it i(wt + 5) 
B Circular Movement: z(t) = re! (r,w const), ż(t) = riwe = rue ^*^ T 2 
vector runs ahead by a phase-shift 7/2 with respect to the position vector. 


. The tangent 


11. Graphical Differentiation 

If a differentiable function y = f(x) is represented by its curve I in the 
Cartesian coordinate system in an interval a < x « b, then the curve I” 

of its derivative can be constructed approximately. The construction of a 
tangent estimated by eye is pretty inaccurate. However, if the direction 

of the tangent ΛΙΝ (Fig. 6.4) is given, then one can determine the point 

of contact Α more precisely. 

1. Construction of the Point of Contact of a Tangent 

One draws two secants M,N, and Λίο Νο parallel to the direction MN of 
the tangent so that the curve is intersected in points being not far from M 
each other. Then there are to be determine the midpoints of the secants, 
and a straight line through them must be drawn. This line PQ intersects 
the curve at the point A, which is approximately the point, where the tan- Figure 6.4 

gent has the given direction M N. Το check the accuracy, one draws a third line close to and parallel to 
the first two lines, and the line PQ should intersect it at the midpoint. 

2. Construction of the Derivative Curve 


a) Choose some directions l4, l2,..., In which could be the directions of some tangents of the curve 
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y = f(x) in the considered interval as in Fig. 6.5, and determine the corresponding points of contact 


A, A2, ..., An, where the tangents themselves must not be constructed. 
b) Choose a point P, a *pole", on the negative side of 1 Ay 
the x-axis, where the longer the segment PO = a, the 1 
flatter the curve is. L 1 

c) Draw the lines through the pole P parallel to the di- i 
rections [4,[5, ... ln, and denote their intersection points B 
with the y-axis by By, Bo, ... Bn. "ac 

d) Construct the horizontal lines B1C1, B3C5, ..., B, C, k 
through the points Βι, B2,..., Bn to the intersection 

points ΟἹ, C5, ..., Cn with the orthogonal lines from the 

points Αι. A», ..., An. P 


e) Connect the points C),C2,...,C, with the help of a 
curved ruler. The resulting curve satisfies the equation 
y = af'(x). If the segment a is chosen so that it cor- 
responds to the unit length on the y-axis, then the curve 
one getsis the curve ofthe derivative. Otherwise, one has 
to multiply the ordinates of C1, C, ..., Cn by the factor 
1/a. The points Di, D»,..., Dn given in Fig. 6.5 are on 


the correctly scaled curve I" of the derivative. Figure 6.5 


6.1.3 Derivatives of Higher Order 
6.1.3.1 Definition of Derivatives of Higher Order 


1 (d 
The derivative of y = f'(x), which means (ψ')' or E (2). is called the second derivative of the 
x dr 
; x ay ai d*f(v) |. hus 
function y = f(x) and it is denoted by y", ἢ. d f" (x) or 1 5; . Higher derivatives can be defined 
dx x 


analogously. The notation for the n-th derivative of the function y = f(x) is: 


y™ = i f(x) = Lari) (n —0,1,... ; (x) = f (2) = f()) : (6.21) 


da” da” 
6.1.3.2 Derivatives of Higher Order of some Elementary Functions 
The n-th derivatives of the simplest functions are collected in Table 6.3. 


6.1.3.3 Leibniz’s Formula 


To calculate the n-th-order derivative of a product of two functions, the Leibniz formula can be used: 


n n=l a , 
D” (uv) =uD"v +7 Du D"-ly + phu Dw +... 
! n(n — 1)... e —m+1) Du D^7"y +--+ D uv (6.22) 
m! 
m 
Here the notation D” = — is used. If D°u is replaced by u and D°v by v, then one gets the formula 


dx” 
(6.23) whose structure corresponds to the binomial formula (see 1.1.6.4, p. 12): 


D'(uv) = Σ ML (6.23) 


m=o AT 
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Table 6.2 Differentiation rules 


Expression Formula for the derivative 
Constant function c —0 (c const) 
Constant multiple (cu)! 2 σα (c const) 


Sum 


/ 


(ux v) =u tv 


Product of two functions 


(uv)! = u'v + uv' 


Product of n functions 


(uiua +++ Un)! = 


n 
Dupes ues Un 
ii 


three functions 


4 1 4 I mu / 
Quotient (“) a! T = (v z 0) 
Chain rule for yat qus du dv 
two functions -;, Y= dv dz 
Chain rule for du dv dw 


3j τη T : VE tabe cie 
ο y= OO 


a #0) 


(ut)! = aut! ; («€ R, 


differentiation 


Power 1 / 
Mus specially : (2) = a (u# 0) 
u u? 
ἀν) 1, .,  düny) 
Logarithmic dx y y 9.9. dg 


" / 
special: (u") = u” (v Inu+ x) (u > 0) 
u 


Differentiation of the 
inverse function 


p inverse function of f, 1.6. y = f(x) 4> x = ply): 


Implicit 
differentiation 


Derivative in 
parameter form 


1 dy 1 
Tf ay . EC rums 
πο qe de 
dy 

F(x,y)=0: F+ i a Or 

T OF 

'=— ,fy= ; F, #0 

R ES ὃν ) 
qu y )vy-wt)(t ps ter): 
uc ptt oy ὦ 
: Z= : uü ' dt 


Derivative in 
polar coordinates 


x = plp) cos p 
r=r(y): À 

(21 y = pising 

, dy _ psing + pcos 

dr pcosy—rsiny 


(angle i as parameter) 


. dx 
P dy 


y 


W A: 


2x, v" 


x? cos az) 09: If v = x? 
2, v" v9 


(uv) 50) = x?a° cos (ax + 507) + — 


“= 


0. Except the first three cases, 


50: 49 
1.2 


- 22a" cos c + 497) + : 2a/5 cos c + 487) 


cos ax are substituted, then follows u? = a* cos (ax + k3) ,U 


/ 


all the summands are equal to zero, so 
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= a’ [(2450 — a?a?) cos ασ — 100az sin az] . 
.3 ρα Y(6) 6 ,9 4 6 240 6 “pt 6 z m d 2 
WB B: (76)? = "LA T 1 -8α”ε" + 9 -6αο” + al 6e* = e” (x° + 182° + 90y + 120). 


Table 6.3 Derivatives of higher order of some elementary functions 


Function n-th-order derivative 
gm m(m — 1)(m —2)...(m— n + 1)a"7" 
(for integer m and n > m the n-th derivative is 0) 
1 
ζω E 
lng (x 0) (—1)"(n — 1)! E 
(n—1) 1 
log,r (x0 -1pd————— 
gaz (0 > 0) (ay = 
eke kref? 
a? (In a)"a* 
ak (kIna)"a** 
. . no 
sin 2; sin(x + —) 
2 
nT 
COS x cos(x + τ) 
: ; 1π 
sin ka k” sin(kx + A) 
nT 
cos kx k” cos(ka + EX 
sinh x sinh x for even n, cosh x for odd n 
cosh x cosh x for even n, sinh x for odd n 


6.1.3.4 Higher Derivatives of Functions Given in Parametric Form 


If a function y = f(x) is given in the parametric form x = a(t), y = y(t), then its higher derivatives 


di da: 

(y", y". etc.) can be calculated by the following formulas, where y(t) = E r(t) = ην y(t) = 
du . dx I ! 
E qm dg etc., denote the derivatives with respect to the parameter t: 
d 

y ij-gi dy a?j—3iig + 393?— ijë : 

E LM e. (a(t) £0). 6.24 

2-3 τὸ Ξ 0) 40) (6.24) 
6.1.3.5 Derivatives of Higher Order of the Inverse Function 
If y = p(x) is the inverse function of the original function y = f(x), then both forms y = f(a) and 


r= (ϱ 


p(y) are equivalent. Supposing φ'(ψ) 4 0 holds, the x inta 7 15) is valid for the w of 
the function f and its inverse function y. For higher derivatives (y”, y", etc.) one gets 


Py ο ὧν 3" - ee") En 
da? [Φ(υ)]}᾽ dx? [e (y)? Imus ; 
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6.1.4 Fundamental Theorems of Differential Calculus 
6.1.4.1 Monotonicity 


If a function f(x) is defined and continuous in a connected interval, and if it is differentiable at every 
interior point of this interval, then the relations 
f(x) 2 0 for a monotone increasing function, (6.26a) 


f(x) X 0 fora monotone decreasing function (6.26b) 


are necessary and sufficient. If the function is strictly monotone increasing or decreasing, then the 
derivative function f'(x) must not be identically zero on any subinterval of the given interval. In 
Fig. 6.6b this condition is not fulfilled on the segment BC. 

'The geometrical meaning of monotonicity is that the curve of an increasing function never falls for 
increasing values of the argument, i.e., it either rises or runs horizontally (Fig. 6.6a). Therefore the 
tangent line at any point of the curve forms an acute angle with the positive x-axis or it is parallel to 
it. For monotonically decreasing functions (Fig. 6.6b) analogous statements are valid. If the function 
is strictly monotone, then the tangent can be parallel to the x-axis only at some single points, e.g., at 
the point A in Fig. 6.6a, i.e., not on a subinterval such as BC in Fig. 6.6b. 


» 
b) 0| UX 


Figure 6.6 Figure 6.7 
6.1.4.2 Fermat’s Theorem 


If a function y = f(x) is defined on a connected interval, and it has a maximum or a minimum value at 
an interior point x = c of this interval (Fig. 6.7), i.e., if for every x in this interval 


f(c) > f(x) (6.272) Or f(c) « f(x), (6.27b) 
holds, and if the derivative exists at the point c, then the derivative must be equal to zero there: 
f'(c) ^ 0. (6.219) 


The geometrical meaning of the Fermat theorem is that if a function satisfies the assumptions of the 
theorem, then its curve has tangents parallel to the z-axis at A and B (Fig. 6.7). 

The Fermat theorem gives only a necessary condition for the existence of a maximum or minimum value 
at ἃ point. From Fig. 6.6a it is obvious that having a zero derivative is not sufficient to give an extreme 
value: At the point A, f'(x) = 0 holds, but there is no maximum or minimum here. 

Το have an extreme value differentiability is not a necessary condition. The function in Fig. 6.8d has 
a maximum at e, but the derivative does not exist here. 


6.1.4.3 Rolle's Theorem 


If a function y = f(x) is continuous on the closed interval [a, b], and differentiable on the open interval 


(a, b), and 


f(a)—-0, f(b)=0 (ας ὐ) (6.284) 
hold, then there exists at least one point c between a and b such that 
f(c) 20 (a<c<b) (6.28b) 


holds. The geometrical meaning of Rolle's theorem is that if the graph of a function y = f(a) which is 
continuous on the interval (a, b) intersects the z-axis at two points A and B, and it has a non-vertical 
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tangent at every point, then there is at least one point C between A and B such that the tangent is 
parallel to the x-axis here (Fig. 6.8a). It is possible, that there are several such points in this inter- 


Figure 6.8 


val, e.g., the points C, D, and E in Fig. 6.8b. The properties of continuity and differentiability are 
important in the theorem: in Fig. 6.8c the function is not continuous at x = d, and in Fig. 6.8d the 
function is not differentiable at x = e. In both cases f'(x) z 0 holds everywhere where the derivative 
exists. 


6.1.4.4 Mean Value Theorem of Differential Calculus 


If a function y = f(x) is continuous on the closed interval [a, b] and differentiable on the open interva 
(a, b), then there is at least one point c between a and b which satisfies the following relation: 


b) — 
yA ΄ E —f(c) (a<c<b) (6.29a 
—a 

holds. Substituting b — a 4- h, and O means a number between 0 
and 1, then the theorem can be written in the form 

Jf(a-- h) 2 f(a) - hf'(a-- Gh) (0«0 «1). (6.29b 
1. Geometrical Meaning The geometrical meaning of the 
theorem is that if a function y = f(x) satisfies the conditions o 
the theorem, then its graph has at least one point C between A 


τ 


oO }+----------------------- 


2 o a c 


» 
x and B such that the tangent line at this point is parallel to the 
line segment between A and B (Fig. 6.9). There can be severa 
Figure 6.9 such points (Fig. 6.8b). 


That the properties of continuity and differentiability are important can be shown in examples and also 
as can be observed in Fig. 6.8c,d. 
2. Applications The mean value theorem has several useful applications. 


ΒΑΔ: This theorem can be used to prove some inequalities in the form 


|f(b) — f(a)| < K|b — αἰ. (6.30) 
where K is an upper bound of |f" (x)| for every x in the interval [a, b]. 
1 
Β D: How accurate is the value of f(a) = if 7 is replaced by the approximate value 7 = 3.14? 


14-7? 


| — πι < 0.053 - 0.0016 = 0.000085, which means ne is 


" 2c 
We have: |f(x) — f(z)| = ο 
between 0.092 084 + 0.000 085. 
6.1.4.5 Taylor's Theorem of Functions of One Variable 


If a function y = f(x) is continuously differentiable (it has continuous derivatives) n — 1 times on the 
interval [a, a + h], and if also the n-th derivative exists in the interior of the interval, then the Taylor 


6.1 Differentiation of Functions of One Variable 443 


formula or Taylor expansion is : i 
h / 2 h” hr 
f(a+h) = f(a) + T f'(a) 4 2 μονο m π΄” 3) + — f(a t Oh) (6.31) 
with 0 < O < 1. The quantity h can be positive or negative. The mean value theorem (6.29b) is a 
special case of the Taylor formula for n — 1. 


6.1.4.6 Generalized Mean Value Theorem of Differential Calculus 
(Cauchy's Theorem) 
If two functions y = f(x) and y = y(x) are continuous on the closed interval [a,b] and they are dif- 
ferentiable at least in the interior of the interval, and g'(x) is never equal to zero in this interval, then 
there exists at least one value c between a and b such that 
(ο 

PET) Το ΚΗ (6.32) 
vb) -= pla) (c) 
The geometrical meaning of the generalized mean value theorem corresponds to that of the first mean 
value theorem. Supposing, e.g., that the curve in Fig.6.9 is given in parametric form z = y(t), y = 
f(t), where the points A and B belong to the parameter values t = a and t = b respectively. Then for 
the point C 


fla) _ FO 
g(a) po) 


= x the generalized mean value theorem is simplified into the first mean value 


tana = 


f(b) = 
6.33 
φ() (6.33) 
is valid. For (x) 
theorem. 


6.1.5 Determination of the Extreme Values and Inflection Points 


6.1.5.1 Maxima and Minima 


The substitution value f (xo) of a function f(x) is called the relative maximum (M) or relative minimum 
(m) if one of the inequalities 
Τίτο +h) < f(xo) (for maximum), (6.34a) 
f(to +h) > f(xo) (for minimum) (6.34b) 


holds for arbitrary positive or negative values of h small enough. At a relative maximum the value 
f (xo) is greater than the values in the neighborhood, and similarly, at a minimum it is smaller. The 
relative maxima and minima are called relative or local extrema. The greatest or the smallest value of 
a function in an interval is called the global or absolute maximum or global or absolute minimum in this 
interval. 


yA M y p 
X Ὁ. 
a) Mec b) m 
Figure 6.10 
6.1.5.2 Necessary Conditions for the Existence of a Relative 


Extreme Value 
A function can have a relative maximum or minimum only at the points where its derivative is equal 
to zero or does not exist. That is: At the points of the graph of the function corresponding to the 
relative extrema the tangent line is whether parallel to the x-axis (Fig. 6.104) or parallel to the y-axis 
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(Fig. 6.10b) or does not exist (Fig. 6.10c). Anyway, these are not sufficient conditions, e.g., at the 
points A, B, C in Fig. 6.11 these conditions are obviously fulfilled, but there are no extreme values of 
the function. 

If a continuous function has relative extreme values, then maxima and minima follow alternately, that 
means, between two neighboring maxima there is a minimum, and conversely. 


6.1.5.3 Determination of the Relative Extreme Values and the Inflection 
Points of a Differentiable Explicit Function y — f(x) 


Since f'(x) = 0 is a necessary condition where the deriva- 


y tive exists, after determining the derivative f'(x), firs 
; one calculates all the real roots z4,239,...,Tj,..., Zn Ὁ 
| the equation f'(r) — 0. Then each of them has to be 
E A et checked , e.g., z; with one of the following methods. 
! 1. Method of Sign Change 
| > For values z- and r,, which are slightly smaller anc 
0| X greater than z;, and for which between x; and x. and r+ 
Figure 6.11 no more roots or points of discontinuity of f'(x) exist, one 
checks the sign of f'(x). When during the transition from f'(x..) to f’(a+) the sign of f'(x) changes 
from “+” to *—", then there is a relative maximum of the function f(x) at x = v; (Fig. 6.12a); if i 
changes from “—” to “+”, then there is a relative minimum there (Fig. 6.12b). If the derivative does 
not change its sign (Fig. 6.12c,d), then there is no extremum at x = πι, but it has an inflection poin 


with a tangent parallel to the x-axis. 

2. Method of Higher Derivatives 
If a function has higher derivatives at x = 2), then one can substitute, e.g., the root x; into the second 
derivative f"(x). If f"(x;) < 0 holds, then there is a relative maximum at z;, and if f”(x;) > 0 holds, 
a relative minimum. If f”(x;) = 0 holds, then x; must be substituted into the third derivative f" (x). 
If f"(x;) Z 0 holds, then there is no extremum at x = x; but an inflection point. If still Γή) = 0 
holds, then one substitutes it into the forth derivative, etc. If the first non-zero derivative at x = x; is 
an even one, then f(a) has an extremum here: If the derivative is positive, then there is minimum, if 
it is negative, then there is a maximum. If the first non-zero derivative is an odd one, then there is no 
extremum there (actually, there is an inflection point). 


y 
-0 i 
T1 a 
0| x χι € x 


Figure 6.12 


3. Further Conditions for Extreme Points and Determination of Inflection Points 
If a continuous function is increasing below zo and decreasing after, then it has a maximum there; if it 
is decreasing below and increasing after, then it has a minimum there. Checking the sign change of the 
derivative is a useful method even if the derivative does not exist at certain points as in Fig. 6.10b,c 
and Fig. 6.11. If the first derivative exists at a point where the function has an inflection point, then 
the first derivative has an extremum there. So, to find the inflection points with the help of derivatives, 
one has to do the same investigation for the derivative function as one has done for the original function 
to find its extrema. 


Remark: For non-continuous functions, and sometimes also for certain differentiable functions the 
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determination of extrema needs individual ideas. It is possible that a function has an extremum so 
that the first derivative exists and it is equal to zero, but the second derivative does not exist, and 
the first one has infinitely many roots in an arbitrary neighborhood of the considered point, so it is 
meaningless to say it changes its sign there. For instance f(x) = 27(2 + sin (1/x)) for x # 0 and 
f(0) — 0. 

6.1.5.4 Determination of Absolute Extrema 

The considered interval of the independent variable is divided into subintervals such that in these in- 
tervals the function has a continuous derivative. The absolute extreme values are among the relative 
extreme values, or at the endpoints of the subintervals, if their endpoints belong to them. For non- 
continuous functions or for non-closed intervals it is possible that no maximum or minimum exists on 
the considered interval. 

Examples of the Determination of Extrema: 

WA:y—c*, * interval [-1, +1]. Greatest value at x = 0, smallest at the endpoints (Fig. 6.13a). 


BB: y — 2? — 2’, interval [--1, +2]. Greatest value at x = +2, smallest at x = —1, at the ends of the 
interval (Fig. 6. 13b). 


1 
BC: y= τ» İnterval [--ὸ, +3], Ὁ # 0. There is no maximum or minimum. Relative minimum 
1 +e: 
at x = —3, relative maximum at x = 3. If one defines y = 1 for x = 0, then there will be an absolute 
maximum at x = 0 (Fig. 6.13c). 


E D: y —2 — x3, interval [--1. +1]. Greatest value at x = 0 (Fig. 6.13d, the derivative is not finite). 


6.1.5.5 Determination ofthe Extrema of Implicit Functions 

If the function is given in the implicit form F(x,y) = 0, and the function F itself and also its partial 
derivatives F,, Fy are continuous, then its maxima and minima can be determined in the following way: 
1. Solution of the Equation System F(a, y) = 0, F,(z, y) = 0 and substitution of the resulting 
values (z1, 41), (12. Y2), -- - , (i, yi)... in Fy and Fix. 

2. Sign Comparison for Εν and Fz, at the Point (a;, yi): When they have different signs, the 
function y = f(a) has a minimum at z;; when Fy and Fys have the same sign, then it has a maximum at 
xi. If either F, or Fea vanishes at (x;, y;), then one needs further and rather complicated investigation. 


Figure 6.13 


6.2 Differentiation of Functions of Several Variables 
6.2.1 Partial Derivatives 
6.2.1.1 Partial Derivative of a Function 


The partial derivative of a function u = f(x, x2,...,j,..., η) with respect to one of its n variables, 
e.g., with respect to xı is defined by 
Qu | πι + Az,22,023,..., 24) — f (£1, 22, 73,.-.,2 
lim f( 1 1,42, 43 i) "i 1; 2; 93 n) (6.35) 


Ox,  Azi—0 Ax, 
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so only one of the n variables is changing, the other n — 1 are considered as constants. The symbols 


1 TE ðu ὃ . . . 
for the partial derivatives are Jr’ Us zd , f. A function of n variables can have n first-order partial 
x z 
A Ou Ou Qu Ou ; . AS p 
derivatives: . The calculation of the partial derivatives can be done following 


Ox, Oxy’ 0x3’? Oty 
the same rules as there are for the functions of one variable. 
" xy Ou xy Qu x? Ou xy 
u= E = E a S E 2o € 2 
2 Ox 2 ὃν z Oz 2? 


6.2.1.2. Geometrical Meaning for Functions of Two Variables 


If a function u = f(x,y) is represented as a surface in a Cartesian coordinate system, and this surface 
is intersected through its point P by a plane parallel to the x, u plane (Fig. 6.14), then holds 


ou — tana, (6.36a) 
Ox 

where a is the angle between the positive x-axis and the tangent line of the intersection curve at P, 
which is the same as the angle between the positive x-axis and the perpendicular projection of the 
tangent line into the x, u plane. Here, a is measured starting at the x-axis, and the positive direction is 
counterclockwise if looking toward the positive half of the y-axis. Analogously to a, ϐ is defined with 


a plane parallel to the y, u plane: 
ὃν = tan B. (6.36b) 
ὃν 


The derivative with respect to a given direction, the so-called directional derivative, and derivative with 
respect to volume, will be discussed in vector analysis (see 13.2.1, p. 708 and p. 709). 


u 
u=f(x,y) 
Au 
y 0 — 
P 
dx 
x dy 
Figure 6.14 Figure 6.15 


6.2.1.3 Differentials of x and f(z) 
1. The Differential dx of an Independent Variable x 


is equal to the increment Az, i.e., 
dr = Ax (6.37a) 
for an arbitrary value of Az. 
2. The Differential dy of a Function y = f(x) of One Variable x 
is defined for a given value of 2; and for a given value of the differential dx as the product 


dy = f'(x) da. (6.37b) 
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3. The Increment of a Function y = f(x) for x + Ax 
is the difference 

Ay = f(a + Ax) — f(x). (6.37c) 
4. Geometrical Meaning of the Differential 
If the function is represented by a curve in a Cartesian coordinate system, then dy is the increment of 
the ordinate of the tangent line for the change of x by a given increment dx (Fig. 6.1). In an analogous 
way Ay is the increment of the ordinate of the curve. 


6.2.1.4 Basic Properties of the Differential 


1. Invariance 
Independently of whether x is an independent variable or a function of a further variable t 
dy = f'(x) dx (6.38) 
is valid. 
2. Order of Magnitude 
If dx is an arbitrarily small value, then dy and Ay = y(x + Az) — y(x) are also arbitrarily small, but of 
A 
equivalent amounts, i.e., Jim, Ex — 1. Consequently, the difference between them is also arbitrarily 
r0 dy 
small, but of higher order than dz, dy and Ax (except if dy = 0 holds). Therefore, one gets the relation 
QA ; ; 
du =1, Ayxdy= f'(x)dz, (6.39) 
which allows to reduce the calculation of a small increment to the calculation of its differential. This 
formula is frequently used for approximate calculations (see 6.1.4.4, p. 442 and 16.4.2.1, 2., p. 855). 
6.2.1.5 Partial Differential 
For a function of several variables u = f(x, y, ...) one can form the partial differential with respect to 
one of its variables, e.g., with respect to x, which is defined by the equality 
Ou . 
d,u = τα... an dx. (6.40) 


6.2.2 Total Differential and Differentials of Higher Order 


6.2.2.1 Notion of Total Differential of a Function of Several Variables 
(Complete Differential) 


1. Differentiability 
The function of several variable u = f(x, 29, ..., Ti, ..., Zn) is called differentiable at the point 
Po(@10, 290»... Tio, - - - , Uno) if at a transition to an arbitrarily close point P(x194- Aa, 1294- A9, . . . , jo 
ΔΙ. £no + Αι) with the arbitrarily small quantities Au, Ax, ..., Av;,..., Av, the complete 
increment 

Au = f(x1o + Az, 39 + Απο... 2910 + ATi... no + ATp) 


— f (X10, 320; i9; +++, Xn0) (6.41a) 
of the function differs from the sum of the partial differentials of all variables 
Qu Qu Qu 
dx, 4 dz» 4 ...4 o M AN P 6.41b 
r Ότο T Ox, ninao ves Tn ( ) 
by an arbitrarily small amount in higher order than the distance 
ΠΡ = "NS + Az +... + Az? = Vaz? +da3+...+dx?. (6.41c) 
A continuous function of several variables is differentiable at a point if its partial derivatives, as func- 
tions of several variables, are continuous in a neighborhood of this point. This is a sufficient but not a 
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necessary condition, while the simple existence of the partial derivatives at the considered point is not 
sufficient even for the continuity of the function. 


2. Total Differential 
If u is a differentiable function, then the sum (6.41b) 


oe m εν du, (6.42a 


Ἢ dty+...4 
Ox, "d ὅτο s On, 


is called the total differential of the function. With the n-dimensional vectors 


du 


edie ( = Ou Ou 


T 
Js (6.42b) dr = (dai, dz2,.. . , d£n)" (6.42c 


the total differential can be expressed as the scalar product 
du = (grad u)” - dr. (6.42d 
In (6.42b), there is the gradient, defined in 13.2.2, p. 710, for n independent variables. 


3. Geometrical Representation 

The geometrical meaning of the total differential of a function of two variables u = f(x, y), represented 
in a Cartesian coordinate system as a surface (Fig. 6.15), is that duis the same as the increment of the 
applicate (see 3.5.3.1, 3., p. 210) of the tangent plane (at the same point) if dx and dy are the increments 
of x and y. 

From the Taylor formula (see 6.2.2.3, 1., p. 449) it follows for functions of two variables that 


ὃ ὃ 
f(x,y) = f (0, Yo) + A (no Yo) (x — vo) + o. (ro Vo)(y — vo) + Fa. (6.434) 
Ignoring the remainder Ft, holds that 
ὃ ὃ 
u = f(vo,yo) + A (ao, yo)(x — πρ) + Fp (to, yo)(y — wo) (6.43b) 


gives the equation of the tangent plane of the surface u = f(x,y) at the point Po(xo, yo, uo). 
4. The Fundamental Property of the Total Differential 

is the invariance with respect to the variables as formulated in (6.38) for the one-variable case. 
5. Application in Error Calculations 


In error calculations one uses the total differential du for an estimation of the error Au (sce (6.41a)) 
(see, e.g., 16.4.1.3, 5., p. 852). From the Taylor formula (see 6.2.2.3, 1., p. 449) follows 


|Au| = |du + Ri| € |du| + |Ri| ~ |dul, (6.44) 
i.e., the absolute error |Au| can be replaced by |du| as a first approximation. It follows that du is a 
linear approximation for Au. 


6.2.2.2 Derivatives and Differentials of Higher Order 


1. Partial Derivatives of Second Order, Schwarz’s Exchange Theorem 


The second-order partial derivative of a function u = f(x1, £2, ..., Zi, ..., Zn) can be calculated 
; . Cu Pu . 
with respect to the same variable as the first one was, i.e., A's or with respect to another 
z?’ Oy? 
1 2 


Pu Ou Pu 
Oxj0r3' Oxg0r4 Ox3021 
If at the considered point the mixed derivatives are continuous, then 

Pu Ou 
Ox, 0x2 E Ox303, 


variable, iie. ———— , ------;,-----,:::. In this second case one talks about mixed derivatives. 


(6.45) 
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holds for given xı and x» independently of the order of sequence of the differentiation (Schwarz's ex- 
change theorem). 

Ou Bu 

Ox? > OrOy? 
2. Second-Order Differential of a Function of One Variable y = f(x) 

The second-order differential of a function y = f (x) of one variable, denoted by the symbols d?y, d? f (x), 
is the differential of the first differential: d?y = d(dy) = f”(x)dx?. These symbols are appropriate only 
if r is an independent variable, and they are not appropriate if x is given, e.g., in the form x = z(v). Dif 
ferentials of higher order are defined analogously. If the variables x1, 2,...,2j,...,Up» are themselves 
functions of other variables, then one gets more complicated formulas (see 6.2.4, p. 452). 

3. Total Differential of Second Order of a Function of Two Variables u = f(x, y) 


Partial derivatives of higher order such as, e.g., are defined analogously. 


, Pu, o? CEE 
du = d(du) = dx? 4 Bx dx dy 4 πρ dy? (6.46a 
or symbolically 
, ; 2 
u= (5 dx + Za) u. (6.46b 
4. Total Differential of n-th Order of a Function of Two Variables 
d'u = ο + Za) u. (6.47 


5. Total Differential of n-th Order of a Function u = f(£1, £2,..., £m) of m Vari- 
ables 


ð ὃ ὃ 4 
d'u = (= daz + δες dro... Br. dan) u. (6.48) 


6.2.2.3 Taylor's Theorem for Functions of Several Variables 


1. Taylor's Formula for Functions of Two Variables 


a) First Form of Representation: 


Of(v. y) Of (x,y) 
f(x,y) = fla, b) + = (α -- α) -Ε ᾽ (y — b) 
Or — (xa) (ab) Oy ων (a) 
1 (0 fv. y) 2 S0 f (v, y) 
qp um z—a)-42——2-7- x—a)(y—b 
2! | Ox? — OxOy € Xy -9) 
82 T. : 1 1 
Sn (y- πο t ned Re (6.492) 
Oy (2) 7 (a,b) 3! n! 
Here (a, b) is the center of expansion and R, is the remainder. Sometimes one writes, e.g., instead of 
afr. v) the shorter expression OF dis Yo) - 
Ox |ey)=(20,v0) dx 


The terms of higher order in (6.49a) can be represented in a clear way with the help of operators: 


flaw) = Had) tie- + (i rr) 
ὃ 


1 ay 
ία τος + 0-5} fe) 


(z,y)=(a,b) 


(z,y)=(a,b) 
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+ Bn. (6.49b) 


(αφ) (a,b 


+5 if (a,y) τ ῥοῦ! Tec ae e foy) 


This symbolic form means that after using the binomial theorem the powers of the differential operators 


ὃ --- 
and = — ,, Iepresent the higher-order derivatives of the function f(a, y). Then the derivatives must 


Ox 
be in. in the point (a,b). 
b) Second Form of the Representation: 


1/89 ð 1/9 ὃ 
fet+hytk) = f(y) ΕΤ (5: n+ Ža) fayt+5 ο. h+ By K) ο 
1/a Oy" 1/8 ὃ 
ta (5 n+ 5) f(z,y) 4-4 π (δ -h z 2! f(x,y) + Rn. (6.496) 
c) Remainder: The expression for the remainder is 
m-l (2 n+ 2k)” ο <0<1) (6.494) 
^^ (n+1)! (3x ? ὃν oe ne ` ᾿ 
2. Taylor Formula for Functions of m Variables 
The analogous representation with differential operators is 
(σελ, y k,...,t- I) 
"1/0 ὃ a\' 
= f(a,y,...,t)4 Li (z h4 ὃν k++ π) f(x,y,---,t)+ Rn, (6.50a) 
where the remainder can be calculated by the expression 
1 a, ὃ ay 
Ra n Dj (sat gee z1) f(a@t+Oh,y+Ok,...,t+0l) 
(0 « O « 1). (6.50b) 


6.2.3 Rules of Differentiation for Functions of Several Variables 
6.2.3.1 Differentiation of Composite Functions 
1. Composite Function of One Independent Variable 
U = f(t baean. r= tilé), Φος TE) pe Be = σε) (6.51a) 
Ou Ou dz, Ou dx» Ou dx, 
gn anc ης με (6.51b) 


2. Composite Function of Several Independent Variables 


u = f(zi,22,..., 24), 

Hy SA (E MeT h Ce ξ (Eee των Ba — (50s T) (6.52a) 
Ou Ou Ox, , Ou Oxy Ou Om, 
δε Ox, OF | Oxy OF | Ox, OE’ 
Ou Ou Ox, , Ou Óv» Qu Om, 
Op Ox, On δα δη OU Om, On? (6.52b) 
ðu | Ou ‘Ox, Ou δα» Ou θα, 


ðr an, ðr δι Or δε, OT 
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6.2.3.2 Differentiation of Implicit Functions 
1. A Function y = f(x) of One Variable is given by the equation 


F(x,y) - 0. (6.53a 
Differentiating (6.53a) with respect to x with the help of (6.51b) one gets 
F, 
F, + Fyy' =0 (6.53b) and y = -7 (Εν # 0). (6.53c 
y 


Differentiation of (6.53b) yields in the same way 
Foz + 2Fay + Fyy(y')? + Fy" = 0, (6.53d 

so considering (6.53b) one has 

y" 2F.F,EL — (Fy)? Fire - (Fe)? Foy 


: 6.53e 
(y - 
In an analogous way one can calculate the third derivative 

Έπος + 3Feeyy! + 3F Y + Fay (y')? 3E y" 4 BFyyyly! + Fy" = 0, (6.536 


from which y" can be expressed. 
2. A Function u = f (£1, £2,- , Lis- - - , En) of Several Variables is given by the equation 
F(21,29,...,%i;+-+,2n,u) = 0. (6.54a 
The partial derivatives 
Ou Γι Ou Frys Ou F, 
— =, T., am (6.54b 
Ox, Fu ὅτο Fa 1 
can be calculated similarly as it has been shown above but here the formulas (6.52b) are to be used. 
The higher-order derivatives can be calculated in the same way. 
3. Two Functions y = f(a) and z = (x) of One Variable are given by the system o 
equations 


F(x,y,z)=0 and (x,y,z) =0. (6.55a 
Then differentiation of (6.55a) according to (6.51b) results in 
F, + Fy + F.2z' =0, B, +8yy + δις =0, (6.55b 
ΕΦ, — DFs FD, — F0, 
V , 9 E ELS (6.55c 
F9, — ΕΦ, F0. — Ε.Φ, 


The second derivatives y" and z" are calculated in the same way by differentiation of (6.55b) considering 
y' and 2’. 
4. n Functions of One Variable Let the functions yı = f(x), y2 = q(x),.... ya = v(x) be given 
by a system 

F(£,Y1,Y2,.--;Yn) =0, Φα, yi Y2,---5Yn) =0, ...,ψΨ(α,νι,υα,....υμ)--0 (6.56a) 
of n equations. Differentiation of (6.56a) using (6.51b) results in 


F4 Fuys Fyy Fed FynYn =0, 
aie ete / T f 
Po + Puyi + Puya + oor Putin = 0, (6.56b) 
eS: ut tae τας seer does a c0 
V, + Vat + ΡΕ 
Solving (6.56b) yields the derivatives yj, y5,...,y/,, which are to be looking for. In the same way one 


can calculate the higher-order derivatives. 
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5. Two Functions u = f(z,y),v = φ(α,υ) of Two Variables are given by the system o 
equations 
F(x,y,u,v)=0 and d$(r,y,u,v) = 0. (6.57a 
Then differentiation of (6.57a) with respect to x and y with the help of (6.52b) results in 
OF | OF Ou , OF Ov OF OF Ou , OF Ov 
Ox Oude Ovdx ^ Oy © Ou dy ὃυὸν ` 
05 0 EM Er Bv ipu Ob ὃφδι Əv (6.57¢ 
I ΓΞ" 0. 
Or Oudx  OvOx ᾿ Oy  OuOy  OvOy 


Solving the system (6.57b) for n 2 and the system (6.57c) for τ a give the first-order partia 
ὅς Ox Oy Oy 


derivatives. The higher-order derivatives should be calculated in the same way. 
6. n Functions of m Variables Given by a System of πι Equations The first-order and 
higher-order partial derivatives can be calculated in the same way as in the previous cases. 


6.2.4 Substitution of Variables in Differential Expressions and 


Coordinate Transformations 
6.2.4.1 Function of One Variable 


Suppose, given a function y(x) and a differential expression F containing the independent variable, the 
function, and its derivatives: 


dy d^y dy . 
y = f(x), (6.58a) PHP 2,9) sop (6.58b) 
dx’ dx?’ da? 
If the variables are substituted, then the derivatives can be calculated in the following way: 
Case 1a: The variable x is replaced by the variable t, and they have the relation 
z= e(t). (6.59a 
Then holds 
dy 1 dy dy 1 "DLP 
Ξ - err p (t ).59E 
dx (8) dt ? dx? ale e( JTE p ( T , (6 59b 
dy 1 dm d'y dy 
ae ο one / "(¢) 2 3ly" (t 2. (t Pág I. A 6.59 
iss = WOR tet PGE -3e(0) e" 072 Be" OP — 0 e" 0122 (6.59¢ 
Case 1b: If the relation between the variables is not explicit but it is given in implicit form 
(x,t) = 0, (6.60 
μα αν dy oe 
then the derivatives — are calculated by the same formulas, but the derivatives v (t), v" (t), 


dx’ dx?’ da? 
p(t) must be calculated according to the rules for implicit functions. In this case it can happen tha: 
the relation (6.58b) contains the variable x. To eliminate x, the relation (6.60) is used. 


Case 2: If the function y is replaced by a function u(x), and the relation between them is 


y= p(u), (6.614 
then the calculation of the derivatives can be performed using the following formulas: 
dy „du d’y 1, Cu» du\? 
-~= y(t ; 5 = v(t + 1 61 
dx Y (u) dx’ dz * (4) πρ (u) dx} ` ΕΘ 
dy pn Ou du d?u du\* 
= = + 3p” sty" € 6.61 
di ^ Ms P de daz © (u) dx '. 
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Case 3: The variables x and y are replaced by the new variables t and u, and the relations between 
them are given by 


z—g(tu, y-w(tu) (6.622. 
For the calculation of the derivatives the following formulas are used: 
: Ov  Owdu 
y C ἷ 
Ts ὃν δ. ἄς, (6.62b 
Ot ' Ou dt 
Ov | Ov du Ow | Ob du 
dy d (5) - δι δια. ..— 1 — d|9t Judt (6.62c 
da? da \dx dx | ὃρ , pdu Op , Apdu dt | Op  ὃράι i 
Ot ' δι dt Ot ' Ou dt Ot — Ou dt 
1d (5) _ 1 c AG] 
B dt NB ΒΡ dt dt J^ 
. OW | Oi du Op pdu 
with A= E + Bu dt (6.6209) and B= ET + aE ; 


The determination of the third derivative ur can be done in an analogous way. 
Wi For the transformation from Cartesian coordinates into polar coordinates according to 
X -—pcosp, y=psing (6.634) 
the first and second derivatives should be calculated as follows: 
dy p'sing + pcosy 
dc p! cosg — psiny’ 


dy 7 p? ES 20? = pp" 


.63b ] 
(585 dx? (p' cosy — psin p)? 


(6.63c) 


6.2.4.2 Function of Two Variables 


Suppose given a function w(x, y) and a differential expression F containing the independent variables, 
the function and its partial derivatives: 


(6.64b 


2 2, 2 
w= f(x,y), (6.644) Fa F (nya Pu Pu ee ΓΝ » 


'Or'Oy Ox? ' 0xOy Oy? 7 
If x and y are replaced by the new variables u and v given by the relations 
α —g(uv) y-w(uv) (6.65a 


then the first-order partial derivatives can be expressed from the system of equations 


Ow  OwOp  Qwdy ðw  OwOp , δωδψ (6.65b 
Ou  OrOu Oy Au’ Ov  OrOv ` ðy ðv i 
with the new functions A, B, C, and D of the new variables u and v 
Ow Ow Ow δω Ow Ow 
A BR, = + D—. : 
Ox Ou Τ ðv’ ὃν E7 ðv οσο 
The second-order partial derivatives are calculated with the same formulas, only without using w in 
; : οχι Ow Ow 
them but its partial derivatives — and —, e.g., 
Ox Oy 


Pu 0 (ðv 9 429 | pôw A VM Pw |0A0w , ABA 
Ox? Ox \ Ox Ox Ou Ov Ou? ϑιϑυ | Ou ðu du dv 
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Pw Pw ôðAðw  0BOw 
B - 7 E- . (6.66 
( Oudv = Ov  OvOu Ov ον) (6:66) 
The higher partial derivatives can be calculated in the same way. 


W The Laplace operator (see 13.2.6.5, p. 716) is to be expressed in polar coordinates (see 3.5.2.1, 2., 
p. 191): 


2 32 
Aw = = + a T (6.672) z= pcos, y-—psing. (6.67b) 
The calculations are: 
Ow δω Ow Ow Ow Ow 


δ» = δρ cos p 4 d sin p, δρ = p, ane } p PP 


δω Ow sinpow Ow . ðw cosp dw 
COS ip sin Y= t c 
Ox "Op p Op ὃν Pap p Op 


Ow sing 5) siny ὃ C ðw sing ο) 
7 sy- : 
Op p Op p Op ὃρ p Op 


δω 


Ox? 


ὃ 
= cos 3 c 
p 


2 
Similarly, € is calculated, so finally: 
y? 


Ow 18w 18w 
Aw = tsa t 3 (6.67c) 
Op  pOp pOp 
Remark: If functions of more than two variables should be substituted, then similar substitution 
formulas can be derived. 


6.2.5 Extreme Values of Functions of Several Variables 


6.2.5.1 Definition of a Relative Extreme Value 


A function u = f(x1,22,..., Zi,- . , €n) has a relative extreme value at a point Po(x10, 129, . .. , Tios- ++; 
£no), if there is a number e such that for every point P(x1, £2, . . . , &n) belonging to the domain z19 — € < 
£1 < T10 + €, 299 — € < Lo < X39 + €, . .., Eno — € € Tn € Tno + € and to the domain of the function but 
different from Po, then for a maximum the inequality 

f (21,29,..., En) < f (T10, T20, . - - , Eno) (6.68a) 
holds, and for a minimum the inequality 

f (21,22,..., 24) > f (t10, 290,...., no) (6.68b) 


holds. Using the terminology of several dimensional spaces (see 2.18.1, p. 118) a function has a relative 
maximum or a relative minimum at a point if it is greater or smaller there than at the neighboring 
points. 


6.2.5.2 Geometric Representation 


In the case of a function of two variables, represented in a Cartesian coordinate system as a surface 
(see 2.18.1.2, p. 119), the relative extreme value geometrically means that the applicate (see 3.5.3.1, 3., 
p. 210) of the surface in the point A is greater or smaller than the applicate of the surface in any other 
point in a sufficiently small neighborhood of A (Fig. 6.16). 

If the surface has a relative extremum at the point P, which is an interior point of its domain, and 
if the surface has a tangent plane at this point, then the tangent plane is parallel to the x,y plane 
(Fig. 6.16a,b). This property is necessary but not sufficient for a maximum or minimum at a point 
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Figure 6.16 


Py. For example Fig. 6.16c shows a surface having a horizontal tangent plane at Po, but there is a 
saddle point here and not an extremum. 


6.2.5.3 Determination of Extreme Values of Differentiable Functions of 
Two Variables 


If u = f(x,y) is given, then one solves the system of equations fy = 0, fy = 0. The resulting pairs of 


values (x1, y1), (12. ya). ... can be substituted into the second derivatives 
Of Of Of 
A= B = — ;---. ; 
Oa?" OxOy' d Oy? (6.69) 
Depending on the expression 
AB , : 
A= E Οἱ! AC — ΒΡ = [fee fuy Ua dee es (i =1,2,.. 9 (6.70) 


it can be decided whether an extreme value exists and of what kind it is: 

1. In the case A > 0 the function f(x,y) has an extreme value at (x;,y;), and for f,, < O it isa 
maximum, for [ως > 0 it is a minimum (sufficient condition). 

2. In the case A < 0 the function f(x,y) does not have an extremum. 

3. In the case A = 0, one needs further investigation. 


6.2.5.4 Determination of the Extreme Values of a Function of n Variables 


If u = f(xi,v2,..., En) is given, then first it is to find a solution (x10, 720, . .. , 40) of the system of the 
n equations 
"S = 0, "nm = 0, 5 ον diis . 0, (6.71) 
because it is a necessary condition for an extreme value. (6.71) is not a sufficient condition. Therefore 
Of 


. Then it is to 


one prepares a matrix of the second-order partial derivatives such that aj; = ΕΤ; 
TiOT; 

substitute a solution of the system of equations (6.71) into the terms, and to prepare the sequence of 

left upper subdeterminants (a11, @11@22 — a12021, ...). Then there are the following cases: 

1. The signs of the subdeterminants follow the rule —, +, —, +, ..., then there is a maximum there. 


2. The signs of the subdeterminants follow the rule +, +, +, +, ..., then there is a minimum there. 

3. There are some zero values among the subdeterminants, but the signs of the non-zero subdetermi- 
nants coincide with the signs of the corresponding positions of one of the first two cases. Then fur- 
ther investigation is required: Usually one checks the values of the function in a close neighborhood of 
110; T20, ... ; Tno. 

4. The signs of the subdeterminants does not follow the rules given in cases 1. and 2.: There is no 
extremum at that point. 


456 6. Differentiation 


The case of two variables is of course a special case of the case of n variables, (see [6.4]). 


6.2.5.5 Solution of Approximation Problems 


Several different approximation problems can be solved with the help of the determination of the ex- 
treme values of functions of several variables, e.g., fitting problems or mean squares problems. 


Problems to solve: 

e Determination of Fourier coefficients (see 7.4.1.2, p. 475, 19.6.4.1, p. 992). 

e Determination of the coefficients and parameters of approximation functions (see 19.6.2, p. 984 ff). 

e Determination of an approximate solution of an overdetermined linear system of equations (see 19.2.1.3, 
p. 958). 

Methods: For these problems the following methods are used: 

e Gaussian least squares method (see, e.g., 19.6.2, p. 984). 

e Least squares method (see 19.6.2.2, p. 985). 

e Approximation in mean square (continuous and discrete) (see, e.g., 19.6.2, p.984). 

e Calculus of observations (or fitting) (see 19.6.2, p. 984) and regression (see 16.3.4.2, 1., p. 841). 


6.2.5.6 Extreme Value Problem with Side Conditions 


To determine are the extreme values of a function u = f(z£1,%2,..., £n) of n variables with the side 
conditions 
plti Ga, .- $5) = 0, O(a, Va, -.. np) = 0)... XLi; σοι. Bn) — 0. (6.72a) 


Because of the conditions, the variables are not independent, and if the number of conditions is k, 
obviously k « n must hold. One possibility to determine the extreme values of u is to express k variables 
with the others from the system of equations of the conditions, to substitute them into the original 
function. Then the result is an extreme value problem without conditions for n — k variables. The other 
way is the Lagrange multiplier method. Introducing k undefined multipliers A, 4, . . . , &, and composing 


the Lagrange function (Lagrangian) of n + k variables x1, £2, ... 234, À, 4t... & gives: 
Q (71,22,..., Xn À; JG... κ) 
= f(zi,29,..., En) + λρίπι, σοι... En) + μψίσι, 29,... qu) +: 
TR X(T, 12, Tn). (6. 72b 


The necessary condition for an extremum of the function Φ is a system of n + k equations (6.71) with 
the unknowns £1, 25,... , En, À ly... K? 

Q—0,5—0,...,x20,9,—50,0,,—0,...,9,, —0 (6.72c 
The necessary condition for an extremum of the function f at the point Po(x19, x29, .. ., 490) with the 
side conditions (6.72a) is that the system of values x19, £20, ..., &no must fulfill the equations (6.72c). 
So it is to look for the extremum points of f among the solutions 249, 20,..., &no of the system o 
equations (6.72c). To determine whether there are really extreme values at these points fulfilling the 
necessary conditions requires further investigations, for which the general rules are fairly complicated. 
Usually one uses some appropriate and individual calculations depending on the function f to prove i 
there is an extremum, or not. Often approximation calculations are helpful, i.e., the comparison with 
values of the function in the neighborhood of Po. 
Β The extreme value of the function u = f (v, y) with the side condition y(x, y) = 0 will be determined 
from the three equations 


σίου) Ner, )] — 0 (6.73) 
y 
There are three unknowns, x,y, A. Since the three equations (6.73) are only necessary but not suffi- 
cient conditions for the existence of an extremum, further investigation is needed whether there is an 
extremum at the solution of this system. A mathematical criterion is rather complicated (see [6.3], 
(6.8]); comparisons of the values of the function at points in the close neighborhood are often useful. 


φία, y) — 0, Žien) + λφία,ν)] = 0, 


7 Infinite Series 


7.1 Sequences of Numbers 


7.1.1 Properties of Sequences of Numbers 


7.1.1.1 Definition of Sequence of Numbers 
An infinite sequence of numbers (hereafter briefly sequence of numbers) is an infinite system of numbers 

d1,02,...,Qn,... or briefly {an} with n=1,2,..., (7.1) 
arranged in a given order. The numbers of the sequence of numbers are called the terms of the sequence. 
Among the terms of a sequence of numbers the same numbers can occur several times. A sequence is 
considered to be defined if the law of formation, i.e., a rule is given, by which any term of the sequence 
can be uniquely determined. Mostly there is a formula for the general term απ. 
Examples of Sequences of Numbers: 


α A:0,—7:1,2,3,4,5,.... MB: a, — 44 3(n — 1): 4,7,10,13,16, .... 
qe 33 3 3 
C: an = 3(-= :3,-=,-,-=,=;- D: a, = (-1)""!: 1,-1,1,-1,1,.... 
mC: a -3(75) 5-3 4'-8 16 a c MEL E 
1 l3 3 11 
2 an =3- επ ο 2 2-2; ad 2— = —). 
BE: a 3 25-2 1,2 252 ?g (read 1 1! 
1 4 —n-l 
BF: ο - 33 x 10 2 for odd n and 
1 2 πο | 
an= 33 Ἐξ 10 2 for even n: 3; 4; 3.3; 3.4; 3.33; 3.34; 3.333; 3.334; .... 
1 1111 
. Exc nM NEN " —(—1 tg: ,—29.9,—4,5..—6, nas 
BG: m= lg peres B H: a, = (-1)"*!n: 1, 2,3, 4,5, —6, 
+1 
BI: a,= aw S for odd n and αμ = 0 for even n: —1,0, —2, 0, —3, 0, —4,0,.... 
1 1 1 
WB J: a, = 3— ———, for odd n and a, = 13 — -π j for even n: 1, 11,2, 12, 2-, 131 ο) τοῦ "A 
22 — 9 293 — 2" 2) 4) 4 
7.1.1.2. Monotone Sequences of Numbers 
A sequence of numbers a4, @2,...,@n,... is monotone increasing if 
a L aa ag € Lln, (7.2) 
is valid and it is monotone decreasing if 
a >a, >a3>°::>a,>°:: (7.3) 


is valid. One talks about a strictly monotone increasing sequence of numbers or strictly monotone de- 
creasing sequence of numbers, if equality never holds in (7.2) or (7.3). 

Examples of Monotone Sequences of Numbers: 

@ A: Among the sequences of numbers from A to J the sequences A, B, E are strictly monotone in- 
creasing. 

W B: The sequence of numbers G is strictly monotone decreasing. 


7.1.1.3 Bounded Sequences of Numbers 
A sequence of numbers is called bounded if for all terms 


[αμ] < K (7.4) 
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is valid for a certain K > 0. If such a K (bound) does not exist, then the sequence of numbers is 
unbounded. 


W Among the sequences of numbers from A to J the sequences of numbers C with K = 4, D with 
K —2, E with K = 3, F with K = 5, G with K = 2 and J with K = 13 are bounded. 


7.1.2 Limits of Sequences of Numbers 


1. Limit of a Sequence of Numbers 
An infinite sequence of numbers (7.1) has a limit A if for an unlimited increase of the index n the 
difference a,, — A becomes arbitrarily small. Precisely defined this means: For an arbitrarily small € > 
0 there exists an index no(£) such that for every n > no 

la, — Al < €. (7.5a) 
The sequence of numbers has the limit +-oo (—oo), if for arbitrary K > 0 there exists an index no(K’) 
such that for every n > no 

tm >K —(a,«-—K). (7.5b) 


2. Convergence of a Sequence of Numbers 
If a sequence of numbers {αμ} satisfies (7.5a), then one says it converges to A. This is denoted by 


jim, Qn =A or à,- A. (7.6) 


W Among the sequences of numbers from A to J on the previous page, C with A = 0, E with A = 3, 
F with A= 3i , G with A = 0 are convergent. 


3. Divergence of a Sequence of Numbers 

Non-convergent sequences of numbers are called divergent. One talks about proper divergence in the 
case of (7.5b), i.e., if as n exceeds any value, an exceeds any large positive number K (K > 0) so that 
it never goes below, or if as n exceeds any value, an goes below any negative number -K (K > 0) with 
arbitrarily large magnitude and never increases above it, i.e., if it has the limit +00. Then one writes: 
lim. ας —oo (an < —K Vn > no). (7.7) 


lim a, =0o (an KVn > no) or 
n—oo n 


η. 
Otherwise the sequence of numbers is called improperly divergent. 

Examples of Divergent Sequences of Numbers: 

W A: Among the sequences of numbers from A to J on the previous page, A and B tend +00, they 
are properly divergent. 

W B: Among the sequences of numbers D is improperly divergent. 

4. Theorems for Limits of Sequences of Numbers 

a) If the sequences of numbers {αμ} and {bn} are convergent, then 


Jim (an ο ο ο ΠΟΤ 


hold, and if b,, Z 0 for every n, and im, bn # 0, then holds 


lim a 
da, n 
1---.- Πο _ (7.10) 
no p, lim bp 
n: " 


Remark: If Jim. bn = 0 and {αμ} is bounded, then lim. (anbn) = 0 even if {αμ} does not have any 
finite limit. 
b) If im ln = πα. b, = A is valid and at least, from an index n; and beyond, the inequality a, < 
Cn € b, holds, then also holds 

jim c, = A. (7.11) 
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c) A monotone and bounded sequence of numbers has a finite limit. If a monotone increasing sequence 
of numbers a; € az € ag < ... is bounded above, i.e., a, < Ay for all n, then it is convergent, and 
its limit is equal to its least upper bound which is the smallest possible value for Κ᾿. If a monotone 
decreasing sequence of numbers a4 > az > a3 > ... is bounded below, i.e., an > Ko for all n, then it is 
convergent, and its limit is equal to its greatest lower bound which is the largest possible value for 5. 


7.2 Number Series 
7.2.1 General Convergence Theorems 
7.2.1.1 Convergence and Divergence of Infinite Series 


1. Infinite Series and its Sum 
From the terms a; of an infinite sequence of numbers {ax} (see 7.1.1.1, p. 457) the formal expression 


ay + a2 +++ +ant = a (7.12) 
k=1 


can be composed and this is called an infinite series (hereafter briefly series); a; is the general term of 
the series. The finite sums 


Sı = 01, S2 = a1 +02, ..., Sn = αι (7.13) 
k-i 


are called partial sums. 
2. Convergent and Divergent Series 
A series (7.12) is called convergent if the sequence of partial sums {Sn} is convergent. The limit 


$= Jim, 54.— » ak (7.14) 


is called the sum of the series. If the limit (7.14) does not exist or it is equal to +0ọ, then the series is 
called divergent. In this case the partial sums are not bounded or they oscillate. So the determination 
of the convergence of an infinite series is reduced to the determination of the limit of a sequence {Sn}. 
ΒΑ: The geometric series (see 1.2.3, p. 19) 

tititi tat (7.15) 
is convergent with the sum S = 2 (see (1.54b), p. 19) with ag = 1, q = 1/2). 
WB B: The harmonic series (7.16) and the series (7.17) and (7.18) 


1 1 1 

τ. a es (7.16) 1+14+14---4+1+4+--- (7.17) 
TL 

πα ο (7.18) 


are divergent. For the series (7.16) and (7.17) lim 5, = oo holds, (7.18) oscillates. 

3. Remainder " 

The remainder of a convergent series S = 5 αι is the difference between its sum S and the partial sum 
k=1 


Sn: 


Rr, 6 — Sn y ak = an+1 + an2 rss. (7.19) 
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7.2.1.2 General Theorems about the Convergence of Series 
1. Necessary Criterion for the Convergence of a Series The sequence of terms of a convergent 
series is a null sequence: 


]im a, = 0. (7.20) 


This is only a necessary but not sufficient condition. 
W For the harmonic series (7.16) lim a, = 0 holds, but lim δ, = oo. 
n—> o0 n—oo 
2. Leaving out the Initial Terms If one leaves out finitely many initial terms of a series or one 
introduces finitely many further terms into it at the begin or if changing the order of finitely many 


terms, then its convergence behavior does not change. Exchange of the order of finitely many terms 
does not affect the sum if it exists. 


3. Multiplication of all Terms If all terms of a convergent series are multiplied by the same factor 
c, then the convergence of the series does not change; its sum is multiplied by the factor c. 


4. Termwise Addition or Subtraction By adding or subtracting two convergent series 


àjc 3c ade So ax Si, (7.21a) bi tg tes + bp tee M = SQ (7.21b) 
k=1 k=1 


term by term, then the result is a convergent series, and its sum is 


(αι + bi) + (a9 + 09) +-+- + (an ΞΕ bn) +++ = Si E 51. (7.21c) 


7.2.2 Convergence Criteria for Series with Positive Terms 


7.2.2.1 Comparison Criterion 


Suppose there are two series 


αι T d2 o: o s = Ma (7.22a) bibe bb te = VO hn (7.22b) 


n=1 n=1 


with only positive terms (an > 0, bn > 0). If an > bn holds from a certain no, then the convergence of 
the series (7.22a) yields the convergence of the series (7.22b), and the divergence of the series (7.22b) 
yields the divergence of the series (7.22a). In the first case (7.22a) is called a convergent majorant and 
in the second case (7.22b) a divergent minorant . 


W A: Comparing the terms of the series 


1 1 1 

ο ες pen (7.23a 
with the geometric series (7.15), the convergence of the series (7.23a) follows. From n — 2 the terms o 
the series (7.23a) are smaller than the terms of the convergent series (7.15): 


l i (n > 2). (7.23b 


n” 9n-l κά. 
W B: From the comparison of the terms of the series 

pinos dope doe (7.242. 

ν3 y3 vn 
with the terms of the harmonic series (7.16) follows the divergence of the series (7.24a). For n > 1 the 
terms of the series (7.24a) are greater than those of the divergent series (7.16): 
1 
Se 


νη” π 


(n > 1). (7.24b) 
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7.2.2.2 D’Alembert’s Ratio Test 


If for the series 


addo +ant Yo an (7.252) 


n=1 


r Gn41 : OM" 
all the ratios are smaller than a number q < 1 from a certain ng onwards, then the series is 


απ 
convergent: 
a, 
«ας (n>n): (7.25b) 
απ 


If these ratios are greater than a number Q > 1 from a certain ng onwards, then the series is divergent. 
From the two previous statements it follows that if the limit 


"E CN 
im m P (7.25¢ 
exists, then for p < 1 the series is convergent, for p > 1 it is divergent. In the case p = 1 the ratio tes 
gives no information whether the series is convergent or not. 
ee eee 3 n 
WB A: The series gt gp tag t tat] (7.26a 
is convergent, because 1 
1+— 
. 1151 η Ρ mcd 
p Jim. ( Gen =) "πα. 5 5 « 1 holds. (7.26b 
3 4 n+1 
ΒΒ: For the series 2+-+=—+---4 — e 7.27a 
4 9 n? ( 
the ratio test does not give any result about whether the series is convergent or not, because 
n+2 n+l 
lim =: - 1. 7.27b 
κ... (z +1) n? ) ( 


7.2.2.3 Root Test of Cauchy 


If for a series 


a doc e +ant a, (7.282) 


n=1 


from a certain ny onwards for every value ϱ/αμ 


va,«q-«l1 (7.28b) 
holds, then the series is convergent. If from a certain no every value ϕ/αμ is greater than a number Q 
where Q > 1 holds, then the series is divergent. 
From the previous statements it follows that if 


dim Van = p (7.280) 


exists, in the case p < 1 the series is convergent, in the case p > 1 it is divergent. For p = 1 with this 
test one cannot tell anything about the convergence behavior of the series. 


ον 4 9 n? 
W The series - H (3) (2) Fed (X) Eee (7.29a) 
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is convergent because 


n 


noon n ΝΤ . 1 1 
p= lim ( τ) im 1 πρ 1 holds. (7.29b) 


1+- 
n 
7.2.2.4 Integral Test of Cauchy 


1. Convergence [fa series has the general term a, = f(n), and f(x) is a monotone decreasing 
function such that the improper integral 


no dx (see 8.2.3.2, 1., p. 507) (7.30) 


exists (it is convergent), then the series is convergent. 

2. Divergence If the above integral (7.30) is divergent, then the series with the general term a, = 
f (n) is divergent, too. 

The lower limit c of the integral is almost arbitrary but it must be chosen so that for c < x < oo the 
function f(x) should be monotone decreasing. 

Β The series (7.272) is divergent because 


etl seat 17% 
Fa) ==> / = de = [Ina =] — oo. (7.31) 


2 


7.2.3 Absolute and Conditional Convergence 


7.2.3.1 Definition 


Along with the series (7.12) whose terms can have different signs, one considers also the series 


[αι] t |a2| Pish [αμ] ο Y lanl, (7.32 


n=1 


whose terms are the absolute values of the terms of the original sequence (7.12). If the series (7.32 
is convergent, then the original one (7.12) is convergent, too. (This statement is valid also for series 
with complex terms.) In this case, the series (7.12) is called absolutely convergent. If the series (7.32 
is divergent, then the series (7.12) can be either divergent or convergent. In the second case, the series 
(7.12) is called conditionally convergent. 

WB A: The series a + =< ke] e ο. (7.33a 


where q is an arbitrarily constant number, is absolutely convergent, because the series of absolute values 


with the general term is convergent. This is obvious by comparing it with the geometric series 

(7.15): 

sinna 
2n 


Eo (7.33b) 


E B: The series 1 ο. (7.34) 
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is conditionally convergent, because it is convergent according to (7.36b), and the series made of the 


: : ; : Qd 
absolute values of the terms is the divergent harmonic series (7.16) whose general term is — = |an]. 
n 


7.2.3.2 Properties of Absolutely Convergent Series 


1. Exchange of Terms 

a) The terms of an absolutely convergent series can be exchanged with each other arbitrarily (even 
infinitely many of them) and the sum does not change. 

b) Exchanging an infinite number of terms of a conditionally convergent series can change the sum and 
even the convergence behavior. Theorem of Riemann: The terms of a conditionally convergent series 
can be rearranged so that the sum will be equal to any given value, even to +00. 

2. Addition and Subtraction 

Absolutely convergent series can be added and subtracted term-by-term; the result is absolutely con- 
vergent. 

3. Multiplication 

Multiplying a sum by a sum, the result is a sum of the products where every term of the first factor 
is multiplied by every term of the second one. These two-term products can be arranged in different 
ways. The most common way for this arrangement is made as if the series were power series, i.e.: 


(αι + dg as) + bo bb e) 
= abı + dab) + aybg + a3b1 + azb2 + ab ++ ++ + agb + an—1b2 d abs s. (735a) 
πο eS ee ————————Á 


If two series are absolutely convergent, then their product is absolutely convergent, so it has the same 
sum in any arrangement. If X` an = S, and X` bn = S; hold, then the sum of the product is 


S = 8,8, (T.35b) 


oo [5.5] 
If two series a4 + ag + +++ +4, +: — Y an and bi -- bo +--+ +b, +--+ = DY b, are convergent, and 
n=1 n=1 


at least one of them is absolutely convergent, then their product is also convergent, but not necessarily 
absolutely convergent. 


7.2.3.3 Alternating Series 
1. Leibniz Alternating Series Test (Theorem of Leibniz) 


For an alternating series 


a, — dg + a3 — cam, (7.36a) 


where an are positive numbers, a sufficient condition of convergence is if the following two relations 


hold: 

1, jim an =0 and 2. a, > @ >a3>°+:->a,>°°-. (7.36b) 
Β The series (7.34) is convergent because of this criterion. 
2. Estimation of the Remainder of an Alternating Series 


Considering the first n terms of an alternating series, then the remainder R,, has the same sign as the 
first omitted term an+, and the absolute value of R, is smaller than |a, |: 


sign R, —sign(a,41) with R,=S—S,, (7.37a) [S — | < Jas]. (7.37b) 


W Considering the series 


1 
l-s45-—[4+---4—F---=In2 (738a) the remainder is |In2—S,| < EZ (7.38b) 
rR n 
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7.2.4 Some Special Series 
7.2.4.1 The Values of Some Important Number Series 
ek ΝΗΡ. "p 
Tarte πι | à 
joi. d (=1) 1 

]! 2| 3 ων, 6 

1 1 1 ο”. 
1--+--—+-:: ον 

2°3 4 um" TO 

1 1.1 1 
ΙΕΞΕΣΕΣ!:ἘπΕ''.--2, 

2°4°8 2n 

1.1 1 (=D, 2 
ee ae t U t i 
gota T | [D T 

3.5 7 9 ' 2n-1 TA? 

1 1 1 1 

| eed teers ils 

1.2 2.3 3.4 n(n 4-1) 

1 1 1 1 "m 
1-3 3.5 5.7 (2n — 1)(2n + 1) 

1 1 1 1 " 3 
1-3 2.4 3.5 (n — 1)(n 4 1) EP 

1, 1 1, 1 " π 
3-5 T.9 11.13 °° (4n—1)(4n +1) 8! 

1 1 1 1 
1.2.3 2-3-4 n(n + 1)(n +2) ` 4) 

1. "T, 1 | 1 
1-2...1 918... (1 Ε 1) n... (n+l-=1) ` (—1)0-1) 
igigd ο B πλ 

92 31 42 ΠΝ 6) 

i 1 i 1 1 (-1)' E πλ 

2'5 42 ο. 19 
1 I 1 | 1 I L 1 | = T? 

PT 32 5  Qncr1?' 8) 
νο P. a 

21 35 44 "n 90 ' 

1 l 1 i (-1)"! Trt 

24 34 ^ mn 5 720 
1 | 1 | 1 | 1 1 | π' 

1 3 δ. Qn +14 | 96 
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1 1 1 1 π2κΟΡΚ--1 
T+ 22k | 32k t qc | nak [ο (2k)! By, (7.57 
í 1 : 1 1 i l (δα l qk (O21 = 1) (7 58 

Qu ge 42k om | (2k)! i 

1 1 1 1 mk (22k m 1) 
lb b e I mt By, 7.59 

gr T gm t m (2n — 1) ο.) ^" ( 

1 1 1 l (1j qeu 

I 32-1  52k+1 ΤΆΚΕΙ Td (2n — 1)21 Tege R" | (7.60 


7.2.4.2 Bernoulli and Euler Numbers 


1. First Definition of the Bernoulli Numbers The Bernoulli numbers Bẹ occur in the power 
series expansion of some special functions, e.g., in the trigonometric functions tan x, cot x and csc x, 
also in the hyperbolic functions tanh x, coth x, and cosech x. The Bernoulli numbers Bx can be defined 


as follows 

n qn 
B: ET j mit Be 4 
: (=1) (2n)! 


z T a? 
1 ΕΒ j 
et —1 2 s ? € 


Eee (|x| < 27) (7.61) 


and they can be calculated by the coefficient comparison method with respect to the powers of x. The 
first in this way calculated values are given in Table 7.1. 
Table 7.1 The first Bernoulli numbers 
k By k By k By k By 
1 1 7 174611 
1 Ξ 4 = = 1 
6 30 , 6 0 330 
1 5 3617 854513 
2 Ll = 11 
30 66 P 510 138 
1 691 43 867 
3 42 i 2730 " 798 


2. Second Definition of Bernoulli Numbers Some authors define the Bernoulli numbers in the 
following way: 


12 qn 


mU Mts 
e—1 di P Ps 2! pus Bon ji paet Cel < 27). (7.62) 

So one gets the recursive formula 
Bei=(B+1)"  (k=1,2,3,...), (7.63) 


where after the application of the binomial theorem (see 1.1.6.4, 1., p. 12) it is to replace B” by B,, 
i.e., the exponent becomes the index. The first few numbers are: 


1 1 μμ. 
H--- Bas, Boos Bao, 
i οἳ 098 emna "5 42 
_ 1 — 5 = 691 -= 7 
Bos “Ree Bas ο... 7.64 
is 30° UT 3:9 MG? Ue) 


* By are the Bernoulli numbers 
ΤΕΙ are the Euler numbers 
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By =-—=.,... 3 = B5 = B=- =0. 
16 510^" 3 5 0 
The following relation is valid: 
By,-(-1"Buy — (k-1,2,3,...). (7.65) 


3. First Definition of Euler Numbers The Euler numbers Ex occur in the power series expansion 
of some special functions, e.g., in the functions sec 2; and sech x. The EULER numbers E; can be defined 
as follows 


2 4 yen 


x x π 
secr = 1 E E breed En Opi Ee ({α| =) (7.66) 
and they can be calculated by coefficient comparison with respect to the powers of x. Their values are 
given in Table 7.2. 

4. Second Definition of Euler Numbers Analogously to (7.63) the Euler numbers can be defined 
with the recursive formula 


(E-1*F-(E-i1*-20 8 (k=1,2,3,...), (7.67) 


where after the application of the binomial theorem it is to replace E" by E,. For the first values one 
gets: 


Ey = 1, Ey = 5, Eg = 61, Es = 1385, 


Fig = —50521, Ei = 2702765, Ej = —199 360 981, (7.68) 
Eis = 19391512145,..., M=% == =0. 

The following relation is valid: 
Er =(-1)"Ex (k =1,2,3,...). (7.69) 


Table 7.2 First Euler numbers 


1 1 5 50521 
2 5 6 2702765 
3 61 ri 199 360 981 
4 1385 


5. Relation Between the Euler and Bernoulli Numbers The relation between the Euler and 
Bernoulli numbers is: 

ο qub y 4) 241 

Eo, = ——— | B- - kz152...3: 7.70 

ae πι i) e (770) 


7.2.5 Estimation ofthe Remainder 
7.2.5.1 Estimation with Majorant 


In order to determine how well the n-th partial sum approximates the sum of a convergent series, the 
absolute value of the remainder 


oo oo 
|[$-S,-2|R,|—2| ας M; lal (7.71) 
k=n+1 k=n+1 
oo oo 
of the series Y ^ a, must be estimated. For this estimation one uses a majorantfor Y^ |a,|, usually a 
k=l k=n+1 


geometric series or another series which is easy to sum or to estimate. 
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oo 


1 P $ ] . 0m41 
E Estimate the remainder of the series e = Ὁ —. For the ratio t of two subsequent terms of 
enl m 
m! 1 1 
j A mii m! . 
this series with m > n + 1 holds: == = = < = q < 1. So the remainder 


Am (m +1)! m+1 7 n+2 
1 1 1 


~ (n4 1! (n+ 2)! | (n+ 3)! FM 


1 
quotient q — 
n+ 


: can be majorized by the geometric series (7.15) with the 


1 
7 and with the initial term a = jn and it yields: 


(n 1)! 


a 1 m+2 1 n+2 1 
Rn < < ! . 7.12 
l1—q (n4+1)!n4+1 n!n2+2n nən! ES 


7.2.5.2 Alternating Convergent Series 


For a convergent alternating series, whose terms tend to zero with monotone decreasing absolute values, 
the easy estimation for the remainder is (see 7.2.3.3, 1., p. 463): 


|Ral = |S = Θα] < Jan+ıl : (7.73) 
7.2.5.3 Special Series a 


For some special series, e.g., Taylor series, there are special formulas to estimate the remainder (see 


7.3.3.3, p. 471). 
7.3 Function Series 
7.3.1 Definitions 
1. Function Series is a series whose terms are functions of the same variable x: 
Πα) + falx) 4 + fr(x) ἡ = η fii). (7.74) 


n=1 


2. Partial Sum 5S,(z) is the sum of the first n terms of the series (7.74): 


Sn(a) = fi(z) + fa(x) +++ + fal -Σλω (7.75 


3. Domain of Convergence of a function series (7.74) is the set of values of x = a for which al 
the functions f(x) are defined and the series of constant terms 


Sila) + fala) + faa) +00 = Y: fala) (7.76 


n=1 


is convergent, i.e., for which the limit of the partial sums S, (a) exists: 


Jim. S, (a = lim L5 fela (7.77 


4. The Sum ofthe Series (7.74) is the function S(x), and one says that the series converges to the 
function S(x). The values for x = a are the points of convergence. 

5. Remainder Πη(α) is the difference between the sum S(x) of a convergent function series and its 
partial sum S, (x): 


R,(«) = S(x) — S,(x) = fari (£) + faso(m) + + farma) +. (7.78) 
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7.3.2 Uniform Convergence 
7.3.2.1 Definition, Weierstrass Theorem 


According to the definition of the limit of a sequence of numbers (see 7.1.2, p. 458 and 7.2.1.1, 2., 
p. 459) the series (7.74) converges for all values x of a domain to S(x) if for an arbitrary ε > 0 there is 
an integer N(x) such that |S(x) — S,,(a)| < £ holds for every n > N(x). For function series there are 
to be distinguished between two cases: 
1. Uniformly Convergent Series 
If there is a number N such that for every x in the domain of convergence of the series (7.74), |S(x) — 
Sn(a)| < £ holds for every n > N, then the series is called uniformly convergent on this domain. 
2. Non-Uniform Convergence of Series 
If there is no such number N which holds for every value of x in the domain of convergence, i.e., there are 
such values of e for which there is at least one x in the domain of convergence such that |S(x)— S, (x)| > € 
holds for arbitrarily large values of n, then the series is non-uniformly convergent. 

. ror? gh R 
WB A: The series 14 T H ate | nl firs =e (7.79a) 
(see Table 21.5, p. 1059) is convergent for every value of x. The convergence is uniform for every 
bounded interval of x, and for every |x| < a using the remainder of the Maclaurin formula (see 7.3.3.3, 
2., p. 472) the inequality 


qnl 
(n 4- 1) 


is valid. Since (n + 1)! increases faster than a"** in n, the expression on the right-hand side of the 
inequality, which is independent of x, will be less than e for sufficiently large n. The series is not uni- 
formly convergent on the whole numerical axis: For any large n there will be a value of x such tha 


n+1 
ec 
! (n 4- 1) 


μα) — Sn()| = 


το" («θςτ) (7.79b 


n+l 


απ δει. . , 
πε is greater than a previously given Ε. 
MB: Theseries e+a(1—a2)+a(1l—a2)?+---+a(1-a)"+--, (7.80a 
is convergent for every x in [0, 1], because corresponding to the d’Alembert ratio test (see 7.2.2.2, p. 461 
p= lim a E |1 — z| < Lis valid for 0 < x € 1 (for x = 0 S = 0 holds). (7.80b 


The convergence is non-uniform, because 

S(z) — S,(z) = z[(1— zx)" + (1- z)"? e... = (1— x)" (7.80c 
is valid and for every n there is an x such that (1 — x)"*! is close enough to 1, i.e., it is not smaller than 
c. In the interval a € x € 1 with 0 < a < 1 the series is uniformly convergent. 
3. Weierstrass Criterion for Uniform Convergence 
The series (7.81a) is uniformly convergent in a given domain if there is a convergent series of constant 
positive terms (7.81b) such that for every x in this domain the inequality (7.81c) is valid. 


fila) + fala) ++ fam) (7.81a) Οι tegte tet: (7.81b) 


|fn(x)| Es (7.81c) 
(7.81b) is called a majorant of the series (7.81a). 
7.3.2.2 Properties of Uniformly Convergent Series 


1. Continuity 
Ifthe functions fı (x), (ο)... fa(x), ++ are continuous in a domain and the series f (x)+ falx)+ +++ 
falx) +: is uniformly convergent in this domain, then the sum S(x) is continuous in the same domain. 
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If the series is not uniformly convergent in a domain, then the sum S(x) may have discontinuities in 
this domain. 


ΒΑ: The sum of the series (7.80a) is discontinuous: S(x) = 0 for x = 0 and S(x) = 1 for x > 0. 
WB B: The sum of the series (7.79a) is a continuous function: The series is non-uniformly convergent 
on the whole numerical axis. But it is uniformly convergent in every finite interval. 


2. Integration and Differentiation of Uniformly Convergent Series 
In the domain [a, b] of uniform convergence it is allowed to integrate the series term-by-term. It is also 
allowed to differentiate term-by-term if the result is a uniformly convergent series. That is: 


Tx f(t) dt = D n dt for zo, € [a,b], (7.822) 


zo ^-1 στρ 


(X πίω) = p fila) for x € [a,b]. (7.82b) 
n=1 

7.3.3 Power series 

7.3.3.1 Definition, Convergence 

1. Definition 


'The most important function series are the power series of the form 


do 
ασ + a4z + aga? +--+ + αμα ee = Maa Or (7.83a) 
=0 
n: «Ὁ 
ao + ai (z — zo)  ax(z — a)” +++» + απ(α-- αο)"ἠ-:'-- Y asm 20)", (7.83b) 
n=0 


where the coefficients a; and the center of expansion 20 are constant numbers. 

2. Absolute Convergence and Radius of Convergence 

A power series is convergent either only for x = xo or for all values of x or there is a number r > 0, the 
radius of convergence, such that the series is absolutely convergent for |x — zo| < r and divergent for 
|x — zo| > r (Fig. 7.1). The radius of convergence can be calculated by the formulas 


or r— 1 (7.84) 


lim {ή|α 
n—oo nl 


απ 


r= lim 
n99 | Ant 


domain of convergence if the limits exist. If these limits do not exist, then one has to take 


΄ ` the limit superior (lim) instead of the usual limit (see [7.6], Vol. I). 


XE x Xr At the endpoints x = +r and x = —r for the series (7.83a) and 
0 0 0 : - 
xz = 20 +r and x = x — r for the series (7.83b) the series can be 
Figure 7.1 either convergent or divergent. 


3. Uniform Convergence 
A power series is uniformly convergent on every subinterval |x — ο] € ro < r of the domain of conver- 
gence (theorem of Abel). 


x [2 a 1 +1 

E For the series de ee He μα ds holds — = lim ^ =1, ie, r=1. (7.85) 
1 2 n r n30 n 

Consequently the series is absolutely convergent in —1 < x < +1, for xv = —1 it is conditionally 


convergent (see series (7.34), p. 462) and for x = 1 it is divergent (see the harmonic series (7.16), p. 459). 
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According to the theorem of Abel the series is uniformly convergent in every interval [—r1, +11], where 
rı is an arbitrary number between 0 and 1. 

7.3.3.2 Calculations with Power Series 

1. Sum and Product 


Convergent power series can be added, multiplied, and multiplied by a constant factor term-by-term 
inside of their common domain of convergence. The product of two power series is 


> ana) . (Σ han) aobo + (agb + a1bo)x + (αρα + αιδι + abo)? 
n=0 n=0 
+(aobg + aiba + agb, + agbo)a? +. (7.86) 
2. First Terms of Some Powers of Power Series: 
S = a 4- bx 4- cx? 4- da? ex^ + f+, (7.87) 
S? = a? + 2abx + (D + 2ac)x” + 2(ad + be)x* + (c? + 2ae + 2bd)x* 
+2(af 1- be + cd)a? +--+, (7.88) 
| i 1 1b le 1DPY, 14 lbe LN a 
VS = 5 =a? [ "a" (5: 1π)» i (3 4a | ea)? 
le lbd l 3 be B B 4 
| (35 4a? 8a? 16a? 138 a) τ ον; (7-89) 
1 1 1b 30) Τελ , 36c ld 50DYN, 
ο =a |1- ο ο ο = ΝΡ; 
JS a | 2a. (15 aE (is 2a 168^ 
3bd 3c le 15be 35 UN , 
+ E a? 8a 2a 16a 138 τ) SN η (7:90) 
1 E E b b ἐν Ὁ [2c d PY. 
S j - h a e α) "ilg a a)” 
20d c æ Pc bt\ 4 
+ (5 oa 3-3 + τ) r4 | (a z: 0), (7.91) 
1 9 E ; p cia, bc d απ. 
Ez S αμ 277 | ὃ 2s z^4 6 3 2, 48 2 
bd c € bc bt) a 
| (s + ὃ E ἘΞ + 2 25 ἠ-»: | (a#0). (7.92) 
3. Quotient of Two Power Series 
5 αμα" m 
"c ao l + ax + aot“ re αρ ; 2 
14 βι)α + (a; a + B Ba) 
Saas o TEBE τη στ ο ..-".π; 
ία (a3 — aal, — a1 82 — Bs — Bj? + αιβι; + 28, 85)a? +--+] (bo #0). (7.93) 


One gets this formula by considering the quotient (7.93) as a series with unknown coefficients, and after 
multiplying by the numerator the unknown coefficients follow by coefficient comparison. 
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4. Inverse of a Power Series 
If the series 


y = f(x) = ax + bx? + ex? + drt +ea° + fs +- (a 0) (7.94a) 
is given, then its inverse is the series 
x = p(y) = Ay + By? + Cy? + Dyt + Ey? + Fy +.. (7.94b) 
Taking powers of y and comparing the coefficients yields 
1 b 1 i 1 , : 
A . B ay € -(20 — ac), D = —(babc — a?d — 5b*), 
a ay a? a‘ 
1 o "m , , 
E- s (6a7bd + 3a? + 14b* — abe — 21ab?c), (7.94c) 
αἱ 


F a (Ta? be + Ta?cd + 84ab?c — a* f — 28a? Ud — 28a?bc? — 420°). 
a 
The convergence of the inverse series must be checked in every case individually. 


7.3.3.3 Taylor Series Expansion, Maclaurin Series 


There is a collection of power series expansions of the most important elementary functions in Ta- 
ble 21.5, p. 1057. Usually, one gets them by Taylor expansion. 

1. Taylor Series of Functions of One Variable 

If a function f (x) has all derivatives at x = a, then it can often be represented with the Taylor formula 
as a power series (see 6.1.4.5, p. 443). 

a) First Form of the Representation: 


fO (a) ντι (7.95a) 


f(x) = f(a) 4 fa) i (z a f" (a) 4- «4 


This representation (7.95a) is correct only for the x values for which the remainder R, = f(x) — S, 
tends to zero if n — oo. This notion of the remainder is not identical to the notion of the remainder 
given in 7.3.1, p. 467 in general, only in the case if the expressions (7.95b) can be used. 

There are the following formulas for the remainder: 


po gy 
fr arene) (a<€<a or ©<€<a) (Lagrange formula), (7.95b) 
n ! 
R= E (a — t)^ fOD (t) dt (Integral formula). (7.95c) 
n! 
b) Second Form of the Representation: 
hi h^. h” qs) 
f(a-- h) = f(a) + j^ 9 | af (a) 9-4 d (a) T. (7.96a) 
The expressions for the remainder are: 
h^ 
= ™ (a+ Oh 1 96] 
R mei (a+ Oh) (0 « O « 1), (7.96b) 


Ra = — [(h — t)^ FY (a + t) dt. (7.96c) 
n: 
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2. Maclaurin Series 
The power series expansion of a function f(x) is called a Maclaurin series if it is a special case of the 
Taylor series with a = 0. It has the form 


fG) = £00) + ο ο. S HMO) 4 (7.974) 
with the remainder a 

Γι - hen (0<@<1), (7.97b) R,— πῇς -η"/γπθ()αι.. (7.97c) 
The convergence of the Taylor series and Maclaurin series can be proven either by examining the re- 


mainder R, or determining the radius of convergence (see 7.3.3.1, p. 469). In the second case it can 
happen that although the series is convergent, the sum S(x) is not equal to f(a). This holds for instance 
for the function f(x) = exp(—4) for x # 0, and f(0) = 0. The terms of its Maclaurin series are all 
identically zero, e.g., S(x) = 0 # f(x). 

7.3.4 Approximation Formulas 


Considering only a neighborhood small enough of the center of the expansion, one can introduce ratio- 
nal approximation formulas for several functions with the help of the Taylor expansion. The first few 
terms of some functions are shown in Table 7.3. The data about accuracy are given by estimating the 
remainder. Further possibilities for approximate representation of functions, e.g., by interpolation and 
fitting polynomials or spline functions, can be found in 19.6, p. 982 and 19.7, p. 996. Important series 
expansions see Table21.5, p. 1057. 

7.3.5 Asymptotic Power Series 


Even divergent series can be useful for calculation of function values. Some asymptotic power series 


; 1 : : 
with respect to — are considered next to calculate the values of functions for large values of |x]. 
T 


7.3.5.1 Asymptotic Behavior 


Two functions f(a) and g(x), defined for rg < x < oo, are called asymptotically equal for x — oo if 


=1 (7.98a) or f(x) = g(x)+o(g(x)) for x» oo (7.98b) 


hold. Here, o(g(x)) is the Landau symbol “little o" (see 2.1.4.9, p. 57). If (7.98a) or (7.98b) is fulfilled, 
then one writes also f(x) ~ g(x). 

MA: /i?-c-1-z. MB: enl. ag ο X 
41955153 4r? 
7.3.5.2 Asymptotic Power Series 


1. Notion of Asymptotic Series 
dy 


A series 577: — is called an asymptotic power series of the function f(x) defined for x > xo if 
g” 


f(z) = Y = +0 e (7.99) 


v=0 
1 
holds for every n = 0,1,2,.... Here, O (zx) is the Landau symbol *big O". For (7.99) one also 
z 
writes f(a) ~ = 
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Table 7.3 Approximation formulas for some frequently used functions 


Tolerance interval for x with an error of 
Approximate Next 0.1% 1% 10% 
formula term from to from to from to 
"vm a? | —0.077 | 0.077 | —0.245 | 0.245 | —0.786 | 0.786 
bti 6 —4.4? 4.45 —14.0° | 1409 | —45.0° | 4500 
mou gi n α | —0.580 | 0.580 | —1.005 | 1.005 | —1.632 | 1.632 
Hu 120 | —33.2° 33.95 —57.6° 57.6° —93.5? 93.5? 
x | —0.045 | 0.045 | —0.141 | 0.141 | —0.415 | 0.415 
cosg πε 1 =— . 
2 —2.6° 2.6° —8.1° 8.19 —25.89 | 95.85 
ai? z^ | —0.386 | 0.386 | —0.662 | 0.662 | —1.036 | 1.036 
cosx 1— — du j am ; 
2 24 | —22.1° | 221° | —37.9° | 37.9° | —59.3° | 59.3 
a? | —0.054 | 0.054 | —0.172 | 0.172 | —0.5817 | 0.517 
tantr 54 2: + ͵ 
3 —3.1* 3.19 -9.85 9.85 -9965 | 929.00 
mem ge NN —0.293 | 0.203 | —0.519 | 0.519 | —0.895 | 0.895 
PE 15 | -16.8° | 16.8° | —29.7° | 29.7° | —51.3° | 51.35 
K. pe. 
Vata rat eR E —0.085a? | 0.093a? | —0.247a? | 0.328a? | —0.607a2 | 1.545a? 
a a` 
1 e ate 
=-|a 
2 a 
L i | 432" | 9.05102 | 0.05202 | 0.15702 | 0.1660? | -0.488αΣ | 0.53002 
Veta ~ A η Τ Bab Uola T 40 = 9/4 i aA =j a ub a 
1 1 g αφ 
x--2 “ [-0.031a |0.031a | —0.099a |0.099a | —0.301a | 0.301a 
atr a a a 
ΓΣ xl+r 15 —0.045 |0.045 | -0.134 |0.148 | —0.375 | 0.502 
dnd 
In(1-- a) e x E —0.002 | 0.002 | —0.020 [0.020 | —0.176 | 0.230 


2. Properties of Asymptotic Power Series 

a) Uniqueness: If for a function f (x) the asymptotic power series exists, then it is unique, but a func- 
tion is not uniquely determined by an asymptotic power series. 

b) ip ae Convergence is not required for an asymptotic power series. 


m 
E "as is an asymptotic series, which is convergent for every x with |x| > xo (xo > 0). 


oo ρ-αί 
E B: The integral f(x) = | E d (x > 0) is convergent for r > 0 and repeated partial inte- 
c 


1+t 
: δ . 1 i 2 3 n — 1)! 
gration results in the representation f(x) "ime ! = um + ( jet = ) + R(x) 
pom | NT n! 


. n! re ο ae 
with R,(r) = (—1)" a i TENET dt. Because of |R,(x)| € mf edt = Qui One gets 
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1 
R(x) =O (za) and with this estimation 
oo v! 


oo e t +s ! 
/ ats CUZ (7.100) 


v=0 


The asymptotic power series (7.100) is divergent for every x, because the absolute value of the quotient 
of the (n + 1)-th and of the n-th terms has the value 7 —. However, this divergent series can be used for 


a reasonably good approximation of f(x). For ihsSuos for x = 10 with the partial sums (10) and 
4-10 


S5(10) the estimation 0.09152 < [ Uu z dt < 0.09164 holds. 
“0 


7.4 Fourier Series 
7.4.1 Trigonometric Sum and Fourier Series 
7.4.1.1 Basic Notions 


1. Fourier Representation of Periodic Functions 
Often it is necessary or useful to represent a given periodic function f(x) with period T exactly or 
approximatively by a sum of trigonometric functions of the form 
a 
δα) = E + a, cos uz + a» cos 2wz + +++ + a, cos nuc 
+b, sin wg + bo sin 2wz +---+ b, sinnwz. (7.101) 
m ‘ , . 2π 
This is called the Fourier expansion. Here the frequency is w = Τ᾽ In the case T = 27 holds w = 1. 


One can get the best approximation of f(x) in the sense given on 7.4.1.2, p. 475 by an approximation 


function s, (x), where the coefficients a; and by (k = 0,1,2,...,n) are the Fourier coefficients of the 
given function. They are determined with the Euler formulas 
2 T zo+T 9 Th 2 
n. [1 cos ἴω: dx = = r f fe cos kwa dx = — ra (x) + f(—x)] cos kwx dz, (7.102a) 
and 
ο oT o T 
=F no sin kws dx = 7 | He η) sin kwa dx = = τῇ! f(x) —x)|sinkwxdx, (Τ.1025) 
0 


or approximatively with the help of methods of harmonic analysis (see 19.6.4, p. 992). 

2. Fourier Series 

If there is a system of x values such that the sequence of functions s,,(a) tends to a limit s(x) for n — oo, 
then for the given function exists a convergent Fourier series for these x values. This can be written in 
the form 


ao 
s(x) = * + a4 cos wa + à» cos wg + +++ + an cos mug d ss 


+b; sin wx + bg sin 2wa +-+- + b, sinnwa +: (7.103a) 
and also in the form 
s(x) = 2 + Αι sin(wx + φι) + Ag sin(2wa + p2) +--+ + As sin(nwx + qu) 4-77, (7.103b) 


where in the second case: 


Ap = Van? + ὑκ'. tangy— B . (7.103c) 
k 
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3. Complex Representation of the Fourier Series 
In many cases the complex form is very useful: 


+00 
Y eem, (7.1042) 
k-—oo 
TN fork — 0, 
17 1 
EE / ο. da = 5 (tx ib) fork > 0, (7.104b) 


1 
Pi +ib) fork«0. 


7.4.1.2 Most Important Properties of the Fourier Series 


1. Least Mean Squares Error of a Function 


2π 
If a function f(x) over the interval [0, T] (τ = =) is approximated by a trigonometric sum 
ω 


(η) = R + ap cos ζωα + x bj sin kwz, (7.105a) 


also called the Fourier sum, then the mean square error (see 19.6.2.1, p. 984, and 19.6.4.1, 2., p. 993) 


r- stre — sy (x)]? dx (7.105b) 


is smallest if one defines a; and by as the Fourier coefficients (7.102a,b) of the given function. 
2. Convergence of a Function in the Mean, Parseval Equation 


: : ' 2m... : "NE 
The Fourier series converges over the interval [0, T] (T = —) in mean to the given function, i.e., 
ω 


[ρω —s,(t)P2de +0 for n—oo (7.106a) 


holds, if the function is bounded and in the interval 0 < x < T it is piecewise continuous. A consequence 
of the convergence in the mean is the Parseval equation: 


T oo 
2 fiot as = 2+ p aj? + bè). (7.106b) 
0 k=1 


3. Dirichlet Conditions 

If the function f(x) satisfies the Dirichlet conditions, i.e., 

a) the interval of definition can be decomposed into a finite number of intervals where the function f(x) 
is continuous and monotone, and 

b) at every point of discontinuity of f(x) the values f(a +0) and f(x — 0) are defined, 

then the Fourier series of this function is convergent. At the points where f(a) is continuous the sum 
is equal to f(a), at the points of discontinuity the sum is equal to E 

4. Asymptotic Behavior of the Fourier Coefficients 

If a periodic function f(x) and its derivatives up to k-th order are continuous, then for n — oo both 
expressions αρ ΤΊ and b,n**! tend to zero. 
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f(x) f(x) 


Figure 7.2 Figure 7.3 Figure 7.4 
7.4.2 Determination of Coefficients for Symmetric Functions 
7.4.2.1 Different Kinds of Symmetries 


1. Symmetry of the First Kind 
If the periodic function f(x) with the period T is an even function, i.e., if f(x) = f(—x) (Fig. 7.2), 
then its Fourier coefficients are 
4 TP 2 
πα 
m=z / Λα) cok dz, be=0 — (k-012,.... (7.107) 


2. Symmetry of the Second Kind 
If the periodic function f(a) with the period T is an odd function, i.e., if f(x) = —f(—z) (Fig. 7.3), 
then its Fourier coefficients are 
4; 9 
ak=0, bk= 5 l f(z)snk de  (k=0,1,2,...). (7.108) 
T 4 T 


3. Symmetry of the Third Kind 
If for a periodic function with the period T holds f(x + 7/2) = — f(x) (Fig. Τ.Α), then the Fourier 
coefficients are 


T/2 
4 20x 
dni = m / f(x) cos(2k + 1) dx, do, = 0, (7.109a 
0 
ATP 3πο 
bia y / f(a) sin(2k + 1) = dz, b%=0 (k=0,1,2,...). (7.109b 
0 


4. Symmetry of the Fourth Kind 
If the periodic function f(x) with the period T is odd and also the symmetry of the third kind is satisfiec 
(Fig. 7.5a), then the Fourier coefficients are 


T/A 
d cdam bki = B / f(x) sin(2k + τμ (k =0,1,2,...). (7.110 
0 


If the function f(a) is even and also the symmetry of the third kind is satisfied (Fig.7.5b), then the 
Fourier coefficients are 


T/A 
/ 2nx 


bk =a = 0, ap 7 / Λία) cos(2k + = de (k= 0,1,2,-..): (7.111) 


0 
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Figure 7.6 | Figure 7.7 


7.4.2.2 Forms of the Expansion into a Fourier Series 


Every function f(x), satisfying the Dirichlet conditions in an interval 0 € x < l (see 7.4.1.2, 3., p. 475), 
can be expanded in this interval into a convergent series of the following forms: 


Qrx πα 2πο 
1 A(e)= P+ αἱ cos 77. + aa cos +--+ + an cosn i e 
Qra 2nr Brr 
b, sin T + by sin 2 T fees +b, sinn = Been (7.112a) 


The period of the function fi(v) is T = l; in the interval 0 < x < I the function fi(x) coincides 
with the function f(x) at the points of continuity (Fig. 7.6). At the points of discontinuity fi(x) = 


1 
5 [f(x — 0) + f(x 4- 0)] holds. The coefficients of the expansion are determined with the Euler formulas 


(7.102a,b) for w = xs 


2. falx) E + αι COS 7 I az cos 277 +-+. I a, Cos n= Peers (7.112b) 


The period of the function f;(v) is T = 21; in the interval 0 € x < l the function f(x) has a symmetry 
of the first kind and it coincides with the function f(x) (Fig. 7.7). The coefficients of the expansion 
of fa(x) are determined by the formulas for the case of symmetry of the first kind with T = 31. 


3. fs(x) = bisin T + by sin27* fee 4 b, sinn = Een (7.112c) 


The period of the function f3(x) is T = 21; in the interval 0 < x < l the function f(x) has a symmetry 
of the second kind and it coincides with the function f(x) (Fig. 7.8). The coefficients of the expansion 
are determined by the formulas for the case of symmetry of the second kind with T = 24. 


7.4.3 Determination ofthe Fourier Coefficients with Numerical 
Methods 


If the periodic function f(x) is a complicated one or in the interval 0 € x < T its values are known 


kT 
only for a discrete system x, = N with k = 0,1,2,..., N — 1, then the Fourier coefficients have to be 


/ 
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f(x) 
y 
l rH O 
0 X O0 TT TX 
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Figure 7.8 Figure 7.9 


approximated. Furthermore, e.g., also the number of measurements N can be a very large number. In 
these cases one uses the methods of numerical harmonic analysis (see 19.6.4, p. 992). 


7.4.4 Fourier Series and Fourier Integrals 


1. Fourier integral 

If the function f(x) satisfies the Dirichlet conditions (see 7.4.1.2, 3., p. 475) in an arbitrarily finite 
+00 

interval and, moreover, the integral / | f (x)| da is convergent (see 8.2.3.2, 1., p. 507), then the following 
τοο 


formula holds (FOURIER integral): 


1 +00 +00 1 oo +00 
f(a) = / ci^? dw Jj foe“ dt = — nz / f(t) cos w(t — x) dt. (7.113a) 
T π 
—oo -οο 0 —oo 
At the points of discontinuity can be used the substitution 
1 
f(x) = aui — 0) + f(x 4- 0)]. (7.113b) 


2. Limiting Case of a Non-Periodic Function 

The formula (7.113a) can be regarded as the expansion of a non-periodic function f(2) into a trigono- 
metric series in the interval (—l, +1) for | — oo. 
With Fourier series expansion a periodic function with period T is represented as the sum of harmonic 


vibrations with frequency wp = num with n = 1,2,... and with amplitude Αμ. This representation is 


based on a discrete frequency spectrum. 
With the Fourier integral the non-periodic function f(x) is represented as the sum of infinitely many 
harmonic vibrations with continuously varying frequency w. The Fourier integral gives an expansion 
of the function f(x) into a continuous frequency spectrum. Here the frequency w corresponds to the 
density g(w) of the spectrum: 


+00 
1 —iwt e 
gw) = x | f(t)e^* dt. (7.1130) 
The Fourier integral has a simpler form if f(a) is either a) even or b) odd: 
2 oo oo 
a) f(x)-— [ en à» f F(t cos wt dt, (7.114a) 
π 
Ὁ 0 


b) f(x) = MET dw f f(t) sinwt dt. (7.114b) 


0 
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E The density of the spectrum of the even function f(x) = e~!*! and the representation of this function 
are 
9 


ο 9 
g(w) — — [ e * cosut dt = — 
π Jo π 


2 [ο cosur 
[ τω. (Τ1151) 
0 


8 ole] — 
TE (7.115a) and e = zh urri 


7.4.5 Remarks on the Table of Some Fourier Expansions 


In Table 21.6, p. 1062 there are given the Fourier expansions of some simple functions, which are 
defined in a certain interval and then they are periodically extended. The shapes of the curves of the 
expanded functions are graphically represented. 

1. Application of Coordinate Transformations 

Many of the simplest periodic functions can be reduced to a function represented in Table 21.6 either 
by changing the scale (unit of measure) of the coordinate axis or by translation of the origin. 


W A function f(x) = f(—x) defined by the relations 


T 

2 for0 <z < n 

B 0 f j <a< 2 
ο «v « — 

4 2 
(Fig. 7.9), can be transformed into the form 5 given in Table 21.6, by substituting a — 1 and in- 
2πα 
troducing the new variables Y = y — 1 and X = di + a By the substitution of the variables in 


T 2 
2 ya 2 T 
series 5, because sin(2n 4- 1) ( τ 5) = (—1)" cos(2n + DT one gets for the function (7.116a) 


(7.1162) 


T 2 


the expression 


4 3πι 1 2s: 1 2πα 
y=1+ (cos TT e Dil us + = cos ei MN ) . (7.116b) 
T T 5 T 


2. Using the Series Expansion of Complex Functions 

Many of the formulas given in Table 21.6 for the expansion of functions into trigonometric series can 
be derived from power series expansion of functions of a complex variable. 

W The expansion of the function 


i =1+z+2 + (z| < 1) (7.117) 
yields for 
z= ae? (7.118) 


after separating the real and imaginary parts 


l--acosq +a’ cos2p +-+- +a" cosny +++ = ee 
i Don DN ~ 1-2acosp-- a?’ 

; 2. € asin 

3 sin2p ἠ-::: sin e = —— —— for Ja] < 1. 7.119 
asiny γα sin2g t --- a^ sinng + Έτη” r |a] ( ) 


8 Integral Calculus 


1. Integral Calculus and Indefinite Integrals Integration represents the inverse operation of 
differentiation in the following sense: While differentiation calculates the derivative function f'(x) of a 
given function f (x), integration determines a function whose derivative f'(x) is previously given. This 
process does not have a unique result, so it leads to the notion of an indefinite integral. 

2. Definite Integral By starting with the graphical problem of the integral calculus, determining 
the area between the curve of y = f(x) and the x-axis, and for this purpose approximating it with a 
set of thin rectangles (Fig. 8.1), the notion of the definite integral can be introduced. 

3. Connection Between Definite and Indefinite Integrals Therelation between these two types 
of integral is the fundamental theorem of calculus (sce 8.2.1.2, 1., p. 495). 


8.1 Indefinite Integrals 


8.1.1 Primitive Function or Antiderivative 


1. Definition 
Consider a function y = f(x) given on an interval [a,b]. F(x) is called a primitive function or an- 
tiderivative of f (x) if F(x) is differentiable everywhere on fa, b] and its derivative is f(x): 

F'(x) = f(x). (8.1) 
Because under differentiation of F(x) + C (C const) the additive constant disappears, a function has 
infinitely many primitive functions, if it has any. The difference of two primitive function is a constant. 


So, the graphs of all primitive functions F; (x), F2(x),..., Fa(x) can be got by parallel translation of a 
particular primitive function in the direction of the ordinate axis (Fig. 8.2). 
y: 


Ol a=Xy XXX Ëk Xk Χκμ Xu Xn=b x 
Figure 8.1 

Εις 

E09 

F(x) 


<< 


xY 


Figure 8.2 Figure 8.3 


2. Existence 

Every function continuous on an interval [α, b] has a primitive function on this interval. If there are 
some discontinuities, then one decomposes the interval into subintervals in which the original function 
is continuous (Fig. 8.3). The given function y — f(x) is in the upper part of the figure; the function 
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y = F(x) in the lower part is a primitive function of it on the considered intervals. 
8.1.1.1 Indefinite Integrals 
The indefinite integral of a given function f(x) is the set of primitive functions 


F(a) +C= nuo dz. (8.2) 


The function f(x) under the integral sign / is called the integrand, x is the integration variable, and C 


is the integration constant. It is also a usual notation, especially in physics, to put the differential dx 
right after the integral sign and so before the function f(x). 


Table 8.1 Basic Integrals 


Powers Exponential Functions 
qnl 
je dx = S (ασε —1) Je dx = e* 
[€ = ln |z| fe dx = É (a2 0,a 7 1) 
Trigonometric Functions Hyperbolic Functions 
/ sing dz = —cosr / sinh z dz = cosh x 
/ cos «dx = sin x ή cosh x dx = sinh x 
[am xdx = — ln | cos x| E dx = In| cosh z| 
[ zd = ln |sin z| | coth xdg = In|sinha| 
da =tanz J da = tanh x 
J cos? x J cosh“ x 
dix = —cotz a dr |... cothz 
J sin’ x J sinh x 
Fractional Rational Functions Irrational Functions 
dx... I artani » 
gm arctan 7 (|z| <a,a> 0) 
dx  lA.- r 1 a+r dx — Avena 2 — . v 3 
PET qArtanh 5 = σᾳ In | m z| | 73 EET Arsinh 5 = In | Lyra 
(|x| < a, a > 0) (a > 0) 
a = 1 Arcoth T = 2 In E F 2 J- a> Arcosh 7 =In E Tyr?— | 
(|z| > a, a > 0) (|z| > a, a > 0) 


8.1.1.2 Integrals of Elementary Functions 


1. Basic Integrals 

The integration of elementary functions in analytic form is reduced to a sequence of basic integrals. 
These basic integrals can be got from the derivatives of well-known elementary functions, since indefi- 
nite integration means the determination of a primitive function F(x) of the function f (x). 

The collection of integrals given in Table 8.1 comes from reversing the differentiation formulas in Ta- 
ble 6.1, p. 434 (Derivatives of elementary functions). The integration constant C is omitted. 
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2. General Case 

For the solution of integration problems, one tries to reduce the given integral by algebraic and trigono- 
metric transformations, or by using the integration rules to basic integrals. The integration methods 
given in section 8.1.2 make it possible in many cases to integrate those functions which have an el- 
ementary primitive function. The results of some integrations are collected in Table 21.7, p. 1065 
(Indefinite integrals). The following remarks are very useful in integration: 


a) The integration constant is mostly omitted. Exceptions are some integrals, which in different forms 
can be represented with different arbitrary constants. 

b) Ifin the primitive function there is an expression containing In f (x), then one has to consider always 
In | f(a)| instead of it. 

c) If the primitive function is given by a power series, then the function cannot be integrated in an 
elementary fashion. 

Α wide collection of integrals and their solutions are given in [8.1] and [8.2]. 


8.1.2 Rules of Integration 

The integral of an integrand of arbitrary elementary functions is not usually an elementary function. 
In some special cases it is possible to use some tricks, and by practice one can gain some knowledge of 
how to integrate. Today the calculation of integrals is mostly left to computers. 

The most important rules of integration, which are finally discussed here, are collected in Table 8.2. 
1. Integrand with a Constant Factor 

A constant factor o in the integrand can be factored out in front of the integral sign (constant multiple 
rule): 


a f(x) dz = a f f) dx. (8.3) 


2. Integration of aSum or Difference 
The integral of a sum or difference can be reduced to the integration of the separate terms if one can 
tell their integrals separately (sum rule): 


"c +v- ω) απ = [uae + fe dx — /ω dx. (8.4) 


The variables u, v, w are functions of x. 
5 


a [a++ 1) dx = f (ο + 6x? + 102? + 62 +9) dx = - 


s Da 


3 ; 
ub t3 + 32? -- 9x 4- C. 


3. Transformation of the Integrand 
The integration of a complicated integrand can sometimes be reduced to a simpler integral by algebraic 
or trigonometric transformations. 


1 
E [sin 2r cos x dx = J jin 3a + sin x) dz. 
4. Linear Transformation in the Argument 


If f f(x)dx = F(x) is known, e.g., from an integral table, then one gets: 
1 
/ Λας) ἀσ = -F(az)--C, (8.52) i f(z 4-b)dz = F(x --5) 4- C, (8.5b) 
a 


[ fec αλλα Lotes (8.5c) 


a 


1 a 1. us 
W A: [sin ax dx = ——cosax+C. HB: e dy = —e* ΙΟ, 
: a : [n 
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BC: ie = arctan(r + a) + C. 
l4 (r4 a? 


Table 8.2 Important Rules of Calculation of Indefinite Integrals 


Rule Formula for Integration 

Integration constant / f(a) du = F(x)+C (C const) 
Integration and Hii OE phe 

differentiation Pe dro f(z) 


Constant multiple rule / af (a) dx =a / f(a) dx (α const) 
Sum rule ο υ(α)] dz = fe (x) dx + EO dx 


Partial integration | uae) dx = u(r)v(r) — | υ(ϑυ(α) dx 


x=u(t) or {--υ(α); 
u und v are inverse functions of each other : 


Substitution rule no ) da 2i (u(t))u'(t) dt or 


f (u(t)) 
EO dr = v(u(t)) dt 


Ila 
f ο dr = ln|f(x)| +C (logarithmic integration) 
f(x) 


Special form of 
the integrand 


2. [n z)f(z)dr = ο) + 
u und v are inverse functions of each other : 


Integration of the [ «o dx = ru(x)— F(u(x)) -- C with 
inverse function s 
F(x) = "EO απο (Ci, C2 const) 


5. Power and Logarithmic Integration 


a) If the integrand has the form of a fraction such that the numerator is the derivative of the denomi- 
nator, then the integral is the logarithm of the absolute value of the denominator: 


/ πο de= XS = In|f(z)| 4 C. (8.6) 


πλι/ ος Ἐς z de = m |a? +30 — 5| +C. 


b) Ifthe ... is a product of a power of a function multiplied by the derivative of the function, 
and the power is not equal to —1, then holds 


μμ. 70 


2x+3 1 
B: Ix = EC. 
" le (x? + δα — δ)' a (-2)(z? + 8α — 5)? G 


+C (aconst, a 4-1). (8.7) 
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6. Substitution Method 
If « = u(t) where t = v(x) is the inverse function of x = u(t), then according to the chain rule of 
differentiation follows 


/ f(x) dx = J f(u(t)]u'(t) dt or EO dr = / 


F 
"( 


U 


u(t)) 
(5j 4^ (8.8) 


u 


εὐ --1 dx 1 
W A: / Τ dx. Substituting x = Int, (t > 0), — = T then taking the decomposition into 


eu dt 
partial fractions gives: 
εἴ —]1 t—1dt 2 1 
t= = -]αἱ--2]η(ο 1-1} --α ; 
[E dab frc Pea i) Mcd 
; dæ : It x dx di 1 ; 
B B: / a = . Substituting 1 +2? = t, = = 2x, then one gets. E - itum 5 al + x?) 4 


C. 


7. Partial Integration 
Reversing the rule for the differentiation of a product gives 


[rewa dx = u(x)v(x) — io u(x) dz, (8.9) 
where u(x) and v(x) have continuous derivatives. 


E The integral I xe” dx can be calculated by partial integration choosing u = x and v’ = e*. Then 


u = 1 and v =e”, therefore fre dx = xe" Je dx = (x — l1)e* + C. 


8. Non-Elementary Integrals 

Integrals of elementary functions are not always elementary functions. These integrals are calculated 
mostly in the following three ways, where the primitive function will be approximated by a given accu- 
racy: 

1. Table of Values The integrals which have a particular theoretical or practical importance but 
cannot be expressed by elementary functions can be given by a table of values. (Of course, the table 
lists values of one particular primitive function.) Such special functions usually have special names. 
Examples are: 

W A: Integral Logarithm (see 8.2.5, 3., p. 513): 


[s a (8.10 


Jo mt 
E D: Elliptic integral of the first kind (see 8.1.4.3, p. 490): 
sin ϱ dax 
li : = F(k, ). (8.11 
Jo 


(1 — 22)(1— k?x?) 
W C: Error function (see 8.2.5, 5., p. 514): 


ar (8.12 
——]|e = erf(x). : 
vm Jo 
2. Integration by Series Expansion Using the series expansion of the integrand, and if it is uni- 
formly convergent, then it can be integrated term-by-term. 


BA: / m dx, (see also Sine integral p. 513). 


T 


oT 
E B: [5 dx, (see also Exponential integral p. 514). 
Jom 
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3. Graphical integration is the third approximation method, which is discussed in 8.2.1.4, 5., 
p. 499. 
8.1.3 Integration of Rational Functions 
Integrals of rational functions can always be expressed by elementary functions. 
8.1.3.1 Integrals of Integer Rational Functions (Polynomials) 
Integrals of integer rational functions are calculated directly by term-by-term integration: 

"n αμα" + ay 4a" 1 +++» E aya + ag) dx 

p, 


än- αι ς 
= vp $+ gh pee η ο + aox + C. (8.13) 
n n 


8.1.3.2 Integrals of Fractional Rational Functions 
P(x) 


The integrand of an integral of a fractional rational function i Ja dx, where P(x) and Q(x) are 
polynomials with degree m and n, respectively, can be transformed algebraically into a form which is 
easy to integrate. There are the following steps: 

1. Simplifying the fraction by the greatest common divisor, so P(x) and Q(x) have no common factor. 
2. Separating the integer and rational parts of the expression. If m > n , then P(x) is divided by Q(x). 
Then a polynomial and a proper fraction should be integrated. 

3. Decomposing the denominator Q(x) into linear and quadratic factors (see 1.6.3.2, p. 44): 


Q(x) = αμία — α) (x — B) --- (a? + px +q) (a2 + ρα 4- q') --- (8.14a) 
p p? 
with gis% uo 4*9 : (8.14b) 


4. Factoring out the constant coefficient αμ in front of the integral sign. 


5. Decomposing the fraction into a sum of partial fractions: The proper fraction which is obtained 
after the division, cannot be further simplified and its denominator is decomposed into a product of 
irreducible factors. Then it can be decomposed into a sum of partial fractions (see 1.1.7.3, p. 15), each 
of them is easy to integrate. 


8.1.3.3 Four Cases of Partial Fraction Decomposition 


1. Case: All Roots of the Denominator are Real and Single 


Q(x) = (x — a)(a — B)---(x— A) (8.15a) 
a) Forming the decomposition: 

P(x) A, B i -- 

Q(x) r-a z-  z—A (8.15b) 

. P(a) P(B) P(X) 
with A= , = —...,L= ; 8.15c 
aay ο Qj Pe 

b) The numbers A, B, C,..., L can also be calculated by the method of undetermined coefficients (see 


1.1.7.3, 4., p. 17). 
c) Integration by the formula 


/ AER Aln |x — αἰ. (8.15d) 


C= ea 
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a= [Cre . 2z 4-3 A B C a= 20 -( 2x +3 ) 
J a+a2—2e° αα- Όα κ) cv r—-1 z42' Q'(0) 32? + 29 — 2/ 420 
^ n Jy 
u 3 Β-{ 21 3 ) -. e Ed ) u A 
2 3324-27 2/1 3 3a? + 2x — 2/1 ο 6 
3 5 1 ; 5/3 
zoo: «ἡ 3 1,5 M Cle - 05 
I /( + E | i) ae = ginlel 3In|z 1| απ aga 


2. Case: All Roots of the Denominator are Real, Some of them with a Higher Mul- 
tiplicity 


Q(x) = (x — α) (x — By"... (8.164) 
a) Forming the decomposition: 
P(x) Αι ; A» i ' Αι 
Ql) (z-a) (α-- α) | τα — a)! 
Bı By j Bm i 
| (c8 + (s — BP fees ZI (8.16b) 


(x — B)m i 


b) Calculation of the constants Αι, Α»,..., Ai, Bi, Bo,..., Bm,... by the method of undetermined co- 
efficients (see 1.1.7.3, 4., p. 17). 


c) Integration by the rule 


A dx A; dx Ax 
= Ailn [z — αἰ, / - k > 1). 1 
ic -α ideo o (x — a)* (k — 1)(x — o)*-t Weg (8-166) 
23 od Ἷ ! 
| NES / τ L uc x a B | B 4 B -- Bs ~. The method of undetermined 
a(x — 1) z(r—1? xv a«-1 (#-1) (x-1) 


coefficients yields A+ Bj = 1, —3A — 2B; + By = 0, 3A + Bı — B2 + B; = 0, -A = 1; A 1, Bi 
2, By = 1, B3 = 2. The result of the integration is 


1 2 1 2 
I /| 2:013 DG 5 dr = —ln |z| + 21n | — 1| sel e. C 
(x — 1)? 
T 


GIP i 


In 


3. Case: Some Roots of the Denominator are Single Complex 


Suppose all coefficients of the denominator Q(x) are real. Then, with a single complex root of Q(x) its 
conjugate complex number is a root too and one can compose them into a quadratic polynomial. 


Q(z) = (x — α) (x — B)" ... (a? + px - q)(a? «ρα. q)... (8.17a) 
T p pr, i 
with 449 gbee (8.17b) 


because the quadratic polynomials have no real zeros. 
a) Forming the decomposition: 
P(x) Αι | As eae Άι | B B E Bm 
Q(x) α--α (a — a)? ; > — By 
σα -ε D Ex ΕΕ 
ME κο ον (8.1το) 
w+prt+q αερα -α 
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b) Calculation of the constants by the method of undetermined coefficients (see 1.1.7.3, 4., p. 17). 


Cr+ D 
c) Integration of the expression = τ by the formula 
xt +pr+q 
Cx+D)dr | C , D — Cp/2 2 
= = jae = In |x? + pz 4- q| 4 Ῥ/ adan T2 (8.17d) 
a + px τα q—p?/4 q—p?/4 
4dx 4 _A ‘g + D 
| Ei f - = : - t Ga τ . The method of undetermined coefficients yields the 
w+ 4x PH r 22 +4 


equations A + C = 0, D = 0, 4A = 4, A=1, C 1, D=0. 


1 T à 
quom / E =) dx = ln [v| pinta? + 4)+In|C\| = In 


the term arctan is missing. 


Cız 
Vr? +4 


, where in this particular case 


4. Case: Some Roots of the Denominator are Complex with a Higher Multiplicity 


Q(x) = (z — ay (x — 8) ... (a? + px - qg)" (x? +p'etq)”.... (8.184) 
a) Forming the decomposition: 
P(x) A, A» TE En oB B T B, 
Ql) x—o (z-a? "(r—o) z—B (α-- BP ' (x — B) 
QCr-D| — CouctD) , | Cnt + Din 
z +pr+q (αἳ +pr +q)? ` E (x? + px + q)™ 
„Peth p Pith op Lu M (8.180) 


e+pet¢ j (a2 + pla 4 q} (a? + pla + q)” 
b) Calculation of the constants by the method of undetermined coefficients. 


σπα + Din 


c) Integration of the expression for m > 1 in the following steps: 


(+ peta)" 
α) Transformation of the numerator into the form 
Om C 
Cmt + Dm = 5 (2x +p) + (Pn — zP) : (8.18c) 


B) Decomposition of the integrand into the sum of two summands, where the first one can be integrated 
directly: 
Cin (2x + p) da: Cm 1 
2 (x? + pr +q)” 2(m — 1) (92 + pr + qn * 
η) The second one will be integrated by the following recursion formula, not considering its coefficient: 


(8.18d) 


dx z+ p/2 
(2 +prtqg™ 3(m-1)(q-1i PAK (232 F pz + q)" 
, 2m — dx (8.180) 
' 2(m — 1)(q— πο (x? + pa + q)m-i* -- 
arc Qa? + 2x +13 "T 2x? + 2r + 13 A Ciz t Di , Cox Do 
(α-2ω +12 " (x-2) (2+1?  xr—2  a?-1 ` (a24-1)^ 


The method of undetermined coefficients results in the following system of equations: 
A4 Cj; =0, -2Οι1-Ρι = 0, 2A C41 —2D4 +C = 2, —2C14-D4 205 4- D2 = 2, A-2D1 - 2D; = 13; 
the coefficients are A — 1, Ci 1, Di 2, C5 3, D» 4, 


1 21-32 3r 54 
1 =j - - da. 
J \a-2 2311 (αἱ -- 12 
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i dx £ 
According to (8.180) / TES ip 7 a m D 2 η. ma ΠΕ; 41) 
3—-4r 1 1 (x — 2)? 


2-1) 2^ SI 


1 
| 7 arctan x follows, and 


finally the result is / = 4arctan z + C. 


Table 8.3 Substitutions for Integration of Irrational Functions I 


Integral * Substitution 


where r is the lowest common multiple 
of the numbers m,n,.... 


[Re R(: ax? + br + Vax? + bx 4 c) da: One of the three Euler substitutions: 
1. αν. vaz’ + bx +c=t— yar 
2. Force > 0 Vax? + bx +e=at+ c 


3. If the the polynomial ax? + bx + c 
has different real roots: 
ax? + br + c = a(z — o)(x — f) ax? + bz - c — t(x — a) 


* The symbol R denotes a rational function of the expressions in parentheses. 
The numbers n, m, . .. are integers. 
t Ifa < 0, and the polynomial az? + ba + c has complex roots, then the integrand is not defined 
for any value of x, since Vax? + bx + c is imaginary for every real value of x. In this case the in- 
tegral is meaningless. 


8.1.4 Integration of Irrational Functions 


8.1.4.1 Substitution to Reduce to Integration of Rational Functions 


Irrational functions cannot always be integrated in an elementary way. Table 21.7, p. 1065 contains 
a wide collection of integrals of irrational functions. In the simplest cases one can introduce substitu- 
tions, as in Table 8.3, such that the integral can be reduced to an integral of a rational function. 


The integral | R (x, Vax? + ba + c) dx can be reduced to one of the following three forms 


nc Vx? + a?) da, (8.19a) [Re Va? — a?) da, (8.19b) 


nc Va? — x?) dx, (8.19c) 


because the quadratic polynomial ax? + ba + c can always be written as the sum or as the difference of 
two complete squares. Then, one can use the substitutions given in Table 8.4. 


i 1 F iy? ; 183 
B A: 42?+1624 τα (αἱ F 4x -- 44 7) -4 exse (5) | =4 [a+ (5) ο. =24+2. 
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2 τ 2 5 a 
E B: 3z +1 =z? Fist i (e$) (2) =x (2) with y= 245. 
B C: - +2¢ = 1 — r? + 2r — 1 = 1? — (x — 1)? = P — r? with zx, = x — 1. 
Tabelle 8.4 Substitutions for Integration of Irrational Functions II 
| Integral Substitution 
nic va? + a?) dx w=asinht or x=atant 
n e Vu m da: x=acosht or cz-—asect 
[R(x να =z?) dx r-—asint or r-acost 
8.1.4.2 Integration of Binomial Integrands 
An expression of the form 
x" (a + bz")? (8.20) 


is called a binomial integrand, where a and b are arbitrary real numbers, and m, n, p are arbitrary pos- 


itive or negative rational numbers. The theorem of 


EC + ba")? dx 


Chebyshev tells that the integral 
(8.21) 


can be expressed by elementary functions only in the following three cases: 


Case 1: If pis an integer, then the expression (a + bx")? can be expanded by the binomial theorem, so 


the integrand after eliminating the parentheses wil 
to integrate. 


Case 2: If ΠῚ 


is an integer, then the integral 


n 
function by substituting t = να + br", where r ist 


be a sum of terms in the form cz^, which are easy 


8.21) can be reduced to the integral of a rational 
1e denominator of the fraction p. 


1 (8.21) can be reduced to the integral of a rational 


m+1 i : Ρ 
Case 3: If + pis an integer, then the integra 
. "E „|a T ox" . 
function by substituting t = πο Where r is 
2 


ΒΑ: peta = ja? (14: αι) 


? de: m= 


he denominator of the fraction p. 


Len 
n 


= 2, (Case 2): 


Substitution t = 91 4- Yz, x = (t? — 1)*, dz = 12? (t 


p 
1)3dt; ji A 13) dt 


= 3 caps —7)+C. 
a? dx n Jei 1 m-l1 4 m-l 13 
a = E q3Y71/4. 0 } = = j : 
B B: / VIG r"(l-4 c) τ m=3, n=3, p 1 os ΓΕΝΝ 12 


Because none of the three conditions is fulfilled, the 


integral is not an elementary function. 
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8.1.4.3 Elliptic Integrals 


1. Indefinite Elliptic Integrals 
Elliptic integrals are integrals of the form 


[Re 


Usually they cannot be expressed by elementary functions; if it is still possible, the integral is called 
pseudoelliptic. The name of this type of integral originates from the fact that the first application of 
them was to calculate the perimeter of the ellipse (see 8.2.2.2, 2., p. 502). The inverses of elliptic inte- 
grals are the elliptic functions (see 14.6.1, p. 762). Integrals of the types (8.22), which are not integrable 
in elementary terms, can be reduced by a sequence of transformations into elementary functions and 
integrals of the following three types (see [21.1], [21.2], [21.7]): 


(= (0<k <1), (8-23a) [= a M 


?)ü — 150) γα - #)(1 - κ) 


(0 « k « 1). (8.23c) 


ax? + ba? + cx + e) dz, [R (2; NT + ba + cx? + ex + f) dx. (8.22) 


0<k<1), (823b) 


/ dt 
(1 + nt?),/(1 — t2)(1 — Κ213) 


Concerning the parameter n in (8.23c) one has to distinguish certain cases (see [14.1]). 


T 
By the substitution t = sing (0 << 5) the integrals (8.23a,b,c) can be transformed into the 


Legendre form: 
d 
Elliptic Integral of the First Kind: / — ; (8.24a) 
à 1— k? sin? y 
Elliptic Integral of the Second Kind: / V1 — K? sin? o dọ. (8.24b) 
lo 
Elliptic Integral of the Third Kind: / i = =] (8.24c) 
(1 + n sin? e) /1— k? sin? o 


2. Definite Elliptic Integrals 
Definite integrals with zero as the lower bound corresponding to the indefinite elliptic integrals are de- 
noted by 


e e 
"m z 

je = F(k, ), (8.255) / 1 — k?sin? y dy = E(k, o), (8.25b) 

ὁ V1— sin? y ‘ 

7 di 

J 2 =II(n,k,y) (for all three integrals 0 < k < 1 holds). (8.25c) 


4 (19 nsin? )1— k? sin? i 


T 
These integrals are called incomplete elliptic integrals of the first, second, and third kind. For o = s 


the first two integrals are called complete elliptic integrals, and they are denoted by 


K=F (v. z)- j MET " (8.262) 
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E 


E- E (k, JI y1- ke sin? y dy. (8.26b) 


Tables 21.9.1, 2, 3 contain the values for incomplete and complete elliptic integrals of the first and 
second kind F, E and also K and E. The series expansions of K and E see 8.2.5, 7. p. 515. 


W The calculation of the perimeter of the ellipse leads to a complete elliptic integral of the second kind 
as a function of the numerical eccentricity e (see 8.2.2.2, 2., p. 502). Fora = 1.5, b = 1 it follows that 
e = 0.74. Since e = k = 0.74 holds, one gets from Table 21.9.3, p. 1103: sina = 0.74, i.e., a z 48° 


and E(k, 5) = E(0.74) = 1.3238. It follows that U = 4aE(0.74) © 4aE(a = 48°) = 4-1.3238a ~ 7.94. 


Numerical integration yields the better approximation 7.932 711. 


8.1.5 Integration of Trigonometric Functions 


8.1.5.1 Substitution 
With the substitution 


; 2 dt 2t 1-9 
t = tan - ie, dr = 1 E . Sinr— IFE’ coss — 4 FE’ (8.27) 
an integral of the form 
| Gin as cosa) dx (8.28) 


can be transformed into an integral of a rational function, where R denotes a rational function of its 
arguments. 


2t 2 
. l4 
l--sinr ( 2 $ 1 P 
./ + sin 1 τὰν = f 5 142/148 di i E24 z) dt =—+t4 sill | 


1 
sin x(1 + cosa 1-8 2. 4 
σι 1 ; 
1+? 1+? 
yi 
tan“ — $ 1 r 
2 + tan — 4 5 In|tan 5 + C. In some special cases more simple substitutions can be applied. 


4 2 
If the integrand in (8.28) contains only odd powers of the functions sin x and cos x, then by the substi- 
tution t = tan a rational function can be obtained in a simpler way. 


8.1.5.2 Simplified Methods 


Case 1: n (sin x) cos x dx. Substitution t= sing, cosrdx = dt. (8.29 
Case 2: [5 (cos x) sin x dz. Substitution t= cosr, sinadx = —dt. (8.30 
Case 3: n gdr: (8.31a 


a) n = 2m + 1, odd: 
[so zdr = /α — cos? x)” sina dx = — /α — ελ) dt with t= cosx. (8.31b 


b) n = 2m, even: 


1 m ul 
/ sin" z dx = i [5a — cos 20)| dx = πη /α — cost)" dt with t= 22. (8.31c 
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The power is halved in this way. After removing the parentheses in (1 — cost)" follows integration 
term-by-term. 


Case 4: n v da. (8.32a 
a) n = 2m + 1, odd: 
/ ων" zdr = fa — sin? x)" cosa dx = fa — 8)" dt with t -—sinz. (8.32b 
b) n — 2m, even: 
1 m 1 
fos xdr = / [δα + cos 2: dx = m /α + cost)” dt with t= 2z. (8.32c 


'The power is halved in this way. After removing the parentheses follows integration term-by-term. 


Case 5: ή sin" x cos" z dx. (8.33a 
a) One of the numbers m or n is odd: Reducing it to the cases 1 or 2. 
ΒΑ: foe x cos? x dr = e x (1 — sin? x)? cosa dz = fea —)dt+C with t-sinz. 


sina dt 

ms: / FC with t= cosg. 
Ven" yt 

b) The numbers m and n are both even: Reducing it to the cases 3 or 4 by halving the powers using 

the trigonometric formulas 


sin 2x , 1 — cos 2g ; 1 5 24; 
sin z cosa = 27 , sin? r = EAT cos? z = lH ; (8.337) 
2 2 2 
, , ; 1 j 1 : 
E fs x cos! x dx = [in x cos x) cos? α dx = 8 [so 2x(1 + cos 2x) dr = ^ foe 2x cos 2x dx + 
1 1 , 1 1 
— /α — cos 4x) dx sin? 2a + —r — — sin4x +C. 
16 48 16 64 


Case 6: | tan” zde = rare 7? (sec? z — 1) dr = n 7? y (tana) dx — [ x dx 


tan’! g , 
= y tan"? y da. (8.344) 
^n-1l. 
By repeating this process one decreases the power and depending on whether n is even or odd one finally 
gets the integral 


fæ =g or f x dx = — ln | cos x| (8.34b) 
respectively. 
Case 7: / cot” zdz. (8.35) 


The solution is similar to case 6. 
Remark: Table 21.7, p. 1065 contains several integrals with trigonometric functions. 


8.1.6 Integration of Further Transcendental Functions 
8.1.6.1 Integrals with Exponential Functions 


Integrals with exponential functions can be reduced to integrals of rational functions if it is given in the 
form 


fR (Em ε΄”... ie”) da, (8.36a) 
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where m,n,...,p are rational numbers. Necessary are two substitutions to calculate the integral: 
1. Substitution of t = e” results in an integral 


1 
/; R(t^,t^,...,)dt. (8.36b) 


2. Substitution of z = Vt, where r is the lowest common multiple of the denominators of the fractions 
m,n,...,p, results in an integral of a rational function. 

8.1.6.2 Integrals with Hyperbolic Functions 

Integrals with hyperbolic functions, i.e., containing the functions sinh x, cosh x, tanh x and coth xv in 
the integrand, can be calculated as integrals with exponential functions, if the hyperbolic functions 


are replaced by the corresponding exponential functions. The most often occurring cases / sinh” z dz , 


y cosh" x dx, I sinh" x cosh” z dx can be integrated in a similar way to the trigonometric functions 
(see 8.1.5, . p. 491). 
8.1.6.3 Application of Integration by Parts 


If the integrand is a logarithm, inverse trigonometric function, inverse hyperbolic function or a product 
of x" with Inz, e"*, sinax or cosax or their inverses, then the solution can be got by a single or 
repeated integration by parts. 

In some cases the repeated partial integration results in an integral of the same type as the original 
integral. In this case one has to solve an algebraic equation with respect to this expression. One can 
calculate in this way, e.g., the integrals f e°" cos bx dx, I e°” sin ba dx, where one needs integration by 
parts twice. The same type of function should be chosen for the factor u in both steps, either the 
exponential or the trigonometric function. 


One also uses integration by parts if there are integrals in the forms / P (x)e^* dx, | P (x) sin bx dx 


and / P (x) cos bx dx, where P (x) is a polynomial. (Choosing u = P (x) the degree of the polynomial 
will be decreased at every step.) 


8.1.6.4 Integrals of Transcendental Functions 
The Table 21.7, p. 1065, contains many integrals of transcendental functions. 


8.2 Definite Integrals 


8.2.1 Basic Notions, Rules and Theorems 
8.2.1.1 Definition and Existence of the Definite Integral 


1. Definition of the Definite Integral 

The definite integral of a bounded function y = f (x) defined on a finite closed interval [a, b] 18 a number, 
which is defined as a limit of a sum, where either a < b can hold (case A) or a > b can hold (case B). 
In a generalization of the notion of the definite integral (see 8.2.3, p. 506) in the following sections also 
functions are considered, which are defined on an arbitrary connected domain of the real line, e.g., on 
an open or half-open interval, on a half-axis or on the whole numerical axis, or on a domain which is 
only piecewise connected, i.e., everywhere, except finitely many points. These types of integrals belong 
to improper integrals (see 8.2.3, 1., p. 506). 

2. Definite Integral as the Limit of a Sum 

The notion of the definite integral is obtained by the limit given in the following procedure (see Fig. 8.1, 
p. 480): 
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1. Step: The interval [α, ὑ] is decomposed into n subintervals by the choice of n — 1 arbitrary points 


14,22,..., 24, 80 that one of the following cases occurs: 
a = To < T1 < Ta L'e xdi SU L Tn =b (case A) or (8.37a) 
αξταρσραπι a a S S e S > ao (case B). (8.37b) 


2. Step: A point €; is chosen in the inside or on the boundary of each subinterval as in Fig. 8.4: 


U1<& <a; (incase A) or Tia >& >a; (incase B). (8.37c) 

Ax, Ax, AX; Δ) 

a Eras 

a-Xx [x] X x Xa 

$5 δι 
Ax, AX); 
5 Ie l 
box χι U«So [xa χ 


Figure 8.4 


3. Step: The value f (£;) of the function f (x) at the chosen point is multiplied by the corresponding 
difference Az; = x; — 2-1, i-e., by the length of the subinterval taken with a positive sign in case A 
and taken with negative sign in case B. This step is represented in Fig. 8.1, p. 480 for the case A. 

4. Step: Then all the n products f (£;) Av; ι are added. 


5. Step: The limit of the obtained integral approximation sum or Riemann sum 


Y F (&) Avia (8.38) 


i=1 
is calculated if the length of each subinterval Az; 1 tends to zero and consequently their number n 
tends to oo. Based on this, one can also denote Az; as an infinitesimal quantity. 


If this limit exists independently of the choice of the numbers x; and £;, then it is called the definite 
Riemann integral of the considered function on the given interval. One writes 


b n 
[Fo de= lim ^f (&) Asia. (8.39) 
a pon i=1 


The endpoints of the interval are called limits of integration and the interval [a, b] is the integration 
interval; a is the lower limit, b is the upper limit of integration; x is called the integration variable and 
f(x) is called the integrand. 


3. Existence of the Definite Integral 


The definite integral of a continuous function on [a,b] is always defined, i.e., the limit (8.39) always 
exists and is independent of the choice of the numbers x; and €;. Also for a bounded function having 
only a finite number of discontinuities on the interval [a, b] the definite integral exists. The function 
whose definite integral exists on a given interval is called an integrable function on this interval. 


8.2.1.2 Properties of Definite Integrals 


'The most important properties of definite integrals explained in the following section are presented in 
Table 8.5, p. 496. 
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1. Fundamental Theorem of Integral Calculus 


If the integrand f(x) is continuous on the interval fa, b], and F (x) is a primitive function, then 


/ f (z)dz = I F'(z)dz = F (4) = F (b) — F (a) 


(8.40) 


holds, i.e., the calculation of a definite integral is reduced to the calculation of the corresponding indef- 


inite integral, to the determination of the antiderivative: 


2 [Fo dz +C. 


(8.41) 


Remark: There are integrable functions which do not have any primitive function, but one will see 


that, if a function is continuous, it has a primitive function. 
2. Geometric Interpretation and Rule of Signs 


1. Area under a Curve Let f (x) > 0 for all x in [a, b]. Then the sum (8.38) can be considered as the 
total area of the rectangles (Fig. 8.1), p. 480, which approximate the area under the curve y = f (x). 


Therefore the limit of this sum and together with it the defini 


e integral is equal to the area of the region 


A, which is bounded by the curve y = f (x), the x-axis, and the parallel lines x = a and x = b: 


b 
A= or (a « b and f(x) 20 for a € x € b). 


(8.42) 


2. Sign Rule If a function y = f(x) is piecewise positive or negative in the integration interval 


(Fig. 8.5), then the integrals over the corresponding subin 


servals, that is, the area parts, have pos- 


itive or negative values, so the integration over the total interval yields the sum of signed areas. 


In Fig. 8.5a—d four cases are represented with the different 


possibilities of the sign of the area. 


id ja B Bg 


f(x)>0, a<b f(x)>0, a>b f(x)«0, a<b f(x)«0, a>b 


a) b) c) 


Figure 8.5 


d) 


T 
ΒΑ: [ sinzdx (read: Integral from x = 0 to x = x) = (— cosa|§ = (— coss + cos0) = 
Jo 


2π 
B B: [ sinrzdr (read: Integral from x = 0 to x = 27) = 
Jo 


3. Variable Upper Limit 
1. Particular Integral If the upper limit is considered as 
there is an area function in the form 


= [ro dt (a<b and f(a) 2 0 for «> a). 


This integral is called a particular integral. 


(— cos |2* = (— cos2m + cos0) = 0. 


variable (Fig. 8.6, region ABC D), then 


(8.43) 


To avoid accidently confusing the variable upper limit « with the variable of the integrand, often the 


integration variable is denoted by t instead of x as in (8.43). 
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y=f(x) 
M 
7 SQ) f(x) 
7, 
> Z, N, 
x 0 | x 
Figure 8.6 Figure 8.7 Figure 8.8 


2. Differentiation of the Definite Integral with Respect to the Upper Limit A definite integral 
q 
with a variable upper limit / f(t) dt, if this integral exists, is a continuous function F(x) of the upper 
Ja 


limit. If f(x) is continuous, then F(x) is differentiable with respect to x, i.e., it is a primitive function 
of the integrand. So, if f(x) is continuous on [a,b], and x € (a,b) holds 


F'(x) = f(x) or x f(t) dt = f(x). (8.44) 


The geometrical meaning of this theorem is that the derivative of the variable area S (x) is equal to the 
length of the segment N M (Fig. 8.7). Here, the area, just as the length of the segment, is considered 
according to the sign rule (Fig. 8.5). 


4. Decomposition of the Integration Interval 


The interval of integration [a, b] can be decomposed into subintervals. The value of the definite integral 
over the complete interval is 


fro dx = jro απ + fro dx. (8.45) 


This is called the interval rule. If the integrand has finitely many jumps, then the interval can be 
decomposed into subintervals such that on these subintervals the integrand will already be continuous. 
Then the integral can be calculated according to the above formula as the sum of the integrals on the 
subintervals. 

At the endpoints of the subintervals the function must be defined by its corresponding left or right-sided 
limit, if it exists. If it does not, then the integral is an improper integral (see 8.2.3.3, 1., p. 509). 
Remark: The formula above is valid also in the case if c is outside of the interval [a, b] if one can suppose 
that the integrals on the right-hand side exist. 


8.2.1.3 Further Theorems about the Limits of Integration 


1. Independence of the Notation of the Integration Variable 


The value of a definite integral is independent of the notation of the integration variable: 


b b 


no dz = f ftu) a= [19 dt. (8.46) 


a a 
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Table 8.5 Important Properties of Definite Integrals 


Property Formula 
b 
Fundamental theorem of [fo dr = F(x) ὅν... F(b) -- F(a) with 
the integral calculus AC a 
(f(x) is continuous) F(z) = | κο dz + C or F'(z) = f(x) 
b a 
Interchange rule i f(a) da = -- / f(x) dx 
a b 
Equal integration limits / f(a) dx = 0 
b c b 
Interval rule [κο dz = [19 απ + [fo dx 
Independence of the notati f f f 
ndependence of the notation ITE T 
of the integration variable | f(z) dz = EO du = / F(t) at 
Differentiation with respect to d f : : 
the upper limit F ae / f(t) dt = f(x) with f(x) continuous 
b 
Mean value theorem 
of the integral calculus / f(x)dx = (b-a) f(E) (a<& <b) 


2. Equal Integration Limits 


If the lower and upper limits are equal, then the value of the integral is equal to zero: 
a 
no dax = 0. (8.47) 
a 


3. Interchange of the Integration Limits 
After interchanging the limits, the integral changes the sign (interchange rule): 


b a 
/ f(x) de -- -- / f(a) de. (8.48 
a b 
4. Mean Value Theorem and Mean Value 
1. Mean Value Theorem Ifa function f(x) is continuous on the interval fa, b], then there is at leas 


one value € in this interval such that in the case A with a < € < b and in the case B with a > € > b (see 
8.2.1.1, 2., p. 494) 


/ f(a) dx = (b — a)f(£) (8.49 


is valid. 


The geometric meaning of this theorem is that between the points a and b there exists at least one poin 
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€ such that the area of the figure ABC D is equal to the area of the rectangle AB'C'D in Fig. 8.8. The 
value 


b 
m= pul fo dz (8.50) 


is called the mean value or the arithmetic average of the function f (x) in the interval [α, b] . 


2. Generalized Mean Value Theorem If the functions f(x) and (x) are continuous on the closed 
interval [α, b], and y(x) does not change its sign in this interval, then there exists at least one point £ 
such that 

b 


[ feto dx = FE | e) dx (a « & « b) (8.51) 
is valid. 


5. Estimation of the Definite Integral 

The value of a definite integral lies between the values of the products of 
the infimum m and the supremum M of the function on the interval [a, 0] 
multiplied by the length of the interval: 


m(b — a) € nio dr < M(b—a) (a«b,f(x) 2 0). (8.52) 


If f is continuous, then m is the minimum and M is the maximum of the 
function. It is easy to recognize the geometrical interpretation of this the- 
orem in Fig. 8.9. Figure 8.9 


8.2.1.4 Evaluation of the Definite Integral 


1. Principal Method 

The principal method of calculating a definite integral is based on the fundamental theorem of integral 
calculus, i.e., the calculation of the indefinite integral (see 8.2.1.2, 1., p. 495), e.g., using Table 21.7 p. 
1065. Before substituting the limits it must be ckecked if there is an improper integral. 

Nowadays computer algebra systems can be used to determine analytically the indefinite and definite 
integrals (see Chapter 20). 

2. Transformation of Definite Integrals 

In many cases, definite integrals can be calculated by appropriate transformations, with the help of the 
substitution method or partial integration. 


a 
W A: Using the substitution method for J = [ va? — a? dx. 
20 


First one substitutes: x = y(t) = asint, t = y(x) = arcsin B ψ(0) -- 0, φ(α) = 
a 
z 


a arcsinl , . ; i 21 
Then J = / va? — r? dy = / αγ! — sin? t cost dt = e cos? t dt — è f (1 + cos 2t) dt. 
0 a 0 0 


ap | 33 


re 


resin 0 2 
With the further substitution t = (z) a z = W(t) = 2t, v(0) = 0, W9) π the value 
8... ανν ae 2 y2 2 
I= = tà + Tf cos z dz “ ! " sin z|o = z is obtained. 


1 1 
W B: Method of partial integration: / ze dx = [πες / e dr =e—(e—-1)=1. 
0 0 
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3. Method for Calculation of More Difficult Integrals 
If the determination of an indefinite integral is too difficult and complicated, or it is not possible to 
express it in terms of elementary functions, then there are still some further ideas to determine the value 
of the integral in several cases. Here the integration of functions with complex variables is mentioned 
(see the examples on p. 754-757) or the theorem about the differentiation of an integral with respect 
to a parameter (see 8.2.4, p. 512): 

b 


df. s [Oft 
"ECL dz = Ja e (8.53) 


Ig ; 1gt— 1 
B: - [ dx. Introducing the parameter t: F(t) = [ 
Jo lng Jo lng 


1 ð [gt— 1 tln 1 1 
Using (8.53) for F(t): τ [ 2 E Ἴ dz [ ving x [ abd Εμμ]. 1 
Jo Jo 


dt Jo Ot} lng lng EE o t+l1° 


Integration: F(t) — F(0) [ Er 
Jo T+1 
4. Integration by Series Expansion 
If the integrand f(x) can be expanded into a uniformly convergent series 
f(x) = gala) + pala) + +++ + enm) t 
in the integration interval [a, b], then the integral can be written in the form 


nio dx = f eas f ext) det: f pala) dz. 


In this way the definite integral can be represented as a convergent numerical series: 


In(r + 1)|5 = In(t + 1). Result: J = F(1) = 1n2. 


b b b b 
no dx = fal) dx + oO ο... no dx. (8.56) 
When the functions p(x) are easy to integrate, if, e.g., f(x) can be expanded in a power series, which 


b 
is uniformly convergent in the interval [a, b], then the integral f f (x) dx can be calculated to arbitrary 
a 
accuracy. 


2 2 


1/2 2 , 
Β Calculate the integral J = [ e U dr with an accuracy of 0.0001. The series e^ = 1 — T + 
Jo ! 


q4 46 8 
or — ST + AT —.---is uniformly convergent in any finite interval according to the Abel theorem (see 
` ` 9 T? xt a 18 
7.3.3.1, p. 469), so a ο πο aiy t gg) holds. With this result it 
1/2 2 
: = πας. 1 1 1 . 1 NS 
follows that 1 = [ [2 dx Σα 31.3 + στο 31.7 πε ) 


= 1 (1 " H Ts "m t πεσε U ) Το achieve the accuracy 0.0001 for the calculation of the 


integral it is enough to consider the first four terms, according to the theorem of Leibniz about alter- 
nating series (sce 7.2.3.3, 1., p. 463): 


yb 
~ 4(1 — 0.08333 + 0.00625 — 0.00037) = 4 - 0.92255 = 0.46127, T e^? dx = 0.4613. 
0 


5. Graphical Integration 
Graphical integration is a graphical method to integrate a function y — f(x) which is given by a curve 


b 
AB (Fig. 8.10), i.e., to calculate graphically the integral / f(x) dx, the area of the region Mj) ABN: 
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1. The interval ΜΙΝ is divided by the points 
11/2, 01, 13/2, 02, - - Zn αν Tn—1/2 into 2n equal parts, where the 
result is more accurate if there are more points of division. 

2. At the points of division 11/5, 33/5, ... πι 1 one draws ver- 
tical lines intersecting the curve. The ordinate values of the seg- 
ments are denoted on the y-axis by OA1, O4», ..., OAn. 

3. A segment OP of arbitrary length is placed on the negative 
a-axis, and P is connected with the points Αι, A», .. ., Αμ. 

4. Through the point Mo a line segment is drawn parallel to P A, 
to the intersection point with the line x = 2; this is the segment 
MoM. Through the point M; the segment Mı M^» is drawn paral- 
lel to P. As to the intersection with the line x = πο. etc., until the 
last point M, is reached with the abscissa £n. 


p The integral is numerically equal to the product of the length of 
Χν, ν, κ; OP and the length of N Mn: 
b 
Figure 8.10 [κο dx OP. ΝΜ. (8.57) 


a 


By asuitable choice of the arbitrary segment OP the extent of the graph can be influenced; the smaller 
a κ. b -... 
the graph is wanted, the longer the segment OP should be chosen . {OP = 1, then / f(a) dz = NM), 
Ja 
and the broken line Mo, Mi, Mo,..., Mn represents approximately the graph of a primitive function of 


f (x), i.e., one of the functions given by the indefinite integral / f(a) da. 


6. Planimeter and Integraph 

A planimeter is a tool to find the area bounded by a closed plane curve, thus also to compute a definite 

integral of a function y = f(x) given by the curve. Special types of planimeter can evaluate not only 

fy dx, but also [9 dx and [9 dz. 

j ` : T 

An integraph is a device which can be used to draw the graph of a primitive function Y = / f(t) dt if 
Ja 

the graph of a function y = f(x) is given (see [19.30]). 

7. Numerical Integration 

If the integrand of a definite integral is too complicated, or the corresponding indefinite integral cannot 

be expressed by elementary functions, or the values of the function are known only at discrete points, 

e.g., from a table of values, then the so-called quadrature formulas or other methods of numerical math- 

ematics are used (see 19.3.1, p. 963). 


8.2.2 Applications of Definite Integrals 
8.2.2.1 General Principle for Applications of the Definite Integral 


1. The quantity A to be determined is decomposed into a large number of very small quantities, i.e., 
into infinitesimal quantities: 

A= a +a +: + as. (8.58) 
2. Every one of these infinitesimal quantities a; is replaced by a quantity à;, which differs only very 
slightly in value from a;, but which can be integrated by known formulas. Here the error a; = a; — à; 
should be an infinitesimal quantity of higher order than a; and à;. 


3. Representation of à; by a variable x and a function f(x) so that à; has the form f(x;) Az;. 
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4. Evaluation of the desired quantity as the limit of the sum 


n n b 
a jm oa - ΠΣ Καλδα; = / f(z) dz, (8.59) 


where Ax; > 0 holds for every i. The lower and upper limit for x is denoted by a and b. 

W Evaluating the volume V of a pyramid with base area S and height H (Fig. 8.11a-c): 

a) Decomposition of the required volume V by plane sections into frustums (Fig. 8.11a): V = vı + 
vdd Un. 
b) Replacing every frustum by a prism, whose volume is δι, with the same height and with a base area 
of the top base of the frustum (Fig. 8.11b). The difference of their volumes is an infinitesimal quantity 
of higher order than v;. 


c) Representation of the volume 9; in the form 0; = S; Ah; , 


d) Calculation of the limit of the sum 


Λ where h; (Fig. 8.11c) is the distance of the top surface 
ολ AN " x from the vertex of the pyramid. Since S; : S = h? : H? 
4 dre me Ole AR, 
SA S^ Ah, one can write: 0; = T Ah;. 
| b i C) | 


n 


x 20 AS ΠΕΙΣ SH 
Figure 8.11 d Pu UM e= LE 


EL 0 
8.2.2.2 Applications in Geometry 


1. AreaofPlanar Figures 


1. Area ofa Curvilinear Trapezoid Between B and C (Fig. 8.12a) if the curve is given by an 

equation in explicit form (y = f(x) and a € x < b) or in parametric form (x = x(t), y = y(t), tı € 
b ty 

Sapcp = [κω dx = / y(t)o (t) dt (f(a) > 0;a(t1) = a;x(te) = b; y(t) > 0). (8.60a) 


2. Area ofa Curvilinear Trapezoid Between G and H (Fig. 8.12b) if the curve is given by an 
equation in explicit form (x = g(y) and a € y € 8) or in parametric form (x = a(t), y = y(t), tı € 
B tə 
Seron = EO dy — [roy dt (g(y) 2 0; y(ti) = a; y(t2) = 8; x(t) 5 0). (8.60b) 
a ti 


3. Area of a Curvilinear Sector (Fig. 8.12c), bounded by a curve between K and L, which is 
given by an equation in polar coordinates (p = p(y), yı € y € pa): 


1 2 
Sokr = 5 [e dy. (8.60c) 
v1 
Areas of more complicated figures can be calculated by partition of the area into simple parts, or by 
line integrals (see 8.3, p. 515) or by double integrals (see 8.4.1, p. 524). 
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Figure 8.12 


2. Arc-length of Plane Curves 


1. Arc-length of a Curve Between Two PO (I) A and B, given in explicit form (y — f(x) or 
x = g(y)) or in parametric form (x = x(t), y = y(t)) (Fig. 8.13a) can be calculated by the integrals: 


b B [2] 
Lo = [ i eG σσ Οσο (8.614) 


With the differential of the arc-length dl one gets 
p= / di — with — d? = da? + dy. (8.61b) 


Β The perimeter of the ellipse with the help of (8.614): With the substitutions x = z(t) = asint, y = 


t2 : 
y(t) — bcost it follows that Le = / y a? — (a? — b?) sin? tdt = af V1-— e?sin?t dt, where 
AB ü 


e = Va? — b? /a is the numerical eccentricity of the ellipse. 
Since x = 0, y = band x =a, y = 0 the limits of the integral in the first quadrant are {1 = 0 and 


71/2 r 
t2 = 7/2 so, for the perimeter of the ellipse holds L~ = 4a [ V1—e2sin? tdt = a E(k, 5) with 
AB Jo 


k =e. The value of the integral E(k, 5) is given in Table 21.9 (see example in 8.1.4.3, p. 491). 


2. Arc-length of a Curve Between Two 
Points (II) C and D, given in polar coordinates „Ay D 


= β B 
(p = p(y)) (Fig. 8.13b): > dy dp) g 
A ράφ 
2 8.61c) α ax do 
a= sof p (2) (8. ς 
> ο > 
With the differential of the arc-length dl one gets aj 0! a bx b) 9,9; X 


L= fa with dl? = Pde? + dp”. (8.614) 
d Figure 8.13 


3. Surface Area of a Body of Revolution (see also First Guldin Rule, p. 506) 


1. The area of the surface of a body given by rotating the graph of the function y = f(x) > 0 around 
the x-axis (Fig. 8.14a) is: 


b b d 
S= an fy dl = 2r f uz) 1+ (2) dx. (8.62a) 
dx 
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2. The area of the surface of a body given by rotating x = f(y) > 0 around the y-axis (Fig. 8.14b) is: 
ΐ ^ dz V? 
S = 2π E dl — 2π "EO (: + =) dy. (8.62b) 
a a 
3. To calculate the area of more complicated surfaces see the application of double integrals in 8.4.1.3, 
p. 527 and the application of surface integrals of the first kind, 8.5.1.3, p. 535. General formulas for 
the calculation of surface areas with double integrals are given in Table 8.9 (Applications of double 
integrals), p. 528. 


y dl 
0 E X dl 
+ > 
b x 
a) b) ew 
Figure 8.14 Figure 8.15 


4. Volume (sce also Second Guldin Rule, p. 506) 
1. The volume of a rotationally symmetric body given by rotation around the x-axis (Fig. 8.14a) is: 
b 
V2m [2 dx. (8.63a) 
a 
2. The volume of a rotationally symmetric body given by rotation around the y-axis (Fig. 8.14b) is: 
b 
Vet [46 dy. (8.63b) 


a 


3. The volume of a body, whose section perpendicular to the x-axis (Fig. 8.15) has an area given by 
the function $ — f(x), is: 


V= | f(a) dex. (8.64a) 


W Calculation of the volume of an ellipsoid of revolution centered at the origin. Since the ellipsoid 
of revolution z?/a? + y?/b? + 27/c? = 1 (see (3.412), 3.5.3.13,2., p. 224 and Fig.8.16) with a = c 
arises by rotating the ellipse y?/b? + 2?/c? = 1 about the y-axis, the area of the circle shaped cross- 
sections being parallel to the x, z-plane is S = f(y) = ποῦ = «c(1 — y?/0?), so by integration V = 
2nc? ρα — y? /b?)dy = (4/9 πυς. 


4. Cavalieri Principle If in the interval fa, b] there exists in addition to a cross-sectional area func- 


tion S = f(x) a second cross-sectional area function S = f(x) which has the same value for every x as 
f(x), then the volumes V according to (8.64a) and V according to (8.64b) are equal: 


b 
σ-[Τω-ν. (8.649) 
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Figure 8.16 Figure 8.17 Figure 8.18 


The original Cavalieri’s principle tells: If two bodies are bounded below and above by two parallel 
planes and in these planes and in every plane being parallel to these ones their cross-sectional areas are 
equal to each other, then the volumes of these bodies are also equal (Fig.8.17). 

5. Tables General formulas to calculate volumes with multiple integrals are given in Table 8.9 
(Applications of double integrals, see p. 528) and Table 8.11 (Applications of triple integrals, see 
p. 533). 


8.2.2.3 Applications in Mechanics and Physics 


1. Distance Traveled by a Point 
The distance traveled by a moving point during the time from tọ until T with a time-dependent velocity 


v — f(t)is 


T 
s= foa. (8.65) 
to 
2. Work 


The work in moving a body in a force field depends of the direction of motion. Here it is supposed that 
the direction of the field and the direction of the movement are constant and coincide along the x-axis. 


If the magnitude of the force Fis changing, i.e., F | = f(x), then the work W necessary to move the 
body from the point x = a to the point x = b along the z-axis is 


b 
W= no dz. (8.66) 


In the general case, when the direction of the force field and the direction of the movement are not 
coincident, the work is calculated as a line integral (see (8.130), p. 522) of the scalar product of the 
force and the variation of the position vector at every point of r along the given path. 
3. Gravitational and Lateral Pressure 
In a fluid at rest in the gravitational field of the Earth with a density o and gravitational acceleration 
9 (see Table 21.2, p. 1053) the gravitational pressure p and the lateral pressure p, are distinguished. 
The gravitational pressure p at a depth x under the surface of the fluid (sce Fig.8.18) is 

τοσα. (8.67a) 
The lateral pressure p, acting e.g. on a cover plate (surface area A) of a side opening of a container (see 
Fig.8.18), is caused by the pressure force F acting in all directions. The differential pressure force dF’ 
acting perpendicular on a side surface-element dA in depth x under the surface of the fluid is 


dF-—ogxdA-ogzocy(x)dz. (8.67b) 
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Integration and division by A yields 
ha 


0 
ps = 25 ανν) - (2) de = ogee. (8.676) 
hi 


Functions yı(x) and y(x) describe the left and right boundaries of the cover plate and xc is the z- 
coordinate of its center of mass (see center of gravity of planar figures paragraph 5., p. 505). 
Remark: The center of mass of the cover plate usually does not coincide with the point of impact of 
pressure force F since the lateral pressure force is proportional to x. 


4. Moments of Inertia 


1. Moment of Inertia of an Arc The moment of inertia of a homogeneous curve segment y — f(x) 
with constant density o in the interval τα, b] with respect to the y-axis (Fig. 8.198) is: 


b b 
I,—o [ea =ø [ Ai vy as. (8.68) 


If the density is a function o(z), then its analytic expression is in the integrand. 


2. Moment of Inertia of a Planar Figure The 


yA y moment of inertia of a planar figure with a homo- 
geneous density o with respect to the y-axis, where 
dy S y is the length of the cut parallel to the y-axis = 

x 1 d (Fig. 8.19b), is: 

y m 
χή----ὴ D efe y dx. (8.69) 

à dx bx ο a 
b) 


a dx b x 
a) 


(See also Table 8.9, (Applications of the Double 
Integral), p. 528.) If the density is position depen- 


Figure 8.19 dent, then its analytic expression must be in the 
integrand. 
ya γα y^ 
L B B 
Yc C Yc 
N 
θα Xc b X 0 Xc X 

a) b) ο) 


Figure 8.20 


5. Center of Gravity, Guldin Rules 


1. Center of Gravity of an Arc Segment The center of gravity C of an arc segment of a homo- 
geneous plane curve y = f(x) in the interval [a,b] with a length L (Fig. 8.204) considering (8.614), 
p. 502, has the coordinates: 


b b 
E 1 + y? dx ΠΠ 


Ὅο-Ξ--------π----Β Uc 


n (8.70) 


L 
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2. Center of Gravity of a Closed Curve The center of gravity C of a closed curve y = f(x) 
(Fig. 8.20b) with the equations y; = f(a) for the upper part and y2 = f(x) for the lower part, and 
with a length L has the coordinates: 
G À 
JEEE P + VI+ 99) dz [OVIE Q4 “πο 
gaz a . TAE a 
C a Yo L 
3. First Guldin Rule Suppose a plane curve segment is rotated around an axis which lies in the 
plane of the curve and does not intersect the curve. Choosing it as the x-axis the surface area Spot of 
the body generated by the rotated curve segment is the product of the perimeter of the circle drawn by 
the center of gravity at a distance rc from the axis of rotation, i.e., 2zrc, and the length of the curve 
segment L: 
S rot =L. 2πτα. (8.Τ2) 
4. Center of Gravity ofa Trapezoid The center of gravity C of a homogeneous trapezoid bounded 
above by a curve segment between the points of the curve A and B (Fig. 8.20c), with an area S of the 
trapezoid, and with the equation y = f(x) of the curve segment AB, has the coordinates: 
b b 


ETT T ES dx 


ro =* Sg ^ w= —$ : (8.73) 
5. Center of Gravity of an Arbitrary Planar Figure The center of gravity C of an arbitrary 
planar figure (Fig. 8.204) with area 5, bounded above and below by the curve segments with the 
equations yi = fi(x) and ys = fo(x), has the coordinates 


(8.71) 


NI = 


b b 
[ey --υὐ) ae [A - as 


5 7 e 5 
Formulas to calculate the center of gravity with multiple integrals are given in Table 8.9 (Applications 
of double integrals, p. 528) and in Table 8.11 (Applications of triple integrals, p. 533). 
6. Second Guldin Rule Suppose a plane figure is rotated around an axis which is in the plane o 
the figure and does not intersect it. Choosing it as the z-axis the volume V of the body generated by 
the rotated figure is equal to the product of the perimeter of the circle drawn by the center of gravity 
under the rotation, i.e., 27rc, and the area of the figure S: 

Vrot = S: 2πτο. (8.75 


(8.74 


Zo = 


8.2.3 Improper Integrals, Stieltjes and Lebesgue Integrals 


8.2.3.1 Generalization of the Notion of the Integral 

The notion of the definite integral (see 8.2.1.1, p. 493), as a Riemann integral (see 8.2.1.1, 2., p. 494), 
was introduced under the assumptions that the function f (x) is bounded, and the interval [a, b] is closed 
and finite. Both assumptions can be relaxed in the generalizations of the Riemann integral. Some of 
them are mentioned in the following section. 
1. Improper Integrals 
These are the generalization of the integral to unbounded functions and to unbounded intervals. In the 
next paragraphs integrals with infinite integration limits and integrals with unbounded integrands are 
discussed. 
2. Stieltjes Integral for Functions of One Variable 

Considering two finite functions f(x) and g(x) defined on the finite interval [a, b] and decomposing the 
interval into subintervals, just as with the Riemann integral, but instead of the Riemann sum (8.38) 
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the following sum is constructed: 


Σ FEE) — οίαι ε)] (8.76) 


If the limit of (8.76) exists, when the length of the subintervals tends to zero, and it is independent of 
the choice of the points x; and &;, then this limit is called a definite Stieltjes integral (see also [8.8]). 

Β For g(x) = x the Stieltjes integral becomes the Riemann integral. 

3. Lebesgue Integral 

Another generalization of the integral notion is connected with measure theory (see 12.9, p. 693), where 
the measure of a set, measure spaces, and measurable functions are introduced. In functional analysis 
the Lebesgue integral is defined (see 12.9.3.2, p. 696) based on these notions (see [8.6]). The generaliza- 
tion with comparison to the Riemann integral is, e.g., the domain of integration can be a rather general 
subset of IR" and it is partitioned into measurable subsets. 


There are different notations for the generalizations of the integrals (see [8.8]). 
8.2.3.2 Integrals with Infinite Integration Limits 
1. Definitions 


a) If the integration domain is the closed half-axis [a, +00), and if the integrand is defined there, then 
the integral is by definition 


too B 
Jj f(x) dz = Jim | f(x) dz. (8.77) 


If the limit exists, then the integral is called a convergent improper integral. If the limit does not exist, 
then the improper integral (8.77) is divergent. 
b) If the domain of a function is the closed half-axis (—oo, b] or the whole real axis (--οο, +00), then 
one defines analogously the improper integrals 


b too B 
J f(x) dx = im ο xv, (8.78a) / f(x) dz = Jim. EO dx. (8.78b) 
ο. -.δο Boo Ἅ 


c) At the limits of (8.782) the numbers A and B tend to infinity independently of each other. If the 
limit (8.78b) does not exist, but the limit 


+A 
Jim / f(x) de, (8.78c) 
ROSA 


exists, then this limit (8.78c) is called the principal value of the improper integral, or Cauchy's principal 
value. 

Remark: An obviously necessary but not sufficient condition for the convergence of the integral (8.77) 
is lim f(z) = 


2. Geometrical Meaning of Integrals with Infinite Limits 
The μόν. (8.77), (8.78a) and (8.78b) give the area of the figures represented in Fig. 8.21. 


d. ^ 
BA: lim Be jim In B = oo (divergent). 
λος 


J1 B—aooJ]1 2 


T- 
dx B dx 1. 1 1 
HB: La lim - lim G ) 5 (convergent). 
2 a? 


Bow /2 τ B—oo \ 2 B 
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Figure 8.21 


3. Sufficient Criteria for Convergence 


If the direct calculation of the limits (8.77), (8.78a) and (8.78b) is complicated, or if only the convergence 
or divergence of an improper integral is the question, then one of the following sufficient criteria can be 
used. Here, only the integral (8.77) is considered. The integral (8.78a) can be transformed into (8.77) 
by substitution of x by --ᾱ; 


j f(z)de = Tia dx. (8.79) 


The integral (8.78b) can be decomposed into the sum of two integrals of type (8.77) and (8.78a): 


coo c +0 
J f(z)dz = | f(a) dx + / f(z) dz, (8.80) 


where c is an arbitrary number. 


Criterion 1: If f is integrable in any finite interval of [a, oo) , and if the integral 


Too 
J | f(x) | dz (8.81) 


is convergent, then also the integral (8.77) is convergent. In this case the integral (8.77) is called abso- 
lutely convergent, and the function f(x) is absolute integrable on the half-axis [a, +00). 


Criterion 2: If for the functions f(x) and y(x) 
f(x) >0, v(x) »0 and f(x) < φ(α) for a € x « οσο (8.82a) 
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hold, then from the convergence of the integral 


-Foo Ἔσο 
/ y(x)dx (8.82b) the convergence of the integral / f(a) dx (8.82c) 


follows, and conversely, from the divergence of the integral (8.82c) the divergence of the integral (8.82b) 
follows. 
Criterion 3: Substituting (8.83a) and considering that for a > 0, a > 1 the integral (8.83b) 
+00 
1 dx 1 
plz)=—, (8.534) Κω μι i (a>0, m) (8.83b 
x : 


p" Ex p=} 
J v^ (a—1)a* 


is convergent and has the value of the right-hand side, and the integral of the left-hand side is divergen: 
for a < 1, then one can deduce a further convergence criterion from the second one: 
If f(x) ina € x < oo is a positive function, and there exists a number œ > 1 such that for all x large 
enough 

f(x)z^ « k «oo. (k » 0, const) (8.83c 
holds, then the integral (8.77) is convergent; if f(x) is positive and there exists a number a € 1 such 
that 

f(x)z* >ce>0 (620, const) (8.83d 
holds from a certain point x, then the integral (8.77) is divergent. 


ae - 
À ia dii = . Substituting a = : , gives D gl? = I = » 1. The integral is divergent. 
4. Relations Between Improper Integrals and Infinite Series 
If £1, %2,...,@p,... is an arbitrary, unlimited increasing infinite sequence, i.e., if 
a < Tı < Ta <: Say «s with lim. Bn 00; (8.84a 


and if the function f(x) is positive for a € x < oo, then the problem of convergence of the integra 
(8.77) can be reduced to the problem of convergence of the series 


πι πο In 
no dx + no dz / f(z)dr ++. (8.84b 
a Ti UVn-1 

If the series (8.84b) is convergent, then the integral (8.77) is also convergent, and it is equal to the 
sum of the series (8.84b). If the series (8.84b) is divergent, then the integral (8.77) is also divergent. 
So the convergence criteria for series can be used for improper integrals, and conversely, in the integra 
criterion for series (see 7.2.2.4, p. 462) one can use the improper integrals to investigate the convergence 
of infinite series. 


8.2.3.3 Integrals with Unbounded Integrand 
1. Definitions 


1. Right Open Interval For a function f(x), which has a domain [a, b) open on the right, and at 
the point b it has the improper limit lim p f(x) = œ, the definition of the improper integral is the 
wb 


following: 
b 


/ f(z) de = Jim, i f(x) dz. (8.85) 


a 
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If this limit exists and is finite, then the improper integral (8.85) exists, and is called a convergent 
improper integral. If the limit does not exist or it is not finite, then the integral is called a divergent 
improper integral. 

2. Left Open Interval For a function f(x), which has a domain open on the left (a, b], and at the 
point a it has the limit i lim n f(x) = oc, the definition of the improper integral analogously to (8.85) is 


b b 
J Λία) dz = lim. / f(z) dz. (8.86 
a ate 


3. Two Half-Open Continuous Intervals For a function f(x), which is defined on the interva 
(a, b] except at an interior point c with a < c < b, i.e., for a function f(x) defined on the half-open 
intervals ία, c) and (ο, b], or is defined on the interval τα, b], but at the interior point c it has an infinite 
limit at least from one side ,um i f(x) =œ or jim f(x) = oo, the definition of the improper integra 


is 


b ο-ε b 
/ Λία) dz = lim. / f(x) dz + lim / f(x) dz. (8.87a 
^ a có 

Here the numbers ε and à tend to zero independently of each other. If the limit (8.87a) does not exist, 
but the limit 


c—& b 
Jim, | / f(x) dz + / f(x) a} (8.87b) 
a cte 

does, then the limit (8.87b) is called the principal value of the improper integral or Cauchy's principal 
value. 

2. Geometrical Meaning 

The geometrical meaning of the integrals of unbounded functions (8.85), (8.86), and (8.87a) is to find 
the area of the figures bounded, e.g., from one side by a vertical asymptote as represented in Fig.8.22. 


yA ya ya ya yA 


b b a b . 
oa bx Ola & x 0a @x Gx 0 
a) s. (8.85) b) s. (8.86) c) s. (8.87a) 
Figure 8.22 


b dx 
ΒΑ: —: 
Jo να 


b dx b dx 
E es “™ = lim (2Vb — 2/2) =: 'onvergent). 
hya jim f Ja Jim vb — 2/2) =2Vb (convergent) 
π 


π/2 
ΒΡ: [ tanadx: Case (8.85), singular point at 2 = 3 
20 


Case (8.86), singular point at x = 0. 


π/2 π/2-ε π 
[ tanrdr = lim ri tanadx = lim |Incos0 — In cos G — 2] —oo (divergent). 
Jo e>+0 Jo e>-+0 2 
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B C: Ls Case (8.87a), singular point at x = 0. 


8 Fa | - dz . 3, 9/3 so 9,5, καν 9 os 
E i EM jim, [5 Vr Γης ιο en ) = z (convergent). 


2x d 
B D: 7 E D Case (8.872), singular point at x = +1. 


2r2— 

2 coe -----ε 1-ν 2 

/ P z lim, f + lim, f + lim / 

-2z*—]l  seo40/-2 40 Jags 70 1 ΣΥ 

= lim hjr? — 1| |-3* +--+ = lim [In|1 + 2e ?— 1| 2 In3] ο — oo. (divergent). 
e>+0 e>+0 


3. The Application of the Fundamental Theorem of Integral Calculus 
1. Warning The calculation of improper integrals of type (8.87a) with the incorrect use of the formula 


/ f(x)de = F(x) |? with F(x) = f(x) (8.88) 


(see 8.2.1.2,1., p. 495) usually results in mistakes if the singular points in the interval [a, b] are not taken 
into consideration. 
W E: Using formally the fundamental theorem one gets for the example D 


2 Qa dr P ; . 
f. In [a? — 1| [252 13 — 1n3 = 0, 


though this integral is divergent. 
2. General Rule The fundamental theorem of integral calculus can be used for (8.87a) only if the 
primitive function of f(x) can be defined to be continuous at the singular point. 


ΒΕ: In example D the function In |x? — 1| is discontinuous at x = +1, so the conditions are not 
fulfilled. Considering example C, the function y — 3 «2/9 is such a primitive function of yz on the 
a 


intervals [a, 0) and (0, b] which can be defined continuously at x = 0, so the fundamental theorem can 
be used in example C: 


8 dx _ 39/3 m 2/3 12/3 9 
να. aag ON eg 


4. Sufficient Conditions for the Convergence of an Improper Integral with 
Unbounded Integrand lim, f(x) — oo 


b b 
1. If the improper integral / |f(x)| dx converges, then the improper integral / f(x) dx also con- 
a a 


verges. In this case it is called an absolutely convergent integral and the function f(x) is an absolutely 
integrable function on the considered interval. 


2. Ifthe function f(x) is positive in the interval [α, b), and there is a number a < 1 such that for the 
values of all x close enough to b 

f(x) (b — x)” < oo (8.89a) 
holds, then the integral (8.87a) is convergent. But, if the function f(x) is positive in the interval [a, b), 
and there i is a number a > 1 such that for the values of x close enough to b 

f(x) (b—2)* >ce>0 (cconst) (8.89b) 
holds, then the integral (8.87a) is divergent. 
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8.2.4 Parametric Integrals 


8.2.4.1 Definition of Parametric Integrals 
The definite integral 


ie y) dx = F(y) (8.90) 


is a function of the variable y considered here as a parameter. In several cases the function F(y) is 
no longer elementary, even if f(x,y) is an elementary function of x and y. The integral (8.90) can be 
an ordinary integral, or an convergent improper integral with infinite limits or unbounded integrand 
Λία, y). 
For theoretical discussions about the convergence of improper integrals depending on a parameter see, 
e.g.. [8.3]. 
B Gamma Function or Euler Integral of the Second Kind (see 8.2.5, 6., p. 514): 
oo 
I(y)-— joe da (convergent for y > 0). (8.91) 
0 


8.2.4.2 Differentiation Under the Symbol of Integration 


1. Theorem Ifthe function (8.90) is defined in the interval c € y < e, and the function f(x,y) is 
continuous on the rectangle a € x < b, c € y € e and it has here a continuous partial derivative with 
respect to y, then for arbitrary y in the interval [c, e] holds 

b 


b 
1 Of (v, i 
Sa / f(x,y) dx = / Of (2,9) dx. (8.92) 
dy a a ὃν 
This is called differentiation under the symbol of integration. 
2 


1 [! r lo x 1 gdy 1 1 
W For arbitrary y > 0: A / arctan Ž dx = / arctan —) dz = / = d 
` dy Jo y ο ὃν y 0 


= In — ; 
x+y? 2 "i.g 
1 T 1. 1 yd 1 1 2 1 ? 
Checking: / arctan a dr = arctan = + = yln y =; arctan — + = yln y z|-—zln y =. 
0 y y 2 1+ y?’ dy y 2 1c 2 1-49 


For y = 0 the condition of continuity for f(x,y) is not fulfilled, and there exists no derivative. 

2. Generalization for Limits of Integration Depending on Parameters The formula (8.92) 
can be generalized, if with the same assumptions as have been made for (8.92) the functions a(y) and 
B(y) are defined in the interval [c, e], they are continuous and differentiable there, and the curves x = 
a(y), x = (y) do not leave the rectangle a < x <b, c < y < e: 


d Bly) KENE y) 
£ J fede | μου (8.93) 
Y a) ay OY 


8.2.4.3 Integration Under the Symbol of Integration 


If the function f(x,y) is continuous on the rectangle a € x € b, c € y € e, then the function (8.90) is 
defined in the interval [c, e], and 


[κων is c || ne ij dx (8.94) 


is valid. This case of commutability of the order of integration is called integration under the symbol of 
integration. 


e 


/ 


c 
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W A: Integration of the function f(x,y) = x" on the rectangle 0 € x < 1, a € y € b. The function 


25 is discontinuous at x = 0, y = 0, for a > 0 it is continuous. So one can change the order of 
by pl 1f pb ν dy 14b 
integration: / | / x η dy = / 1 x” dy| dx. On the left-hand side one gets / — — — |n , 
a 190 0 Ίνα alcty l+a 
1 xè — r’ 
on the right-hand side / i dx. The corresponding indefinite integral cannot be expressed by 
o lng 
elementary functions. Anyway, the definite integral is known, so now follows: 
pma mit? Φεαεῦ 
———— dr = ln —— a . 
Jo Ine lca 
"P 
W D: Integration of the function f(x, y) = GV over the rectangle 0 < x < 1,0 € y € 1. The 
22 +y 
function is discontinuous at the point (0, 0), so the formula (8.94) cannot be used. Checking it yields: 
fe phe 29 T i 1 d 
Yy -r 1 a—l y qo 
— dr - a =; ; = arctan y = —; 
ri (a? + y?)? rty “50 14y Jo 1+y? y lo 4 
E eae ; 1 ; 
y- y -1 1 dx 1 π 
dy = ος ; / - = — arctan x |p2 ——. 
i (a? + y?)? 9$. y? "= 22 1-1᾽ ο z?41 retan Ig 4 


8.2.5 Integration by Series Expansion, Special Non-Elementary 
Functions 

It is not always possible to express an integral by elementary functions, even if the integrand is an ele- 

mentary function. In many cases one can express these non-elementary integrals by series expansions. 

If the integrand can be expanded into a uniformly convergent series in the interval [a, b], then one gets 


£ 
also a uniformly convergent series for the integral / f(t) dt by integrating it term by term. 
1. Sine Integral (|x| < oo, see also 14.4.3.2, 2., p. 756) 


x oo 


A sint T sin t 
Si (x) / ; dt ᾿ / : dt 
r3 T? (-1γηχ2πεὶ 
TS 5B Gane (8.95) 
2. Cosine Integral (0 < x < oc) 
[ cost f 1— cost 
Ci (x) {= dt =C+ Ina / c dt 
z 0 
r? qt (=1)"x™” 
=CtIng | e res with Oba 
C+iIna 2.91 4. 4l T 2n- ni with (8.962) 
or - fe mntat = 0.577 215 665... (Euler constant). (8.96b) 


0 


3. Integral Logarithm (0 < æ < 1, for 1 < x < oo as Cauchy Principal Value) 


i j? qon 
di C +ln|lnz|+lnz4 (nz) kerea (In z) l 
à Int 


Li (x) 


2.2! πι. τι iR 
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4. Exponential Integral (—oo < x < 0, for 0 < x < oo as Cauchy Principal Value) 


E 7 οἱ r2 r” 
Bios 7 dt =C + Info| πμ, (8.984) 
Ei (In x) = Li (x). (8.98b) 


5. Gauss Error Integral and Error Function 
The Gauss error integral is defined for the domain |x| < oo and it is denoted by Φ. The following 
definitions and relations are valid: 


G(x) = m / oa dt, (8.99a) jim Φ(α) = 1, (8.99b) 
y. τ ο 1 
Φρ(α) = Van é 2di= P(x) — 7 8 (8.99c) 


The function (x) is the distribution function of the standard normal distribution (see 16.2.4.2, p. 819) 
and its values are tabulated in Table 21.17, p. 1133. 

The error function erf (x), often used in statistics (see also 16.2.4.2, p. 819), has a strong relation with 
the Gauss error integral: 


erf (x) = τ. / e” dt = 20)(xV2), (8.100a) Jim erf (x) = 1, (8.100b) 
0 
9 19 T? (- jg 
erf (x) = Ta (+ 1-3 3*5 qe RD bree], (8.100c) 


fot (t) dt = x erf (x) 4 


71 1), (8.100d) ο τε (8.1000) 


6. Gamma Function and Factorial T(x) 4 
1. Definition The gamma function, the Euler integral of the 51 
second kind (8.91), is an extension of the notion of factorial 41 
for arbitrary numbers x, even complex numbers, except zero 3. 
and the negative integers. The curve of the function I(x) is 24 
represented in Fig. 8.23. Its values are given in Table 21.10, 1. 
p. 1105. It can be defined in two ways: 
x no 5» Σαχ’ 


1 


Γ(α) = [et dt (x0) or (8.101a) 2 
0 3 

zn! = 
Γ(α) = lim id i ἢ E 


noo (η + 1)(x 4-2)...(x ἠ- n) 
(z 4 0,—1,-2,...). (8.101b) 


Figure 8.23 
2. Properties of the Gamma Function 


I(zr--1)- αΓ(α), (8.102a) I(n+1)=n! (n=0,1,2,...),  (8.102b) 
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π 
Γία) Γ{1 -- ο) = daas ο (sd EA si (8.102c 
1 oo 
Γ (5) = afe® diee (8.102d 
0 
1 (2η)!γπ ; 
r(n+ 5) 2E om (n0,12,. (8.102e 
(d (C Dar 
d n4 5) Qn)i (1. —.0,1,2;...). (8.102f 


The formulas (8.102a) and (8.102c) are also valid for complex arguments z, but only if Re (z) > 0 holds. 


3. Generalization of the Notion of Factorial The notion of factorial, defined until now only for 
positive integers n (see 1.1.6.4, 3., p. 13), leads to the function 


xz! — Γ(α 1-1) (8.103a 
as its extension for arbitrary real numbers. The following equalities are valid: 
For positive integers v: a! =1-2-3---2, (8.103b) forr —0: 0!=J/(1) =1, (8.103c 
for negative integers 7: x! = coo, (8.103d) for x = i : (5)! =I (5) = B , (8.103e 
1 1 1 3 3 1 
ORIS asia ο αυ Lo UP £ TOES SI TO L5. ; 
for x 5° ( ;) D 5) Υπ. (8.103f) for x af ( ;) i ;) 2/7 . (8.103g 


An approximate determination of a factorial can be performed for numbers > 10, also for fractions n 
with the Stirling formula: 


nA" 1 
twfl—) Von νο uh 1 
" ο) 3 ( + Ton + 288) 7 ) ᾿ ο 
In(n!) z (n + 3 Inn — n4 In v2r. (8.1031) 


7. Elliptic Integrals 
For the complete elliptic integrals (see 8.1.4.3, 2., p. 490) the following series expansions are valid: 


d T JE 2. 1:342 1.3.52, | 
K- BAUEN ESSE (51) kA ( ) βδ....]., κ <1, (8.104 
ποτε il (3) + (aa) (aoc) e] es 109 


$ 152k 13N ke. (1:53:51 ko 
dq oldqdsssieg4 = -— 
g- | μας ab is 1 a 3 τα) 5 | 


k? <1. (8.105) 


The numerical values of the elliptic integrals are given in Table 21.9, p. 1103. 


8.3 Line Integrals 

The notion of the integral can be generalized in different ways. While the domain of an ordinary definite 
integral is an interval on the numerical axis, for a line integral, the domain of integration is a segment 
of a planar or space curve. The curve, i.e., the path of integration can also be closed; it is called also 
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circuit integral and it gives the circulation of the function along the curve. There are distinguished line 
integrals of the first type, of the second type, or of general type. 


8.3.1 Line Integrals of the First Type ya A=B 
8.3.1.1 Definitions 


The line integral of the first type or integral over an arcis the definite 
integral 


/ f(x,y) ds, (8.106) 
(C) 


where f(x, y) is a function of two variables defined on a connected 


domain and the integration is performed over an arc C =AB of 
a plane curve given by its equation. The considered arc is in the 
same domain, and it is called the path of integration. The numerical 
value of the line integral of the first type can be determined in the 
following way (Fig. 8.24): Figure 8.24 


1. Decomposing the rectifiable arc segment AB into n elementary parts by points A, Ao, ..., Αμι 
chosen arbitrarily, starting at the initial point A = Αρ and finishing at the endpoint B = A,. 


2. Choosing arbitrary points P; inside or at the end of the elementary arcs A; , A;, with coordinates 
ĉi and ni. 


3. Multiplying the values of the function f(£;,r;) at the chosen points with the arc-length A; ,A;— 
As; ι which should be taken positive. (Since the arc is rectifiable, As; ι is finite.) 

4. Adding the n products f(£;,;) ^s; 4. 

5. Evaluating the limit of the sum 


n 
Y f(& m) Asi- (8.1072) 
i-l 
as the arc-length of every elementary curve segment As; tends to zero, while n obviously tends to oo. 


If the limit of (8.107a) exists and is independent of the choice of the points A; and P;, then this limit is 
called the line integral of the first type, and the function f(x,y) is called integrable along the curve C: 


/ f(x, y) ds = ο» ο (8.107b) 
(C) notes: d 


Analogously the line integral of the first type can be defined for a function f(x, y, ο) of three variables, 
whose path of integration is a curve segment of a space curve: 


n 
f(a,y, 2) ds = jim, f. Ni, Gi) Δι ι. (8.107c) 
(C) n= i—l 
8.3.1.2 Existence Theorem 
The line integral of the first type (8.107b) or (8.107c) exists if the function f(x, y) or f(x,y,z) is con- 
tinuous along the continuous arc segment C, and the curve has a tangent which varies continuously. In 
other words: The above limits exist and are independent of the choice of A; and P;. In this case, the 
functions f(x, y) or f(x, y, 2) are called integrable along the curve C. 
8.3.1.3 Evaluation of the Line Integral of the First Type 


The calculation of the line integral of the first type can be done by reducing it to a definite integral. 
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1. The Equation of the Path of Integration is Given in Parametric Form 
If the defining equations of the path are x = z(t) and y = y(t), then 


| fle was= f fist. OW Ile COP + lr COT at (8.1082) 
(C) to 


holds, and in the case of a space curve x = x(t), y = y(t), and z = z(t) 


πο 45 -- μμ (8.108b) 
(C) to 


where to is the value of the parameter t at the point A and T is the parameter value at B. The points 
A and B are chosen so that ty < T holds. 


2. The Equation of the Path of Integration is Given in Explicit Form y —y(x) 
Substituting t = x one gets from (8.108a) for the planar case 


[ fenas= | fis swa + lr OP ae (8.1092) 


(C) s 


and from (8.108b) for the three dimensional case 
b 
[ i6 ds — f isis tl OP + POP de (8.109) 
(C) 5 


Here a and b are the abscissae of the points A and B , where the relation a < b must be fulfilled. Then 
one considers x as a parameter if every point corresponds to exactly one point on the projection of the 
curve segment C onto the x-axis, i.e., every point of the curve is uniquely determined by the value of its 
abscissa. If this condition does not hold, then the curve segment has to be partitioned into subsegments 
having this property. The line integral along the whole segment is equal to the sum of the line integrals 
along the subsegments. 


8.3.1.4. Application of the Line Integral of the First Type 


Some applications of the line integral of the first type are given in Table 8.6. The curve elements ds 
needed for the calculations of the line integrals are given for different coordinate systems in Table 8.7. 
8.3.2 LineIntegrals ofthe Second Type 

8.3.2.1 Definitions 


A line integral of the second type or an integral over a projection onto the α-. y- or z-axis is e.g., the 
definite integral 


f fos (8.1102) Or | fla,y,2)ax, (8.110b) 
(C) (C) 


where f(x,y) or f(x,y,z) are two or three variable functions defined on a connected domain, and the 
integration is done over a projection of a plane or space curve C —AB (given by its equation) onto the 
u-, y-, or z-axis. The path of integration is in the same domain. 

The line integral of the second type one gets similarly to the line integral of the first type, but in the 
third step the values of the function f(&,7;) or f(&,7:,¢;) are not multiplied by the arc-length of the 


elementary curve segments A; ; A;, but by its projections onto a coordinate axis (Fig. 8.25). 
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Table 8.6 Line Integrals of the First Type 


Length of a curve segment C 


L- [as 


(C) 


Mass of an inhomogeneous 
curve segment C 


/ eas (o = f(x,y,z) density function) 
(C) 


Center of gravity coordinates 


1 1 1 
to= T / rods, Yyo= L / yods, το T / zods 
(C) (C) (C) 


Moments of inertia of a plane 


" ; 
curve in the z, y plane l= / wods, Ty = / yPods 
(C) (C) 
Tq Je + 2°)ods yes "n + 2*)ods 
Moments of inertia of a (C) (0) 
space curve with respect j a 
to the coordinate axes I= | (αἱ +y")ods 


In the case of homogeneous curves o = 1 is substituted. 


Table 8.7 Curve Elements 
Cartesian coordinates x, y = y(x ds = 4/1 + [y (x)]?dx 
Plane curve in | Polar coordinates y, p = p(y) 2 = "eg πῃ», 
the z, y plane x = p(y) cos y, y = p(y) sing ds = y go) + [p (p) Pay 
Parametric form in Cartesian ds = ος 
coordinates x = x(t), y = y(t) 
Space curve Parametric form in Cartesian ds = V (x (OP ο Ες 
coordinates x = x(t), y = y(t), z = z(t) 
1. Projection onto the z- Axis y 
With Pr, A; 1A;— xj — z;:1 = Δα; 1 one gets (8.111) y; A, B 
n ΥΗ[-----» "A P(E; πι) 
/ ία, ) ἀν = lim M7 f(&.m) Avi, (8.112a) Avail 
3 Az; 120 4 ! i 
©) R At 1 
I Λία, ψ, e)de= lim, = F&a G) Azi. (8.112b) ΠῚ WES x 
T Figure 8.25 
2. Projection onto the y- Axis 
[ Gs du = im FG.) Avis, (8.113a) 
(€) ποσο  i—l 
| Fey, 5) ἀν = „lim | S&G) Aia (8.113b) 
ο noo i=l 


( 
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3. Projection onto the z-Axis 


J fov dz lim | ο (8.114) 
(C) 1 


n-oc 


8.3.2.2 Existence Theorem 

The line integral of the second type in the form (8.112a), (8.113a), (8.112b), (8.113b) or (8.114) exists 
if the function f (c, y) or f(x,y,z) and also the curve are continuous along the arc segment C', and the 
curve has a continuously varying tangent there. 

8.3.2.3 Calculation of the Line Integral of the Second Type 

The calculation of the line integral of the second type can be done by reducing it to a definite integral. 
1. The Path of Integration is Given in Parametric Form 

With the parametric equations of the path of integration 


ατα, y=y(t) and (for a space curve) z= z(t) (8.115 
we get the following formulas: 
T 
For (8.112a f f(x,y) απ = ποσο α (8) dt. (8.116a 
(C) ti 
T 
For (8.113a / f(x,y) dy = | fe, y (ty (t) dt. (8.116b 
(C) to 
T 
For (8.112b / f(x,y, 2) απ = | te), y(t), z(t)]a" (t) dt. (8.116c 
(€) to 
T 
For (8.113b / f(x,y, 2) dy = JEGOR t), z(t)]y' (t) dt. (8.116d 
(C) D 
T 
For (8.114) / f(x,y, 2) dz = “ο y(t), z(t)]z (t) dt. (8.116e 
(0) to 
Here, £y and T are the values of the parameter t for the initial point A and the endpoint B of the arc 
segment. In contrast to the line integral of the first type, here we do not require the inequality ty < T. 


Remark: In the case of reversion of the path of the integral, i.e., interchanging the points A and B, 
the sign of the integral changes. 
2. The Path of Integration is Given in Explicit Form 
In the case of a plane or space curve with the equations 

y = y(x) Or y — y(x), z= σα) (8.117) 
as the path of integration, with the abscissae a and b of the points A and B, where the condition a < bis 
no longer necessary, the abscissa x takes the place of the parameter { in the formulas (8.112a) — (8.114). 


8.3.3 Line Integrals of General Type 
8.3.3.1 Definition 


A line integral of general type is the sum of the integrals of the second type along all the projections of 
a curve. If two functions P(x, y) and Q(z, y) of two variables, or three functions P(x, y, z), Q(x, y, 2), 
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and R(x, y, z) of three variables, are given along the given curve segment C', and the corresponding line 
integrals of the second type exist, then the following formulas are valid for a planar or for a space curve. 


1. Planar Curve 


[ (Pav + Qdy) = / Pac+ f Q dy. (8.118a) 
(C) (C) (C) 

2. Space Curve 
[ (Pax Qdy + Raz) = [ Pac+ [ Q2 | Raz. (8.118b) 
(C) (C) (C) (C) 


The vector representation of the line integral of general type and an application of it in mechanics will 
be discussed in the chapter about vector analysis (see 13.3.1.1, p. 719). 


B 
8.3.3.2 Properties of the Line Integral of General Type κ᾽ 
Μ 


1. The Decomposition of the Path of the Integral 4 
o dB 


by a point M, which is on the curve, and it can even be outside of AB 


(Fig. 8.26), results in the decomposition of the integral into two parts: A M 
[ (Pax + Qdy) = / (Pdr +Q dy) + / (P dx + Qdy)." (8.119) B B 


A A — 


AB AM MB 
Figure 8.26 
2. The Reverse of the Sense of the Path of Integration 
changes the sign of the integral: 
[ 0a: Qdy) == [ (Pav αυ (8.120) 
AB BÀ 
3. Dependence on the Path 
In general, the value of the line integral is dependent not only on the initial and B 
endpoints but also on the path of integration (Fig. 8.27): A 
/ (P dz +Q dy) 4 / (P dz + Q dy)* (8.121) D 
A A Figure 8.27 
AMB ADB 


MA: /= [αν dx + yz dy + zx dz), where C is one turn of the helix x = acost, y = asint, z = bt 
(C) 
(see Helix on p. 260) from tp = 0 to T = 2: 
Qn n --- , πα’ 
I= / (—à? sin? t cost + abt sin t cost + abt cost) dt = oa ae 
0 
HB: / - / (y? dx + (xy — a?) dy], where C is the arc of the parabola y? = 9x between the points 
(0) 


*Similar formulas are valid for the three-variable case. 
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3/2. yo yt 3 
A(0,0) and B(1,3): T= f°] oy 4 ENTER 
(0,0) and T8) Jo E 6 si} | 9 = 9; 


8.3.3.3 Integral Along a Closed Curve 


1. Notion of the Integral Along a Closed Curve A circuit integral or the circulation along a 
curve is a line integral along a closed path of integration C , i.e., the initial point A and the end poin 
B coincide. The following notation is used: 


f (Pax + Qdy) or f (Pde + Qdy + Raz), (8.122 
(€) (C) 
In general, this integral differs from zero. But it is equal to zero if the conditions (8.127) are satisfied, 
or if the integration is performed in a conservative field (see 13.3.1.6, p. 721). (See also zero-valuec 
circulation, 13.3.1.6, p. 721.) 
2. The Calculation of the Area S of a Plane Figure is a typical example of the application o 
the integral along a closed curve in the form 


ο ; fe dy — y dx), (8.123 
(©) 


where C is the boundary curve of the plane figure. The integral is positive if the path is oriented 


counterclockwise. 
8.3.4 Independence of the Line Integral of the Path of Integration [5 


The condition for independence of a line integral of the path of integration is also called integrability of 
the total differential. 

8.3.4.1 Two-Dimensional Case 

If the line integral 


i [P(x, y) dz + Q(x, y) dy] (8.124) 
(C) 
with continuous functions P and Q defined on a simple connected domain depends only on the initial 
point A and the endpoint B of the path of integration, and does not depend on the curve connecting 
these points, i.e., for arbitrary A and B and arbitrary paths of integration ACB and ADB (Fig. 8.27) 
the equality 


/ (P dx +Q dy) = / (P dx + Q dy) (8.125) 
ACB ADB 


holds, then it is a necessary and sufficient condition for the existence of a function U (x, y) of two vari- 
ables, whose total differential is the integrand of the line integral: 
oU oU 
puce dec 
Ox Oy 
The function U (x, y) is a primitive function of the total differential (8.126a). In physics, the primitive 
function U (x, y) means the potential in a vector field (see 13.3.1.6, 4., p. 722). 


(8.126b) 


P dz 4 Qdy — dU, (8.1262) ie., Q 


8.3.4.2 Existence of a Primitive Function 
A necessary and sufficient criterion for the existence of the primitive function, the integrability condition 
for the expression P dx + Q dy, is the equality of the partial derivatives 
OP  0Q 
u———, 8.127 
ob ag? (8.127) 
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where also the continuity of the partial derivatives is required. 
8.3.4.3 Three-Dimensional Case 
The condition of independence of the line integral 
[IPs 2) ἀν + QU, 2) dy + Rwy, 2) dz (8.128 


of the path of integration analogously to the two-dimensional case is the existence of a primitive function 
U (x, y, z) for which 


P dx 4- Qdy 4- Rdz = dU, (8.129a 
holds, i.e., 
OU OU OU 
pc ==, R=—. 8.199] 
Ox? Q Oy’ az ( ) 


The integrability condition is now that the three equalities for the partial derivatives 
ðQ OR OR OP OP OQ 
Oz Oy’ Ov Oz’? Oy Ox 
should be simultaneously satisfied, provided that the partial derivatives are continuous. 
E The work W (see also 8.2.2.3, 2., p. 504) is defined as the scalar product of force F (7) and displace- 
ment δ. In a conservative field the work depends only on the place 7, but not on the velocity v. With 


F = Pē, + QE, + ΠΟ, = gradV and ds = dxé, + dyé, + dzé, the relations (8.1292), (8.129b) are 
satisfied M the potential V(r), and (8.129c) is valid. Independently of the path between the points P, 
and P, holds: 


(8.129c 


W= E F(r).ds— [vw dx +Q dy + Rdz] = V(P5) — V(P). (8.130) 


8.3.4.4 Determination of the Primitive Function 


1. Two-Dimensional Case (Fig.8.28) 


If the integrability condition (8.127) is satisfied, then along an arbitrary path of integration connecting 
an arbitrary fixed point A(xo, yo) with the variable point P(x, y) and passing through the domain where 
(8.127) is valid, the primitive function U(x, y) is equal to the line integral 


Us [ (Pav + Qa). (8.131) 


AP 
In practice, it is convenient to choose a path of integration parallel to the coordinate axes, i.e., one 


of the segments AK P or ALP, if they are inside the domain where (8.127) is valid. There exist two 
formulas for the calculation of the primitive function U (x, y) and the total differential P dx + Q dy: 
T y 
U=U(a,y)+ f+ f =C+ | PE y)ae+ f QG man. (8.132a) 
AK KP To Yo 
U=U(e,m)+ f+ f =C+ j Οίκω η) dn + i (εν) ἆ (8.132) 
AL ΤΡ Yo To 


Here C is an arbitrary constant. 
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P(x, y, 2) 
A(X, Yor Zo) 


Figure 8.28 Figure 8.29 


2. Three-Dimensional Case (Fig. 8.29) 
Ifthe condition (8.129c) is satisfied, the primitive function can be calculated for the path of integration 
AKLP with the formulas 


U = U (x, yo, 20) Ἔν] πε] 


AK KL LP 


x y 2 

f Pe, Yo, 20) d£ + l Ο( η. zo) dn + [ 8G. y,£)d£ -- C (C arbitrary constant). (8.133) 
To Yo Zo 
For the other five possibilities of a path of integration with the segments being parallel to the coordinate 
axes one gets five further formulas. 
ΒΑ: Pdr+Q dy = i . The condition (8.129c) is satisfied: o = 29 = Ww πο ; 
Application of the formula (8.132b) and the substitution of ro 0, Yo 1 (αρ 0, Yo 0 
may not be chosen since the functions P and Q are discontinuous at the point (0,0)) resulting in 


1 “di z yd v " 
Ber I | e + U(0, 1) arctan ~ + C = arctan 2. + ΟἹ. 
1 0? +7 ο & y y T 


: TN | 1 1 c+ 2 x 1 s relations 
B B: Pdr + Qdy - Rdz = z (3 z 1-5) dx + m dy 4 (= np 5) dz. The relations 
(8.129c) are satisfied. Application of the formula (8.133) and substitution of ro = 1, yo = 1, 29 = 0 


; T y 3 μη 1 2 2 
result in U / 0 - αξ 4 / 0 ἆη 4 / (ate =) d¢+C = arctan 2m FC. 


8.3.4.5 Zero-Valued Integral Along a Closed Curve 
The integral along a closed curve, i.e., the line integral P dx + Q dy is equal to zero, if the relation 
(8.127) is satisfied, and if there is no point inside the curve where even one of the functions P, Q, 7 


Q 


ο ον ας δ 
or ΠΕ is discontinuous or not defined. 


Remark: The value of the integral can be equal to zero also without this conditions, but then one gets 
this value only after performing the corresponding calculations. 


8.4 Multiple Integrals 


The notion of the integral can be extended to higher dimensions. If the domain of integration is a region 
in the plane or on a surface in space, then the integral is called a surface integral, if the domain is a part 
of space, then it is called a volume integral. Furthermore, for the different special applications there are 
other special notations. 
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8.4.1 Double Integrals 

8.4.1.1 Notion of the Double Integral 

Y Ῥιος ση 1. Definition 

The double integral of a function of two variables u = f(a, y) over a planar 
domain S in the x, y plane is denoted by 


[flew ds = ff fiy) dy dx. (8.134) 
5 5 


It isa number, if it exists, and it is defined in the following way (Fig. 8.30): 
1. Decomposition of the domain S into n elementary domains AS;. 


2. Choosing an arbitrary point P;(2;, yi) in the interior or on the boundary 
Figure 8.30 of every elementary domain. 


3. Multiplication of the value of the function u = f(x;, yi) at this point by the area AS; of the corre- 
sponding elementary domain. 

4. Summation of these products f(x;, y;) AS;. 

5. Calculation of the limit of the sum 


Y f(zi w)AS; (8.1352. 
ἐΞῖ 


as the diameter of the elementary domains tends to zero, consequently AS; tends to zero, and so n 
tends to oo. (The diameter of a set of points is the supremum of the distances between the points o 
the set.) The requirement AS tends to zero is not enough, because, e.g., in the case of a rectangle the 
area can be close to zero also if only one side is small and the other is not, so the considered points 
could be far from each other. If this limit exists independently of the partition of the domain S into 
elementary domains and also of the choice of the points P;(x;, yi), then it is called the double integra 
of the function u = f(x, y) over the domain S, the domain of integration, and one writes: 


/ Λία.) d$ = lim Σ᾽ Παν yi) ASi- (8.135b 
5 m» dcl 


noo i= 


2. Existence Theorem 

If the function f(x, y) is continuous on the domain of integration including the boundary, then the dou- 
ble integral (8.135b) exists. (This condition is sufficient but not neces- 
u=f(x,y) sary.) 

3. Geometrical Meaning 

The geometrical meaning of the double integral is the volume of a solid 
whose base is the domain in the x, y plane, whose side is a cylindrical sur- 
face with generators parallel to the z-axis, and it is bounded above by the 


ly surface defined by u = f(x,y) (Fig. 8.31). Every term f(2x;,y;) AS; of 
2 the sum (8.135b) corresponds to an elementary cell of a prism with base 
cana) AS; and with altitude f(x;,y;). The sign of the volume is positive or neg- 
ative, according to whether the considered part of the surface u = f(x, y) 

Figure 8.31 is above or under the x, y plane. If the surface intersects the x, y plane, 


then the volume is the algebraic sum of the positive and negative parts. 

If the value of the function is identically 1 (f(x, y) = 1), then the volume has the numerical value of 
the area of the domain S in the x, y plane. 

8.4.1.2 Evaluation of the Double Integral 


The evaluation of the double integral is reduced to the evaluation of a repeated integral, i.e., to the 
evaluation of two consecutive integrals. 
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1. Evaluation in Cartesian Coordinates 

If the double integral exists, then one can consider any type of partition of the domain of integration, 
such as a partition into rectangles. After dividing the domain of integration into infinitesimal rectan- 
gles by coordinate lines (Fig. 8.324) it follows the calculation of the sum of all differentials f(a, y)dS 
starting with all the rectangles along every vertical stripe, then along every horizontal stripe. (The 
interior sum is an integral approximation sum with respect to the variable y, the exterior one with re- 
spect to x.) If the integrand is continuous, then this repeated integral is equal to the double integral 
on this domain. The analytic notation is: 


b | 22(2) b φο(ῳ) 
EC y)dS — i / f(a,y)dy| dx = n J f(x,y) dy da. (8.136a) 

E a deri) a vile) 
Here y = p(x) and y = yı (x) are the equations of the upper and lower boundary curves (AB) pove and 


(AB) cow Of the surface region S (vi < φο;Φι, 9» continuous). Here a and b are the abscissae of the 
points of the curves to the very left and to the very right. The elementary area in Cartesian coordinates 
is 


dS = dz dy. (8.136b) 
(The area of the rectangle is Ax Ay independently of the value of x.) For the first integration x is 
yA yA 
g D 
NY B | —. Ss 
3 B A p VO 
dy H dy πας Β EES 
πα S ον) 
448 α C ΝΥ μα eer, 
O0 a dx bx 0 dx x 
a) b) 
Figure 8.32 Figure 8.33 Figure 8.34 


handled as a constant. The square brackets in (8.136a) can be omitted, since according to the notation 
the interior integral is referred to the interior integration variable, the exterior integral is referred to 
the second variable. In (8.136a) the differential signs dx and dy are at the end of the integrand. It is 
also usual to put these signs right after the corresponding integral signs, in front of the integrand. 
The summation can be performed in reversed order, too, (Fig. 8.32b). If the integrand is continuous, 
then it results also in the double integral: 


B valy) 
[ feas - [ [ros ανά (8.1366) 
E ἃ wily) 
W Calculation of A — / zy? dS , where S is the surface region between the parabola y = 27 and the 
Js 
2 2r à 2 y? 2x 1 p 2 39 
line y = 2x (Fig. 8.33) as A = [ fs zy! dy dx = [ x dx |— =- | (8¢*—2")dx = — or 
Jo δα” Jo 3|42 3 Jo 5 
A VU P PM n vu 1 4, 2 2 
a=f / zy dedy = | y dy 5 al fy (y-4) y=. 
Jo Jy/2 Jo 2 "m 2 Jo 4 5 


2. Evaluation in Polar Coordinates 
The integration domain is divided by coordinate lines into elementary parts bounded by the arcs of 
two concentric circles and two segments of rays issuing from the pole (Fig. 8.34). The area of the 
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elementary domain in polar coordinates has the form 


dS = pdp dọ. (8.137a) 
(The area of an elementary part determined by the same Ap and Ay is obviously smaller being close 
to the origin, and larger far from it.) With an integrand given in polar coordinates w = f(p,y) a 
summation is to be performed first along each sector, then with respect to all sectors: 
2 p2(p) 
[ie.eas=f[ f fo. pdpao. (8.137b) 
S ¥1 pi(y) 
Here p = pile) and p = ρο(φ) are the equations of the interior and the exterior boundary curves 


(AmB ) and (ΑΒ) of the surface S and «οι and v» are the infimum and supremum of the polar angles 
of the points of the domain. The reverse order of integration is seldom used. 


W Calculation of the special integral A = / p sin? y dS, where S is the surface of the half-circle with 
Js 
p=3cosp (0 € < 7/2) (sce Fig. 8. ur 


x 7-«/2 p3cosp 2 p 
A= / P sin? ϱ dp ἀφ = [^ sin? y d 
0 (3, 0) 0 0 ) 3 


7/2 " 3 6 
Figure 8.35 =9 [ sin" pcos’ o dp = 5' 
“Ὁ h 


3. Evaluation with Arbitrary Curvilinear Coordinates u and v 


3cosq 


0 


The coordinates are defined by the relations Sonst, 

x =x(u,v), y= y(u, v) (8.138) K zu 
(see 3.6.3.1, p. 261). The domain of integration is partitioned 
by coordinate lines u — const and v — const into infinitesimal 
surface elements (Fig. 8.36) and the integrand is expressed by 
the coordinates u and v. Performing the summation along one 
strip, e.g., along v — const, then over all strips, yields 


const. 
3 


u= 
Q 
un 

5 


ug v2(u) 
[λων v)dS = / i f(u,v)|D| dv du. (8.139) Σ 


Ul vi(u) 


Here v = vı (u) and v = vz(u) are the equations of the boundary Figure 8.36 


curves AmB and AnB ofthe surface S. u, and ug denote the infimum and the supremum of the values 

of u of the points belonging to the surface S. |D| denotes the absolute value of the Jacobian determinant 
(functional determinant) 

Ox Ox 

Βία.) [δι Qv 


D= D(u,v) "| ὃν Oy (8.1402) 
Qu Ov 
The area of the elementary domain dS in curvilinear coordinates can be easily expressed: 
= |D| dv du. (8.140b) 


The formula (8.137b) is a special case of (8.139) for the polar coordinates x = pcos p, y = psiny. The 
functional determinant here is D — p. 

The curvilinear coordinates are chosen so that the limits of integration in the formula (8.139) are as 
simple as possible, and also the integrand is not very complicated. 
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Β Calculation of A = fie y) dS for the case when $ is the inte- 
5 


rior of an asteroid (see 2.13.4, p. 104), with x = acos?t, y = asin? t 
(Fig. 8.37). First the curvilinear coordinates u and v as r = 
ucos?v, y = usin? v are introduced whose coordinate lines u = αι 
represents a family of similar asteroids with equations £ = c, cos? v 
and y = c, sin? v. The coordinate lines v = cz are rays with the equa- 
tions y = Κα, where k = tan? cy holds. This gives 


cos? v —3u cos? v sin v 
ag SERS 
sinïv  3usin?vcosv 


= 3u sin? v cos? v, 


a fm 
A- / / f(x(u,v), y(u, v)) 3usin? v cos? v dv du. Figure 8.37 
o Jo 


Tabelle 8.8 Plane Elements of Area 


Coordinates Element of Area | 
Cartesian coordinates x, y dS — dy dx | 
Polar coordinates p, Φ dS = pdp dp | 
Arbitrary curvilinear coordinates u,v | dS = |D| dudv (D Jacobian determinant) | 


8.4.1.3 Applications of the Double Integral 

Some applications of the double integral are collected in Table 8.9, p. 528. The required areas of 
elementary domains in Cartesian and polar coordinates are given in Table 8.8. 

8.4.2 Triple Integrals 

The triple integral is an extension of the notion of the integral into three-dimensional domains. It also 
is called volume integral. 

8.4.2.1 Notion of the Triple Integral 

1. Definition 


One defines the triple integral of a function f(x, y, z) of three variables over a three-dimensional domain 
V analogously to the definition of the double integral. One writes: 


[ fev z)dV = [ff 16. de dy ae. (8.141) 


The volume V (Fig. 8.38) is partitioned into elementary volumes AV;. Then the products f(2;, Yi, zi) 
AV; are formed, where the point P;(z;, yi, zi) is inside the elementary volume or it is on the boundary. 
The triple integral is the limit of the sum of these products with all the elementary volumes in which the 
volume V is partitioned, then the diameter of every elementary volume tends to zero, i.e., their number 
tends to oo. The triple integral exists only if the limit is independent of the partition into elementary 
volumes and the choice of the points P;(;, yi, zi). Then holds: 


] fev. z)dV — dim, Do Flen m2) ΔΝ. (8.142) 
J 


π-λοο t= 
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Table 8.9 Applications of the Double Integral 


General Formula | Cartesian Coordinates Polar Coordinates 


1. Area of a plane figure: 


s=f dS zl dy dx = Jf doae 


2. Surface: 


' dS δολ” (82V? (f|. Q[(02zV [82V 
s=] cos y =f 14 (35) (s) ae = ff o TP (3) + (5) dpd 


3. Volume of a cylinder: 


v= [sas = [f zdyac = [f :odpaz 


S 


4. Moment of inertia of a plane figure, with respect to the x-axis: 


i= E dS = [| ? dy dx Ξ |» sin? y dp ἀρ 


S 


5. Moment of inertia of a plane figure, with respect to the pole 0: 


-fada - 3 odds = ff x i 
In |» dS [[ c asas J| ° dode 


6. Mass of a plane figure with the density function ϱ: 


M = n = ff oduds = [f epdpag 


5 


7. Coordinates of the center of gravity of a homogeneous plane figure: 


[x 45 / a dy dx 7 n cos Y dp dip 


[yas 7 ns . ΠΣ; 


2. Existence Theorem 
The existence theorem for the triple integral is a perfect analogue of the existence theorem for the 
double integral. 
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8.4.2.2 Evaluation of the Triple Integral 


The evaluation of triple integrals is reduced to repeated evaluation of three ordinary integrals. If the 
triple integral exists, then one can consider any partition of the domain of integration. 
Zz 
m Z wo, y) 
P πμ] r ης 
ZVI y) 
0 
o» 
y 
0 
Cc I =0,(x) 


Figure 8.38 Figure 8.39 


1. Evaluation in Cartesian Coordinates B 
'The domain of integration can be considered as a volume V here. A decomposition of the domain 
is formed by coordinate surfaces, in this case by planes, into infinitesimal parallelepipeds, i.e., their 
diameter is an infinitesimal quantity (Fig. 8.39). Then one performs the summation of all the products 
f (2, y, z) dV, starting the summation along the vertical columns, i.e., summation with respect to z, then 
in all columns of one slice, i.e., summation with respect to y, and finally in all such slices, i.e., summation 
with respect to r. Every single sum for any column is an approximation sum of an integral, and if the 
diameter of the parallelepipeds tends to zero, then the sums tend to the corresponding integrals, and 
if the integrand is continuous, then this repeated integral is equal to the triple integral. Analytically: 

b { 92(x) | var.) 

/ / / f (zy, z)dz| ανν dx 


a eir) Wiley) 


[ fev z)dV 


b p2(x) ¥ 
= J n a x,y, 5) dz dy dz. (8.143a) 
ἃ ur) va Go) 
Here z = q(x, y) and z = v»(x, y) are the equations of the lower and upper part of the surface bounding 


the domain of integration V (see limiting curve I in Fig. 8.39); da dy dz is the elementary volume in 
the Cartesian coordinate system. y = yi (a) and y = (2) are the functions describing the lower and 
upper part of the curve C which is the boundary line of the projection of the volume onto the x, y plane, 
and r = a and x = b are the extreme values of the x coordinates of the points of the volume under 
consideration (and also the projection under consideration). There are the following postulates for 
the domain of integration: The functions φι(α) and p2(x) are defined and continuous in the interval 
a < x < b, and they satisfy the inequality yı(x) € yo(x). The functions ψι(α, y) and v»(x, y) are 
defined and continuous on the domain a € x < b, φι(α) € y  φο(α). and also (v, y) < i» (x,y) 
holds. In this way, every point (x, y, 2) in V satisfies the relations 


a € x & b, φι(α) € y € v»(x), ψι(α,υ) € z € palz, y). (8.143b) 
Just as with double integrals, the order of integration can be changed, then the limiting functions will 
change in the same sense. (Formally: the limits of the outermost integral must be constants, and any 
limit may contain variables only of exterior integrals.) 
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W Calculate the integral J = / (y? + τῇ) dV for a pyramid bounded by the coordinate planes and the 
v 
plane x +y +z = 1: 


1 pl-z pl-z—-y , i 1 1-2 1—z-y, , F 1 
I= / / / (P? + 3) dz dy da = Í {/ |/ (δ + 2) dz] dy) de= L. 
0 J0 0 0 0 “0 30 


Figure 8.40 Figure 8.41 


2. Evaluation in Cylindrical Coordinates 
The domain of integration is decomposed into infinitesimal elementary cells by coordinate surfaces 


p = const, p = const, z = const (Fig. 8.40). The volume of an elementary domain in cylindrical 
coordinates (Table 8.10, p. 532) is 
dV = pdzdpdy. (8.144a) 


After defining the integrand by cylindrical coordinates f(p, p, 2) the integral is: 


p2 pz) 2ο(ριϱ) 
ΠΣ, dV =f / J F(p, 2, 2) pdz dp dig. (8.144b) 
" 


£1 p(o) z1 (ps) 


E Calculate the integral J = / dV for a solid (Fig. 8.41) bounded by the x, y plane, the x, z plane, 


the cylindrical surface 27 + y? = ax and the sphere x? + y? + z? = a?: 0 a? 


5 : π/ὰ pacosp pw/a?—p? 
ya —p;  p—0,pp—acosp; 9, =0, 5. I= Í , j p dz dp di 


π/ à COS p vf a2—p? 3 
i μ ή i d pao} dp = ΤΕ (3m — 4). Since f(p,y, z) = 1, the integral is equal to 
0 0 0 


the volume of the solid. 


3. Evaluation in Spherical Coordinates 

'The domain of integration is decomposed into infinitesimal elementary cells by coordinate surfaces 
r = const, p = const, J = const (Fig. 8.42). The volume of an elementary domain in spherical 
coordinates (see Table 8.10, p. 532) is 


dV = r° sin dr dd dy. (8.145a) 
For the integrand f(r, p, Ὁ) in spherical coordinates, the integral is: 
p2 Dle) ra (Pp) 


fie Q,U yav=f f j fr. φ, 0) ν᾽ sin Ὁ dr dd dig. (8.145b) 


91:019) i (9,9) 
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Figure 8.42 Figure 8.43 


cos 9 


W Calculate the integral [ -- [5 
Vv 
is the z-axis. The angle at the vertex is 2a, the altitude of the cone is h (Fig. 8.43). Consequently 


dV for a cone whose vertex is at the origin, and its symmetry axis 


holds: rı = 0, r2 


h 
; 0;,—0,05—0; yı =0,%2 = 2π. 
cos Ü 


2m ρα ph/cosd su. 2n a h/ cos? 
p= T f / ar sin 9 dr dd dp = D d cos Ü sin Ü / ar! a} dip [ 
ϱ Jo Jo r 0 0 0 


= 2r h (1 — cosa). 


4. Evaluation in Arbitrary Curvilinear Coordinates u, v, w 
The coordinates are defined by the equations 

z = z(u,v, w), y = y(u, v, w), z = z(u,v, w) (8.146) 
(see 3.6.3.1, p. 261). The domain of integration is decomposed into infinitesimal elementary cells by the 


coordinate surfaces u = const, v = const, w = const. The volume of an elementary domain in arbitrary 
coordinates (see Table 8.10, p. 532) is: 


= hood 1 —|[9y ὃν ὃ . 
— |D| du dv dw, with D= δε Em Em ; (8.147a) 
z Oz Oz 

Ou Ov Ow 
i.e., D is the Jacobian determinant. For the integrand f(u, v, w) in curvilinear coordinates u, v, w, the 
integral is: 


ug v2(u) wo(u,v) 
[fev v,w)dV — / i I s (u, v, w) | D| dw dv du. (8.147b) 


U1 vj (u) wi(u,v) 


Remark: The formulas (8.144b) and (8.145b) are special cases of (8.147b). 

For cylindrical coordinates D = p holds, for spherical coordinates D = r° sin 9 is valid. 

If the integrand is continuous, then one can change the order of integration in any coordinate system. 
A curvilinear coordinate system is chosen such that the determination of the limits of the integral 
(8.147b), and also the calculation of the integral, should be as easy as possible. 


8.4.2.3 Applications of the Triple Integral 
Some applications of the triple integral are collected in Table 8.11, p. 533. The elementary areas 
corresponding to different coordinates are given in Table 8.8, p. 527. The elementary volumes corre- 
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sponding to different coordinates are given in Table 8.10. 


Table 8.10 Elementary volumes 


Coordinates Elementary Volume 
Cartesian coordinates x, y, z dV = dx dy dz 
Cylindrical coordinates p, yp , z dV = pdp dọ dz 
Spherical coordinates r , 9 , dV =r? sin V dr dd dip 
Arbitrary curvilinear coordinates u, v, w | dV = |D| dudvdw (D Jacobian determinant) 


8.5 SurfaceIntegrals 
There are distinguished surface integrals of the first type, of the second type, and of general type, 
analogously to the three different line integrals (see 8.3, p. 515). 


8.5.1 SurfaceIntegral ofthe First Type 


The surface integral or integral over a surface in space is the generalization of the double integral, sim- 
ilarly as the line integral of the first type (see 8.3.1, p. 516) is a generalization of the ordinary integral. 


8.5.1.1 Notion of the Surface Integral of the First Type 


1. Definition 
The surface integral of the first type of a function u = f(x,y,z) of three variables defined in a connected 


domain is the integral 
| Hey) 45, (8.148a) 


over a region S of a surface. The numerical value of the surface integral of the 

first kind is defined in the following way (see Fig. 8.44): 

1. Decomposition of the region S in an arbitrary way into n elementary regions 
Figure 8.44 AS. 

2. Choosing an arbitrary point P;(x;, Yi, zi) inside or on the boundary of each elementary region AS; . 

3. Multiplication of the value f (x;, yi, zi) of the function at this point by the area AS; of the corre- 

sponding elementary region. 

4. Summation of the products f (z;, Yi, zi) A S;. 

5. Determination of the limit of the sum 


Y fs yn 2) ΔΘ (8.148b) 
i=1 


as the diameter of each elementary region tends to zero, so AS; tends to zero, hence, their number n 
tends to οο (see 8.4.1.1, 1., p. 524). 

If this limit exists and is independent of the particular decomposition of the region S into elementary 
regions and also of the choice of the points P;(x;, Yi, zi), then it is called the surface integral of the first 
type of the function u = f(x,y,z) over the region S, and one writes: 


/ f(a,y,z) dS = aim, Do f (ti yi a) ASi. (8.148c) 
5 


noo i=l 
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2. Existence Theorem 


If the function f (x, y, 2) is continuous on the domain, and the functions defining the surface have con- 
tinuous derivatives here, the surface integral of the first type exists. 


Table 8.11 Applications of the Triple Integral 


General 
Formula 


Cartesian 
Coordinates 


Cylindrical 
Coordinates 


Spherical 
Coordinates 


1. Volume of a solid 


v= / dV = 
V 


// dz dy dx 


ΠΟΥ 


III 1? sin ὃ dr d dip 


2. Axial momen 


t of inertia of a solid with respect to the z-axis 


i= [?v- 
Vv 


ΠΗ 


/ p? dz dp dip 


nii r^ sin? 0 dr dd dp 


3. Mass of a solid with the density function o 


M= fea 


V 


Jff 94 dy dx 


"i op dz dp dq 


III or? sin Ὁ dr dd dy 


4. Coordinates of the center of a homogeneous solid 


/ rdV 


πο V = 
T ydV 

yc =“ y ~ 
/ zdV 

2σ ὦ ν = 


ELLE 


p cos q dp dq dz 


If r? sin? V cos y dr d dy 


/ dz dy dx 
I y dz dy dx 


"i pdpdp dz 


I i p? sin y dp dọ dz 


nii r? sin dr dd dy 


If r? sin? J sin y dr dó dy 


nii dz dy dx 
ΠΠ 


nii pdpdp dz 
I pz dp dọ dz 


"i dz dy dx 


/ pdp dọ dz 


lll 1? sin ó dr dù αφ 
nii r? sin V cos Ὁ dr d dy 
"i r? sind dr dó dip 


8.5.1.2 Evaluation of the Surface Integral of the First Type 


The evaluation of the surface integral of the 
over a planar domain (see 8.4.1, p. 524). 


1. Explicit Representation of the Surface 


If the surface S is given by the equation 


z = z(x,y) 


first type is reduced to the evaluation of a double integra. 


(8.149) 
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in explicit form, then 
[te y,z) dS = J fle, y, ο(α. υ)]ψ 1 + p? + ¢ dz dy, (8.150a) 


is eid where S’ is ae projection of S onto the x,y plane and p and q are the partial derivatives 
Oz Oz : , 
p= an’ q= ay Here one assumes that to every point of the surface S there corresponds a unique 
xv y 

point in δ’ in the x, y plane, i.e., the points of the surface are defined uniquely by their coordinates. If 
it does not hold, one decomposes S into several parts each of which satisfies the condition. Then the 
integral on the total surface can be calculated as the algebraic sum of the integrals over these parts of 
9. 


The equation (8.150a) can be written in the form 


[re y,z)dS = J Flo v. α(α, "d (8.150b) 
des 
; i Y-y 2 
since the equation of the surface normal of (8.149) has the form 1 (see Ta- 
q = 
ble 3.29, p. 264), since for the angle between the direction of the normal and the z-axis, cosy = 


1 
Lp ο 
as an acute angle, so cosy > 0 always holds. 
2. Parametric Representation of the Surface 


holds. In evaluating a surface integral of the first type, this angle η is always considered 


If the surface S is given in parametric form by the equations 
2 --α(α,υ), y = y(u, v), z = z(u,v), (8.151a) 
(Fig. 8.45), then 


[ fov. a 
5 
= / fle(u, v), y(u, v), z(u, v) V EG — F? du dv, (8.151b) 
A 
where the functions E, F, and G are the quantities given in 3.6.3.3, 1., 


p. 263. The elementary region in parametric form is 


VEG — F? dudv = dS, (8.151c) 


Figure 8.45 


and A is the domain of the parameters u and v corresponding to the given surface region. The evaluation 
is performed by a repeated integration with respect to v and u: 


ug va(u) 


/ 90.9 dS — / / P(u,v)VEG— F?dvdu, ® = f[x(u, v), y(u, v), z(u, v)]. (8.151d) 
5 "1 vi(u) 
Here wu; and us are coordinates of the extreme coordinate lines u = const enclosing the region S 


(Fig. 8.45), and v = vi(u) and v = ve(u) are the equations of the curves AmB and AnB of the 
boundary of S. 


Remark: The formula (8.1502) is a special case of (8.151b) for 
u—-z, v=y, E-1-p, F=pq, G=14+¢. (8.152) 


3. Elementary Regions of Curved Surfaces 
The elementary regions of curved surfaces are given in Table 8.12. 
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Table 8.12 Elementary Regions of Curved Surfaces 


Coordinates Elementary Region 


Cartesian coordinates x, y, z = z(x,y) | dS = 


Cylindrical lateral surface, dS = Rdpdz 


R (const. radius), coordinates y, z 


Spherical surface R (const. radius), dS = R? sin Ὁ dd dy 
coordinates 0 , y 


Arbitrary curvilinear coordinates u,v | dS = V EG — F? du dv 
(E, F,G see differential of arc, p. 263) 


8.5.1.3 Applications of the Surface Integral of the First Type 


1. Surface Area of a Curved Surface 
S — | dS. (8.153) 
1 


2. Massofan Inhomogeneous Curved Surface S 
With the coordinate-dependent density o = f(x, y, z) it follows: 


Ms = n (8.154) 


5 


8.5.2 Surface Integral of the Second Type 


The surface integral of the second type, also called an integral over a projection, is a generalization of 
the notion of double integral similarly to the surface integral of the first type. 


8.5.2.1 Notion of the Surface Integral of the Second Type 


1. Notion of an Oriented Surface 

A surface usually has two sides, and one of them can be chosen arbitrarily as the exterior one. If the 
exterior side is fixed, it is called an oriented surface. Surfaces for which one can not define two sides are 
not discussed here (see [8.7]). 

2. Projection of an Oriented Surface onto a Coordinate Plane 

Projecting a bounded part S of an oriented surface onto a coordinate plane, e.g., onto the x,y plane, 
one can consider this projection Pr; S as positive or negative in the following way (Fig. 8.46): 

a) If the z, y plane is looked at from the positive direction of the z-axis, and one sees the positive side 
of the surface S, where the exterior part is considered to be positive, then the projection Pr;, S has a 
positive sign, otherwise it has a negative sign (Fig. 8.46 a,b). b) If one part of the surface shows its 
positive side and the other part its negative side, then the projection Przy S is regarded as the algebraic 
sum of the positive and negative projections (Fig. 8.46c). 

The Fig. 8.46d shows the projections Pr,,S and Pr,.5 of a surface δ; one of them is positive the other 
one is negative. 

The projection of a closed oriented surface is equal to zero. 
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exterior side 


L] interior side 


Figure 8.46 


3. Definition of the Surface Integral of the Second Type over a Projection 
onto a Coordinate Plane 


The surface integral of the second type of a function f(x,y,z) of three variables defined in a connected 
domain is the integral 


[ fev. 2) da dy, (8.155) 
5 


over the projection of an oriented surface S onto the x, y plane, where S is in the same domain where the 
function is defined, and if there is a one-to-one correspondence between the points of the surface and 
its projection. The numerical value of the integral is obtained in the same way as the surface integral 
of the first type except that in the third step the function value f(;, yi, zi) is not multiplied by the 
elementary region AS;, but by its projection Pr; AS;, oriented according to 8.5.2.1, 2., p. 535 on the 
x,y plane. Then holds: 


noo i—l 


πο ο» 2) Pray AS. (8.156a) 
5 


Defining analogously the surface integrals of the second type over the projections of the oriented surface 
S onto the y, z plane and onto the z, plane one gets: 


[ fev dydz= Jim, Y f(E yi2) Proe ASi (8.156b) 
S 


π-λοο i—l 


ll 


Aim, YO F (Ei, Yi, zm) Pra AS. (8.156c) 


π-λοο i=1 


ΠΠ 
5 
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4. Existence Theorem for the Surface Integral of the Second Type 

The surface integral of the second type (8.156a,b,c) exists if the function f(x, y, ο) is continuous and 
the equations defining the surface are continuous and have continuous derivatives. 

8.5.2.2 Evaluation of Surface Integrals of the Second Type 

The principal method is to reduce it to the evaluation of double integrals. 


1. Surface Given in Explicit Form 
If the surface S is given by the equation 


z= (zx, y) (8.157) 
in explicit form, then the integral (8.156a) is calculated by the formula 

ΠΠ 2) απ dy = J Fle, y, p(z, y)] dS. (8.158a) 

5 Ῥτανδ 


where Sey = Pr,,5. The surface integral of the function f(x,y,z) over the projections of the surface 
S onto the other coordinate planes is calculated similarly: 


| fe yas = | flw(y. z), y, z] dS yz, (8.158b) 
5 PryzS 


where one substitutes x = v(y, z), the equation of the surface S solved for x, and Sy, = PryS. 


[ies zdr = / fle, x(z, x), 2] dS, (8.158c) E 


PrizS 
where one substitutes y = x(z,x), the equation of the surface S solved for y, and $,, = Pr,,S. If 
the orientation of the surface is changed, i.e., if interchanging the exterior and interior sides, then the 
integral over the projection changes its sign. 
2. Surface Given in Parametric Form 
If the surface is given by the equations 


z = x(u, v), y = y(u, v), z = z(u,v) (8.159 
in parametric form, one calculates the integrals (8.156a,b,c) with help of the following formulas: 

| fev. 5) dx dy = | fitu.) (u,v), zu, pE) du dv, (8.160a 
i r D(u,v) 

E A 

[ fev. z)dydz = ΠΠ y(u, v), z(u, v)] PUR) du dv, (8.160b 
i 4 D(u, v) 

E A 

f te, y, z) dz dz = | fle, v), y(u, v), z(u, v)] Dleiz) du dv. (8.160c 
j 4 D(u,v) 


D(r.y) 94.2) D2) 
D(u,v)' D(u,v)' D(u,v) 


x,y, z with respect to the variables u and v; A is the domain of u and v corresponding to the surface S. 


Here the expressions are the Jacobian determinants of pairs of functions 


8.5.3 Surface Integral in General Form 
8.5.3.1 Notion of the Surface Integral in General Form 


If P(x, y, z), Q(x, y, 2), R(x, y, z) are three functions of three variables defined in a connected domain 
and S is an oriented surface contained in this domain, the sum of the integrals of the second type taken 
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over the projections on the three coordinate planes is called the surface integral in general form: 


[(Payae + Qdzda + Rdz dy) = [ Ράμα. f Qdzdz+ | Raz ay. (8.161) 
5 5 5 


5 


The formula reducing the surface integral to a double integral is: 


D(y.z) , Diet) , Dla) 
(P dy dz + Q dz dx + R dz dy) | H FR | du dv, (8.162) 


D(u,v) © * D(u,v) (u,v 
D(ry) D(wz) D(zm) 
D(u,v)' D(u,v)' D(u,v) 


Remark: The surface integral of vector-valued functions is discussed in the chapter about the theory 
of vector fields (see 13.3.2. p. 722). 


8.5.3.2 Properties of the Surface Integrals 


where the quantities 


, and A have the same meaning, as above. 


1. Ifthe domain of integration, i.e., the surface S, is decomposed into two parts Sı and 52 (see Fig. 
8.47), then 


[(Payae | Qdzdz + Rdz dy) = [(Padyde + Qdzde + Rdz dy) 
S δι 
+ [e dy dz + Q dz dz + Rds dy). (8.163) 


S2 


2. Ifthe orientation of the surface is reversed, i.e., the exterior and interior sides are interchanged, the 
integral changes its sign: 
[Pava + Q dz dz + Rdz dy) = [(Pdydz+Qdzde + Rdz dy), (8.164) 
St s- 


where S* and S^ denote the same surface with different orientation. 
3. A surface integral depends, in general, on the line bounding the 
surface region S as well as on the surface itself. Thus the integrals taken 
over two different non-closed surface regions δι and 5» spanned by the 
same closed curve C are, in general, not equal (Fig. 8.47): 


[P dyaz +Qdzdx + Rdz dy) 
Sı 
£ [(P dydz + Qaz dz + Rdz dy). (8.165) 
Sa Figure 8.47 


4. An application of the surface integral is the calculation of the volume V of a solid bounded by a 
closed surface S. The integral can be expressed and calculated in the form 


V= : [owa +ydzdx + zdz dy), (8.166) 

Va ap rdydz or V = 1 ydzdx or vel z dx dy or (8.167a) 
5 E S 

V= : fe dy dz + y dz dx + z dz dy) (8.167b) 


where S is oriented so that its exterior side is positive. 
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H Calculation of the volume V of a sphere with surface S according to a? + y? + 2? 
spherical coordinates x = Rsin ð cosg, y = Rsinvsiny, z = Rcos0 (0€ 0 X 7,0 
the Jacobian determinant as in (8.160a) 


D(x,y) 


πα 
D(9,v) | νο n = R cos sin Ü (8.1682) 
follows from the third integral in (8.1672) 
2g π T 
Bane! i 4; 
V= / J R? cos? 9 sin Ὁ dd dy = 2r R? J cos? V sin Ὁ d) = 378. (8.168b) 
Φ-0 9-0 0 


9 Differential Equations 


1. A Differential Equation is an equation, in which one or more variables, one or more functions 
of these variables, and also the derivatives of these functions with respect to these variables occur. The 
order of a differential equation is equal to the order of the highest occurring derivative. 
2. Ordinary and Partial Differential Equations differ from each other in the number of their 
independent variables; in the first case there is only one, in the second case there are several. 

O?z Oz Oz 

= tyz =. 

ðxðy Ox Oy 


dyV gd. | . 
W A: (2) ο. =0. W B: zd?ydx—dy(dz) = e” (dy). HC: 
T T 


9.1 Ordinary Differential Equations 
1. General Ordinary Differential Equation of Order n 


in implicit form has the equation 
F [z, ylz) y (x),..., ΠΙΟ] Ξ 0. (9.1) 


If this equation is solved for y? (1), then it is the explicit form of an ordinary differential equation of 
order n. 

2. Solution or Integral 

of a differential equation is every function satisfying the equation in an interval a € x < b which can be 
also infinite. A solution, which contains n arbitrary constants c4, co, . . . , Cn, is called the general solution 
or general integral. If the values of these constants are fixed, a particular integral or a particular solution 
is obtained. The value of these constants can be determined by n further conditions. If the values of 
y and its derivatives up to order n — 1 are prescribed at one of the endpoints of the interval, then the 
problem is called an initial value problem. If there are given values at both endpoints of the interval, 
then the problem is called a boundary value problem. 

W The differential equation —4' sin x + y cos x = 1 has the general solution y = cosa + csin x. For the 
condition c = 0 one gets the particular solution y = cos x. 

3. Initial Value Problem 

If the n values y(zo), y (zo), ... , y^P (ao) are given at zo for the solution y = y(x) of an n-th order 
ordinary differential equation, then an initial value problem is given. The numbers are called the initial 
values or initial conditions. They form a system of n equations for the unknown constants c1, C2,..., Cn 
of the general solution of the n-th order ordinary differential equation. 

W The harmonic motion of a special elastic spring-mass system can be modeled by the initial value 
problem y" + y — 0 with y(0) = yo, y(0) = 0. The solution is y = yo cos x. 

4. Boundery Value Problem 

If the solution of an ordinary differential equation and/or its derivatives are given at several points of 
its domain, then these values are called the boundary conditions. A differential equation with boundary 
conditions is called a boundary value problem. 

W The bending line of a bar with fixed endpoints and uniform load is described by the differential 
equation y" = x — a? with the boundary conditions y(0) = 0,y(1) = 0 (0 < x < 1). The solution is 


a? α΄ 2 


9.1.1 First-Order Differential Equations 


9.1.1.1 Existence Theorems, Direction Field 


1. Existence of a Solution 
In accordance with the Cauchy existence theorem the differential equation 

y = f(x.y) (9.2) 
€ Springer-Verlag Berlin Heidelberg 2015 540 
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has at least one solution in a neighborhood of ro such that it takes the value yo at x = xo if the function 
f(x, y) is continuous in a neighborhood G of the point (xo, yo). For example, G can be selected as the 
region given by |x — zo| < a and |y — yo| < b with some a and b. 

2. Lipschitz Condition 

The Lipschitz condition with respect to y is satisfied by f (v, y) if 


[/(2,381) - f(x, y2)| < NI — vel (9.3) 


holds for all (x, y1) and (x, y2) from G, where N is independent of x, yı, and ys. If this condition is 
satisfied, then the differential equation (9.2) has a unique solution through (xo, yo). The Lipschitz 
condition is obviously satisfied if f (x, y) has a bounded partial derivative Of /Oy in this neighborhood. 
In 9.1.1.4, p. 546 there are examples in which the assumptions of the Cauchy existence theorem are not 
satisfied. 

3. Direction Field 

If the graph of a solution y = y(x) of the differential equation y’ = f(x,y) goes through the point 
P(x,y), then the slope dy/dx of the tangent line of the graph at this point can be determined from 
the differential equation. So, at every point (x,y) the differential equation defines the slope of the 
tangent line of the solution passing through the considered point. The collection of these directions 
(Fig. 9.1) forms the direction field. An element of the direction field is a point together with the direc- 
tion associated to it. Integration of a first-order differential equation geometrically means to connect 
the elements of a direction field into an integral curve, whose tangents have the same slopes at all points 
as the corresponding elements of the direction field. 


/ 


AT (Xo Yo) 


0 x 


Figure 9.1 Figure 9.2 


4. Vertical Directions 
If a vertical direction can be found in a direction field, e.a., if the function f(x, y) has a pole, then one 
can change the role of the independent and dependent variables and consider the differential equation 
M (9.4) 
dy f(x,y) 
as an equivalent equation to (9.2). In the region where the conditions of the existence theorems are 
fulfilled for the differential equations (9.2) or (9.4), there exists a unique integral curve (Fig. 9.2) 
through every point P(xo, yo). 
5. General Solution 
The set of all integral curves of (9.2) can be characterized by one parameter and it can be given by the 
equation 
F(x,y,C) 20 (9.5a) 
of the corresponding one-parameter family of curves. The parameter C, an arbitrary constant, can be 
chosen freely and it is a necessary part of the general solution of every first-order differential equation. 
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A particular solution y = (x), which satisfies the condition yo = (xo), can be obtained from the 
general solution (9.5a) if C is expressed from the equation 


F(z, yo, C) = 0. (9.5b) 
9.1.1.2 Important Solution Methods 


1. Separation of Variables 
If a differential equation can be transformed into the form 


M (x)N(y)dx + P(x)Q(y)dy = 0, (9.6a) 
then it can be rewritten as 
R(x)dx + S(y)dy = 0, (9.6b) 


where the variables x and y are separated into two terms. To get this form, equation (9.6a) is divided 
by P(x)N (y). The general solution of (9.6a) is 


^ da z+ [7 Day= (9.7) 


If for some M = 
constant functions x = 
singular solutions. 


ry = J, the functions P(x) or N(y) or both are equal to zero, then the 


T 
T or/and y = y are also solutions of the differential equation. They are called 


dy 


y 
in this final equation, then one has the singular solutions y = 0 and x = 0. 


di n 
W zdy + ydx = 0; I / = C; ln|y|+ln|z| =C = In|c; yx = c. If one allows also c = 0 


2. Homogeneous Equations 

If M(x,y) and N(x, y) are homogeneous functions of the same order (see 2.18.2.6, 1., p. 122), then in 

the equation 
M (x, y)dx + N(x, y)dy = 0 (9.8) 

the variables can be separated by substitution of u = y/z. 

E r(x- y)y' +y? = 0 with y = u(z)x, gives (1 — uju’ + u/x = 0, then by separation of the variables 


l-u 
holds f E= qu x - f Zdr. After integration: In |x|+In|u—u| = C =In|e|, ux = ce", y = ce". 
u 


As can be seen in the preceding paragraph, Separation of Variables, the line x = 0 is also an integral 
curve. 


3. Exact Differential Equations 


An exact differential equation is an equation of the form 


M(a,y)dx + N(x, y)dy=0 or Νία, υ)ν' + Μία, ν) =0, (9.9a) 
if there exists a function d (zr, y) of two variables such that 
M (v, y)dz + N(x,y)dy = d(x,y), (9.9b) 


i.e., if the left side of (9.9a) is the total differential of a function Φ(α, y) (see 6.2.2.1, p. 447). If functions 
M (x, y) and N(x, y) and their first-order partial derivatives are continuous on a connected domain G, 
then the equality 


OM | ON 

Oy Ox 
is a necessary and sufficient condition for equation (9.9a) to be exact. In this case the general solution 
of (9.9a) is the function 

(x,y) =C (C = const), (9.94) 


(9.96) 


9.1 Ordinary Differential Equations 543 


which can be calculated according to (8.3.4), 8.3.4.4, p. 522 as the integral 
g y 
P(x, y) = [MEn αξ + ] Neon) dn, (9.9e) 
20 yo 


where zo and yo can be chosen arbitrarily from G. 


W Examples will be given later. 
4. Integrating Factor 
A function u(x, y) is called an integrating factor or a multiplier if the equation 
Mdz + Ndy —0 (9.10a) 

multiplied by u(x, y) becomes an exact differential equation. The integrating factor satisfies the differ- 
ential equation 
Olnp M Olnp OM ON 

Ox Oy Oy Oa” 
Every particular solution p of this equation is an integrating factor. To give a general solution of this 
partial differential equation is much more complicated than to solve the original equation, so usually 
one is looking for the solution µία, y) in a special form, e.g., u(x), p(y), (ay) or µία” + y?). 
Β To solve the differential equation (x? + y) dx — «dy = 0, the equation for the integrating factor is 
ὅ]ημ 2 ὅ]ημ ὅ]ημ 
um (y) Oy ax 


1 1 1 
—2, so u = — . Multiplication of the given differential equation by ji yields (1 + +) dr — —dy = 0. 
mm x m 


N 


(9.10b) 


2. An integrating factor which is independent of y must satisfy x 


The general solution according to (9.96) with the selection of a = 1, yo = 0 is then: 


a= f (1+ λα [ian=c ο z- 2-0, 


X 


5. First-Order Linear Differential Equations 
A first-order linear differential equation has the form 

y + P(x)y = Q(2), (9.11a 
where the unknown function and its derivative occur only in first degree, and P(x) and Q(x) are given 
functions. If P(x) and Q(x) are continuous functions on a finite, closed interval, then the differentia 
equation satisfies the conditions of the Picard-Lindelof theorem (see 12.2.2.4,4., p. 668) in this region. 
An integrating factor is here 


JL = exp (J Par) i (9.11b 


the general solution is 


y = exp (- / Par) [[ ον (J Paz) dx + c ! (9.11c 


Replacing the indefinite integrals by definite ones with lower bound xo and upper bound x in this 
formula, then for the solution y(a9) = C (see 8.2.1.2, 1., p. 495). If yi is any particular solution of the 
differential equation, then the general solution of the differential equation is given by the formula 


y=y 4 Cexp (- f Pac). (9.114) 


If y; (x) and yo() are two linearly independent particular solutions (see 9.1.2.3, 2., p. 553), then one 
can get the general solution without any integration as 


y = yı t Cly — y1). (9.116) 
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W To solve the differential equation y' — ytanx = cosx with the initial condition πο = 0, yo = 0. 


£ 
Calculating exp (- [ tan s dr) = cos z one gets the solution according to (9.11c): 
Jo 


y= 


2 ΡΝ 
6. Bernoulli Differential Equations 
The Bernoulli differential equation is an equation of the form 
y + P(x)y = Q(x)y" (nz0,nz1) (9.12) 
which can be reduced to a linear differential equation if it is divided by y" and the new variable z — 


y"! is introduced. 


T 
[ cos? x dx = 
0 


1 [> xcosx +r sin 2; x 


COS X u cos x 


Ay 
W Solution of the differential equation y' — Ha v4/y . Since n = 1/2, dividing by \/y and introducing 
2 
: ..dez 2 m . i 
the new variable z = „/y leads to the equation dnce es By using the formulas for the solution of a 
4^ xm 


1 ; z1 ο[1 
linear differential equation there is exp(f P dz) = — and z = z? |/ : 5 dt + σ =g |; In|z| + ο] ; 
x J 2m 


2 


So, finally, y = α΄ G In |z| + C 


7. Riccati Differential Equations 

The Riccati differential equation 
y = P(zx)y? + Q(a)y + R(a), (9.13a 

usually cannot be solved by elementary integration, i.e., not by using a final number of successive ele- 

mentary integrations. However it is possible to transform it by suitable substitutions into differentia. 

equations for which solutions often can be found. 

Method 1: By the substitution 


πω 


5 


= p(x 9.131 
ΓΕΝ (a) (9.13b 
the Riccati differential equation can be transformed into the normal form 
du č ; à 
d =u? + R(x)  (9.13c) with Ro(z) = P?8? c QPB + PR — Pp’. (9.13d 
X 


Therefore (a) is determined so that terms with the factor u(x) disappear. 
If a particular solution τι (x) of (9.13c) is known, which can be found, e.g., by a suitable approach, then 
by the help of the substitution 
1 
= —— ur 9.13e 

u m + us (x) (9.13e 
(9.13c) is to be transformed into the linear differential equation for z(x): 

2 *2ui(z)z —1— 0. (9.13f 
From the solution of (9.13f) the solution of (9.13a) is obtained by using (9.13e) and (9.13b). 
Method 2: By the substitution 


υ' 


~ P(x)o(2) 
(9.13a) is transformed into a linear differential equation of second order (see 9.1.2.6,1., p. 560): 


Po" — (P! + Ρῷ)υ + P?Rv =0. (9.13h 


y= (9.13g 
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> 1 4 1 4 
E To solve the differential equation y! + y? + =y — ο = 0, ie. for P 1,Q = 


x g 
Y 1 
Method 1: One gets β(α) = E" and with the help of y — —u(x) — jg, me gets the normal form 
zx a 
: 15 
uw = u -- m . Particular solutions of the normal form can be got, e.g., with the approach u — ας 
Ar T 
3 5 A 1 3 i ] "NE 
u(x) = —, uo(x) = —.—. After substituting u = —— + — the differential equation σ΄ 4- —z 4-1 = 0 
2r 2r z(r) 2r x 
> K 4K- 
follows with the solution z(x) 7 Eg j - (K const). The inverse transformation gives 
1 η 113 
2x1 +2C 
= —— (C-AK). 
4 x5 — Ox ( €) 


Method 2: According to (9.13h) the Euler differential equation z?v"--xv' —4v = 015 obtained with the 


; 1 
general solution v(x) = Cia? -- C — (see concerning the Euler differential equation, p. 557). One of 
x 


204 + 2C 
the constants C, and C$ can be chosen freely, e.g. C9 = —1 then from (9.13h) follows y = ey ; 
z?— CT 
9.1.1.3 Implicit Differential Equations 
1. Solution in Parametric Form 
Given a differential equation in implicit form 
F(x,y,y') — 0. (9.14) 
There are n integral curves passing through a point P(xo, yo) if the following conditions hold: 
a) The equation F(xo, yo, p) = 0 (p = dy/dx) has n real roots pi, . .., Pn at the point P(zo, yo). 


b) The function F(x, y, p) and its first partial derivatives are continuous at Y = ro, y = Yo, p = Pi: 
furthermore OF /Op # 0. 
If the original equation can be solved with respect to y', then it yields n equations of the explicit forms 
discussed above. Solving these equations one gets n families of integral curves. If the equation can be 
written in the form x = (y, y') or y = U(a,y’), then putting y' = p and considering p as an auxiliary 
variable, after differentiation with respect to y or x one obtains an equation for dp/dy or dp/dx which 
is solved with respect to the derivative. A solution of this equation together with the original equation 
(9.14) determines a desired solution in parametric form. 


Β To get the solution of the differential equation x = yy’ + y?, one substitutes y’ = p and gets x = 


m -— . sou E d NE! 1 
py + p°. Differentiation with respect to y and substituting » = — results in = = p + (y + 2p)7* Or 
y p p y 
dy py 2p? , . . : C + arcsin p 
- - -. Solving this equation for y one obtains y = —p + ——z———-—- (C const). 
p I-P το g q y y= -p ΠΕΠ ( ) 


Substitution into the initial equation gives the solution for x in parametric form. 


2. Lagrange Differential Equation 
'The Lagrange differential equation is the equation 


a(y')v + b(y')y + c(y') = 0. (9.15a) 
The solution can be determined by the method given above. If for p = po holds 


a(p) + b(p)p = 0, (9.15b) then α(φο)α + b(po)y + c(po) = 0 (9.15c) 


is a singular solution of (9.15a). 
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3. Clairaut Differential Equation 


The Clairaut differential equation is the special case of the Lagrange differential equation if 


a(p) + b(p)p = 0, (9.16a 
and so it can be transformed into the form 

y — yz fly’). (9.16b 
The general solution is 

y — Cx 4 f(C). (9.16c 
Besides the general solution, the Clairaut differential equation also has a singular solution, which can 
be obtained by eliminating the constant C from the equations 

y = Cx + f(C) (9.16d) and 0-z-f'(C), (9.16e 


The second equation can be obtained by differentiating the first one with respect to C. Geometrically, 
the singular solution is the envelope (see 3.6.1.7, p. 255) of the solution family of lines (Fig. 9.3). 

W Solution of the differential equation y = αυ”. The general solution is y = Cx--C?. The singular 
solution one gets with the help of the equation x + 2C = 0 to eliminate C, and hence a? + 4y = 0. 
Fig. 9.3 shows this case. 


yA 


Figure 9.3 Figure 9.4 


9.1.1.4 Singular Integrals and Singular Points 


1. Singular element 
An element (ro, yo, yo) is called a singular element of the differential equation, if in addition to the 
differential equation 


F(x,y,y) =0 (9.17a) 
it also satisfies the equation 

OF 

ay = (9.17b) 


2. Singular Integral 
An integral curve from singular elements is called a singular integral curve; the equation 

p(a,y) =0 (9.17c) 
of a singular integral curve is called a singular integral. The envelopes of the integral curves are singular 
integral curves (Fig. 9.3); they consist of the singular elements. 
The uniqueness of the solution (see 9.1.1.1, 1., p. 540) usually fails at the points of a singular integral 
curve. 
3. Determination of Singular Integrals 
Usually one cannot obtain singular integrals for any values of the arbitrary constants of the general 
solution. To determine the singular solution of a differential equation (9.17a) with p — y' one has to 
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introduce the equation 
OF _ 
Op |— 
and to eliminate p. If the obtained relation is a solution of the given differential equation, then it is 
a singular solution. The equation of this solution should be transformed into a form which does not 
contain multiple-valued functions, in particular no radicals where the complex values should also be 
considered. 
Radicals are expressions obtained by nesting algebraic equations (see 2.2.1, p. 62). If the equation of the 
family of integral curves is known, i.e., the general solution of the given differential equation is known, 
then one can determine the envelope of the family of curves, the singular integral, with the methods of 
differential geometry (see 3.6.1.7, p. 255). 


0 (9.17d) 


: , : , 45,8, ΠΗ : 
H Solution of the differential equation 2 — y — ae + τ. = 0. Substituting y' = p, the calculation 
m R : . 8 8 . E : ; 
of the additional equation with (9.17d) yields ορ + σὰ = 0. Elimination of p results in equation a) 
4 
r—y-O0andb)r—y-— ορ) where a) is not a solution, b) is a solution, a special case of the general 


solution (y — C)? = (x — ΟἽ”. The integral curves of a) and b) are shown in Fig. 9.4. 
4. Singular Points of a Differential Equation 
Singular points of a differential equation are the points where the right side of the differential equation 


y — f(x,y) (9.18a 
is not defined. This is the case, e.g., in the differential equations of the following forms: 
1. Differential Equation with a Fraction of Linear Functions 
dy ax+ by 
dr  cr-ey 


(ae — bc z- 0) (9.18b 


hasan isolated singular point at (0, 0), since the assumptions of the existence theorem are fulfilled almos 
at every point arbitrarily close to (0, 0) but not at this point itself. The conditions are not fulfilled αἱ 
the points where cx + ey = 0. One can force the fulfillment of the conditions at these points exchanging 
the role of the variables and considering the equation 
dr — cx +ey 
dy απ 
The behavior of the integral curve in the neighborhood of a singular point depends on the roots of the 
characteristic equation 
A? — (b+ c)À + bc — ae — 0. (9.18d 
The following cases can be distinguished: 


(9.18c 


Case 1: If the roots are real and they have the same sign, then the singular point is a branch point. 'The 
integral curves in a neighborhood of the singular point pass through it and if the roots of the character- 
istic equation do not coincide, they have a common tangent except for one. If the roots coincide, then 
ither all integral curves have the same tangent, or there is a unique integral curve passing through the 
singular point in each direction. 


[e] 
μη 


d 
W A: For the differential equation ES 

p 
Ay = 1. The integral curves have the equation y = C x? (Fig. 9.5). The general solution also contains 
the line x = 0 considering the form x? = Οι y. 
x , dy αγ. ιο ; 
W B: The characteristic equation for T is A^ — 2A +1 = 0, Ay = àz = 1. The integral curves 
da: x 


21 , 
=! the characteristic equation is A? — 3A +2 = 0, A, = 2. 
i 
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are y = «In |x| + Ca (Fig. 9.6). The singular point is a so-called node. 


d , 
W C: The characteristic equation for = igi —241 0, Ar = A2 
dz s 


I 1. The integral curves 


] x 
are y = C x (Fig. 9.7). The singular point is a so-called ray point. 


y4 yt » 
x 2» x 


Figure 9.5 Figure 9.6 Figure 9.7 


xy 


Case 2: If the roots are real and they have different signs, the singular point is a saddle point, and 
two of the integral curves pass through it. 


d , 
W D: The characteristic equation for um =- is} —-1= 0, λι = +1, Ap = —1. The integral curves 
d x 


are xy = C (Fig. 9.8). For C = 0 the particular solutions x = 0, y = 0 hold. 
Case 3: Ifthe roots are conjugate complex numbers with a non-zero real part (Re(A) # 0), then the 


singular point is a spiral point which is also called a focal point, and the integral curves wind about this 
singular point. 


dy uty. 


E E: The characteristic equation for i = is à? — 2A + 2 = 0, ` 1 +i, A, = 1— i. The 
dr  r-—y 
integral curves in polar coordinates are r = C e? (Fig. 9.9). 
A A 
P ; NS : 
(oN 
WH oa 
Figure 9.8 Figure 9.9 Figure 9.10 


Case 4: If the roots are pure imaginary numbers, then the singular point is a central point, or center, 
which is surrounded by the closed integral curves. 


dy 
W F: The characteristic equation for d 


Ξ is à? +1 = 0, à =i, λο i. The integral curves are 
y 


x? +y? =C (Fig. 9.10). 

2. Differential Equation with the Ratio of Two Arbitrary Functions 
dy _ P(x,y) 
== - 9.104. 
ΤΊΝΕΣ nm 
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has the singular points for the values of the variables where 

P(z.y) = Q(, y) = 0. (9.19) 
If P and Q are continuous functions and they have continuous partial derivatives, (9.19a) can be written 
in the form 

dy _ a(x — το) + b(y — yo) + Pi, y) 

dr — c(r— xo)-- e(y — yo) + Qi(v, y) 
Here zo and yo are the coordinates of the singular point and Pj (x, y) and Qi(z, y) are infinitesimals of a 
higher order than the distance of the point (x, y) from the singular point (ro, yo). With these assump- 
tions the type of a singular point of the given differential equation is the same as that of the approximate 
equation obtained by omitting the terms P, and Q4, with the following exceptions: 


(9.19c) 


a) If the singular point of the approximate equation is a center, the singular point of the original equa- 
tion is either a center or a focal point. 


ü b 

b) If ae — bc = 0, i.e., - or a = c = l or a = b = 0, then the type of singular point should be 
c ο 

determined by examining the terms of higher order. 


9.1.1.5 Approximation Methods for Solution of First-Order Differential 
Equations 


1. Successive Approximation Method of Picard 
The integration of the differential equation 


y = f(x,y) (9.20a) 
with the initial condition y = yo for x = 20 results in the fixed-point problem 


y — yo / f(x,y) da. (9.20b) 


Substituting another function y; (x) instead of y into the right-hand side of (9.20b), then the result will 
be a new function y2(x), which is different from γι (x), if yı (x) is not already a solution of (9.20a). Sub- 
stituting y2(x) instead of y into the right-hand side of (9.20b) gives a function y3(a). If the conditions 
of the existence theorem are fulfilled (see 9.1.1.1, 1., p. 540), the sequence of functions 4, Y2, ya. . .- 
converges to the desired solution in a certain interval containing the point 20. 

This Picard method of successive approximation is an iteration method (see 19.1.1, p. 949). 

Β Solve the differential equation y’ = eë — y? with initial values zo = 0, yo = 0. Rewriting the equa- 
tion in integral form and using the successive approximation method with an initial approximation 


c T 
yo(x) = 0 gives: yı / edr=e*—-1, ys if 


0 


, 15; 
[e (e* 1] dx = 3e* τα τν etc. 


2. Solution by Series Expansion 
The Taylor series expansion of the solution of a differential equation (see 7.3.3.3, 1., p. 471) can be 
given in the form 
Tq ^2 m man 

y = yo + (£ — xo)uo 4 - - yo" ἠ-::'ἡ u yo +++ (9.21) 
if the values yo’, yo", ... , yo”, ... of all derivatives of the solution function are known at the initial 
value xo of the independent variable. The values of the derivatives can be determined by successively 
differentiating the original equation and substituting the initial conditions. If the differential equation 
can be differentiated infinitely many times, the obtained series will be convergent in a certain neigh- 
borhood of the initial value of the independent variable. This method can be used also for n-th order 
differential equations. 
Remark: The above result is the Taylor series of the function, which may not represent the function 
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itself (sce 7.3.3.3, 1., p. 471). 
It is often useful to substitute the solution by an infinite series with unknown coefficients, and to de- 
termine them by comparing coefficients. 


Bl A: To solve the differential equation y! = e? — y?, xo = 0, yo = 0 one can consider the series 


y = a4x + aga? + aga? ++- + αμα" +--+. Substituting this into the equation considering the formula 
(7.88), p. 470 for the square of the series gives 
zog? 

à + 2a5x + 3a3x? + +++ + αι αἱ + 201832? + (a5? + 2a1a3)* Ες 1 1-α 4 7 | E: Eee 

i , . : 2 1 ii . 
Comparing coefficients gives: αι 1, 242 1, 8a3 + ai^ = =, 4a, + 2αιαο = e etc. Solving 
these equations successively and substituting the coefficient values into the series representation yields 

ο 

x £ 5 4 
yan Ele 

2 6 24 


W B: The same differential equation with the same initial conditions can also be solved in the fol- 
lowing way: Substituting x = 0 into the equation, gives yo’ = 1 and successive differentiation yields 
y = et — Qyy!, yo" = 1, y" = e — 2y? — 2yy", y" = —, yO = e — 6y'y" — 2yy", yo = —5, ete. 
2 3 E y4 

. x x δα 
From the Taylor theorem (see 7.3.3.3, 1., p. 471) follows the solution y = x 4 21 3 Π Eee 
3. Graphical Solution of Differential Equations 
The graphical integration of a differential equation is a method, which is 
based on the direction field (see 9.1.1.1, 3., p. 541). The integral curve 
in Fig. 9.11 is represented by a broken line which starts at the given 
initial point and is composed of short line segments. The directions of 
the line segments are always the same as the direction of the direction 
field at the starting point of the line segment. This is also the endpoint 
Figure 9.11 of the previous line segment. 


4. Numerical Solution of Differential Equations 
The numerical solutions of differential equations will be discussed in detail in 19.4, p. 969. Numerical 


methods are used to determine a solution of a differential equation, if the equation y’ = f(x,y) does 

not belong to the special cases discussed above whose analytic solutions are known, or if the function 

f(x,y) is too complicated. This can happen if f(x, y) is non-linear in y. 

9.1.2 Differential Equations of Higher Order and Systems of 
Differential Equations 

9.1.2.1 Basic Results 


1. Existence of a Solution 


1. Reduction to a System of Differential Equations Every explicit n-th order differential equa- 
tion 


y? = f (zy yis uy mmm (9.222) 
by introducing the new variables 
WW. Vac ssa m y (9.22b) 
can be reduced to a system of n first-order differential equations 
d di dy, 
fam, amy. A = Mey Yis- I1): (9.22c) 


dx 
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2. Existence of a System of Solutions The system of n differential equations 


dyi . 

qe mess) ($ — 1,2,...,n), (9.23a 
which is more general than system (9.22c), has a unique system of solutions 

vi =yi(t) (6 =1,2,...,n), (9.23b 
which is defined in an interval 2 — h € x < zo +h and for x = x takes the previously given initia 
values y;(zo) = yj (i — 1,2,..., n), if the functions f;(v, yi, yo; . .. , Yn) are continuous with respect to 


all variables and satisfy the following Lipschitz condition. 
3. Lipschitz condition For the values x, y; and y; + Ay;, which are in a certain neighborhood o 
the given initial values, the functions f; satisfy the following inequalities: 


[iG yı + Ayn, yo + Ayss ss n + AYn) — fie ns ess) 
< K ([Awi] + [Aye] +--+ + |Agn) (9.24a 
with a common constant ΜΚ (see also 9.1.1.1, 2., p. 541). 
This fact implies that if the function f(x, y, y', . .. , y" P) is continuous and satisfies the Lipschitz con- 
dition (9.24a), then the equation 
ο ΠΠ 
has a unique solution with the initial values y(xo) = yo, (xo) = yo, ..., y" P (vo) = yo), and it 


is (n — 1) times continuously differentiable. 
2. General Solution 
1. The general solution of the differential equation (9.24b) contains n independent arbitrary con- 
stants: 

y = y(x, Ci, C2,- -, Cn). (9.25a) 
2. In the geometrical interpretation the equation (9.25a) defines a family of curves depending on n 
parameters. Every single one of these integral curves, i.e., the graph of the corresponding particular 


solution, can be obtained by a suitable choice of the constants C1, C5,..., Cn. If the solution has to 
satisfy the above initial conditions, then the values Ci, C5, ..., Cn are determined from the following 
equations: 
y(xo, ο C) — yo 
d 
ER Css; ο) = Yo; (9.25b) 
aq P 
dr 
2:5, OF = yo) 
= y(x, σι , : ΜΝ Yo 


If these equations are inconsistent for any initial values in a certain domain, then the solution is not 
general in this domain, i.e., the arbitrary constants cannot be chosen independently. 

3. The general solution of system (9.23a) also contains n arbitrary constants. This general solution 
can be represented in two different ways: Either it is given in a form which is solved for the unknown 
functions 


y= Μια, Ci, ttt) Ch), yo = Γαία, σι, tee Ch), Un = Fas Οι, E ,C«) (9.26a) 
or in the form which is solved for the constants 
πο yu +++; Yn) = Ci, ο uus -3 Yn) = Cn ο Yn) = Cn- (9.26b) 


In the case of (9.26b) each relation 
vic, ye E» =C; (9.269) 
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is a first integral of the system (9.23a). The first integral can be defined independently of the genera 
solution as a relation (9.26c). That 15, (9.26c) will be an identity replacing yi, y; . ιν, Yn by any partic- 
ular solution of the given system and replacing the constant by the arbitrary constant C; determined 
by this particular solution. 

If any first integral is known in the form (9.26c), then the function y;(x, y1,..., Yn) satisfies the partia. 
different equation 


Ovi ὃ Ovi 
Pi + filt, Yi,- Yn) Pi -+ falt, gus Yn) = 0. (9.26d 
Ox πι OYn 


Conversely, each solution y;(2, Y1,- - - , Yn) of the partial differential equation (9.264) defines a first inte- 
gral of the system (9.23a) in the form (9.26c). The general solution of the system (9.23a) can be repre- 
sented as a system of n first integrals of system (9.23a), if the corresponding functions y; (x, Y1,- - - , Yn 
(i = 1,2,...,n) are linearly independent (see 9.1.2.3, 2., p. 553). 


9.1.2.2 Lowering the Order 


One of the most important solution methods for n-th order differential equations 


f nga =0 (9.27) 
is the substitution of variables in order to obtain a simpler differential equation, especially one of lower 
order. Different cases can be distinguished. 

1. f=flyy’,---,y™), i.e., £ does not appear explicitly: 


f (y.y, woe ») = 0. (9.28a) 
By substitution 

dy dy dp 

ag OO md 381 

dn Πρ Pay (9.28b) 


the order of the differential equation can be reduced from n to (n — 1). 

E Reducing the order of the differential equation yy" — y? = 0 to one, with the substitution y' = 
p,pdp/dy = y" it becomes a first-order differential equation y p dp/dy — p? = 0, and y dp/dy — p = 0 
results in p = C y = dy/dx, y = CieC*. Canceling p does not result in a loss of a solution, since p = 0 
gives the solution y = Ci, which is included in the general solution with C = 0. 

2. f=f(x,y’,...,y™), ie., y does not appear explicitly: 


F (s, y...) =0. (9.294) 
The order of the differential equation can be reduced from n to (n — 1) by the substitution 

y =p. (9.29b) 
If the first k derivatives are missing in the initial equation, then a suitable substitution is 

yD =p. (9.999) 


E The order of the differential equation y" — zy" + (y")? = 0 will be reduced by the substitution 


d, d; 
y” = p, so one gets a Clairaut differential equation p — d 3E ( 2)- 0 whose general solution 


dx dx 
f . C a e 32 ; . 
isp = Cia Οἱ). Therefore, y = E z + Coax + Cs. From the singular solution of the 
] : : : 2/3 3/2 , -— | Lt : 
Clairaut differential equation p — po one gets the singular solution of the original equation: 


a al? 4 C + Cs. 
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3. f (x, ο RET y) is a homogeneous function (see 2.18.2.4, 4., p. 122) 
in y, y^ y"... y 9: 


f (x, 359 sss sy) =0. (9.30a) 
One can reduce the order by the substitution 
y 
ᾱ--, ie, y= ef =t, (9.30b) 
y 
Bl Transforming the differential equation yy" — y^ = 0 by the substitution z = y'/y, results in ES = 
yy" _ y? T 
T—, = 0 so the order is reduced by one. One gets z = Ci, therefore, In|y| = Ciz + Ch, or 
y 


y = Cc'* with In|C| = C2. 
4. f=f(a,y,y’,---,y™) isa function of only a: 


y™ = f(x). (9.31a) 
One gets the general solution by n repeated integrations. It has the form 

y = Οι + Con + Cyn? +- + Cyn"! η w(x) (9.31b) 
with 


ψ(α) = / M / f(x) (dz)" conf f(t)z — t) dt. (9.31c) 


It has to be mentioned here that ro is not an additional arbitrary constant, since the change in zo results 
in the change of Cp because of the relation 
1 


Ch πτηρ 


-D (gi). (9.31d) 


9.1.2.3 Linear n-th Order Differential Equations 


1. Classification 
A differential equation of the form 


η ΜΙ =9 
y? + ayy" + agy"? +++) + aniy + any = F (9.32) 
is called an n-th order linear differential equation. Here F and the coefficients a; are functions of 2, 
which are supposed to be continuous in a certain interval. If a1, a2,...,an are constants, it is called 


a differential equation with constant coefficients. If F = 0, then the linear differential equation is 
homogeneous, and if F Æ 0, then it is inhomogeneous. 

2. Fundamental System of Solutions 

A system of n solutions y1, ys, ... , Yn of a homogeneous linear differential equation is called a funda- 
mental system if these functions are linearly independent on the considered interval, i.e., their linear 
combination ΟΊ yi + C2 yo +++++ Ch Yn is not identically zero for any system of values C, C5, ..., Cs, 
except for the values C? = C3 =--- = C, = 0. The solutions yi, ys, ..., Yn of a linear homogeneous dif- 
ferential equation form a fundamental system on the considered interval if and only if their Wronskian 
determinant 


νι Ya Yn 
wap ο in" (9.33) 
y; 7D. yD... yp =D) 
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is non-zero. For every solution system of a homogeneous linear differential equation the formula of 
Liouville is valid: 


T 
W(x) = W (zo) exp (- / "NOS ar) ; (9.34) 
20 

It follows from (9.34) that if the Wronskian determinant is zero somewhere in the solution interval, 
then it can be only identically zero. This means: The n solutions yi, y2,...,Yn of the homogeneous 
linear differential equation are linearly dependent if even for a single point xo of the considered interval 


W (ao) = 0. If the solutions y1, yo, .. . , Yn form a fundamental system of the differential equation, then 
the general solution of the linear homogeneous differential equation (9.32) is given as 
y = Ciy t C» ya t + Cs ya. (9.35) 


A linear n-th order homogeneous differential equation has exactly n linearly independent solutions on 
an interval, where the coefficient functions a;(x) are continuous. 
3. Lowering the Order 


If a particular solution y; of a homogeneous differential equation is known, by assuming 


y = yr(x)u(x) (9.36) 
one can determine further solutions from a homogeneous linear differential equation of order n — 1 for 
u' (ac). 
4. Superposition Principle 
If y; and y are two solutions of the differential equation (9.32) for different right-hand sides ΕΙ and 
Fy, then their sum y = yi + y» is a solution of the same differential equation with the right-hand side 
F = ΓΙ Fp. From this observation it follows that to get the general solution of an inhomogeneous 
differential equation it is sufficient to add any particular solution of the inhomogeneous differential 
equation to the general solution of the corresponding homogeneous differential equation. 
5. Decomposition Theorem 
If an inhomogeneous differential equation (9.32) has real coefficients and its right-hand side is complex 
in the form F = Fi + iF) with some real functions F, and F5, then the solution y = yı + iyə is also 
complex, where y; and y» are the two solutions of the two inhomogeneous differential equations (9.32) 
with the corresponding right-hand sides F and F>. 
6. Solution of Inhomogeneous Differential Equations (9.32) by Means 

of Quadratures 

If the fundamental system of the corresponding homogeneous differential equation is already known, 
there are the following two solution methods to continue the calculations: 
1. Method of Variation of Constants Looking for the solution in the form 

y = Cun + Coya to Cun (9.37a) 


where C, C5,..., C4, here treated as functions of x. There are infinitely many such functions, but 
requiring that they satisfy the equations 


Ci'yı + Cy yo ο = 0, 
Cy; s [07474 dress Ον Yn! = 0, (9.37b) 
Cy 4 Cy)? decies) C, y, 7» =0 
and substituting y into (9.32) with these equalities follows 
Cryo =n Ον qirra ος Ὁ =F. (9.37c) 


Because the Wronskian determinant of the coefficients in the linear system of equations (9.37b) and 
(9.37c) is different from zero, one gets a unique solution for the unknown functions Cy’, Cs, ..., Ch’, 
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and their integrals give the functions C1, C2,...,Ch. 
2 1 
W y" y y-r-—l. (9.37d) 
lg lg 
In the interval x > 1 or « 1 all assumptions on the coefficients are fulfilled. First the homogeneous 
: 1 
equation 7” + D τα. 0 is solved. A particular solution is e; = e”. Then one looks for 
μμ. μα. 


a second one in the form ο = e*u(r), and with the notation u'(r) = v(x) one gets the first-order 


1 
differential equation v' + (1 + =) v = 0. A solution of this equation is v(x) = (1 — x)e~*, and 
x 


x 


therefore, u(x) = EO dr = Ja — x)e "dx = πε". With this result φο = x is obtained for 


the second element of the fundamental system. The general solution of the homogeneous equation is 
τα) = Cie” + Cox. The variation of constants with u(x) and us(r) instead of C1 (x) and C(x) is now: 


να) = u (α)οἳ + υρ(α)α, 
y (x) = ux(x)e* + uo(x) + uj (x)e* + uz (αλα, υι(α)ε" um z)z — 0, 
y'(x) = u(x) + ui (x)e* + ο (x), M je? -uy(r) 2x —1, so 


uj(r)— ze, uy(r) 2-1, ie, wu(r)—-(1-—-z)e *- Ci u(x) = --ᾱ + Cs. 
With this result the general solution of the inhomogeneous differential equation is: 

y(z) = —(1 + 22) + Che” + (Cy — 1) = —(1 + 22) + Ce + Cs. (9.37e) 
2. Method of Cauchy In the general solution 

y = Cun + Coya + +++ + CnYn (9.38a) 
of the homogeneous differential equation associated to (9.32) one determines the constants such that 


for an arbitrary parameter a the equations y = 0, y' = 0,..., y"-? = 0, y("-9 = F(a) are satisfied. 
In this way one gets a particular solution of the homogeneous equation, denoted by y(x, a), and then 


y= fot (x, a) da (9.38b) 


ERI 


isa particular solution of the inhomogeneous differential equation (9.32). This solution and their deriva- 
tives up to order (n — 1) are equal to zero at the point x = πρ. 

W The general solution of the homogeneous equation associated to the differential equation (9.37d) 
which has been solved by the method variation of constants is y = Cie* + Cox. From this result 
follows γ(α) = Cie? + Coa = 0, y'(a) = Cie? + Co = a — 1 and y(x,a) = ae ?e* — x, so that 
the particular solution y(x) of the inhomogeneous differential equation with (τρ) = (απο) = 0 is: 


T 
= / (ae~%e” — x) da = (xo + 1)e* 79 + (a — Ί)α — a? — 1. With this result one can get the 
“20 


general solution y(x) = Ci*e* + Cy*x — (x? + 1) of the inhomogeneous differential equation. 
9.1.2.4 Solution of Linear Differential Equations with Constant 
Coefficients 


1. Operational Notation 
The differential equation (9.32) can be written symbolically in the form 


P,(D)y = (D" + a D” + aD"? «as 4D + an)y =F, (9.39a) 
where D is a differential operator: 
di ' d* 
Ρυ--,, Dty= τς. (9.39b) 


dx’ da* 
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If the coefficients a; are constants, then P,(D) is a usual polynomial in the operator D of degree n. 


2. Solution of the Homogeneous Differential Equation with Constant Coefficients 
To determine the general solution of the homogeneous differential equation (9.39a) with F = 0, 1.6., 


P,(D)y = 0 (9.40a) 
one has to find the roots r4, ro,. .., r, of the characteristic equation 

P,(r) =r” ἠ- avr" 1 4 aor"? |... p ay ar + a, = 0. (9.40b) 
Every root r; determines a solution e'** ofthe equation P,(D)y = 0. Ifaroot r; has a higher multiplicity 
k, then ze"*, a?2e7*, ,.. , z^-le"i* are also solutions. The linear combination of all these solutions is the 
general solution of the homogeneous differential equation: 

y = Cre + Coe? e H e (O; + Cum bb Ouais) es (9.40c) 


If the coefficients a; are all real, then the complex roots of the characteristic equation are pairwise 
conjugate with the same multiplicity. In this case, for rı = a 1-18 and r2 = a — i8 one can replace the 
corresponding complex solution functions e™” and e'?* by the real functions e** cos Bx and ε΄“ sin Bx. 
The resulting expression C1 cos Bx + C» sin 8x can be written in the form Acos(f + p) with some 
constants A and q. 

W In the case of the differential equation y® + y® — y" — y = 0, the characteristic equation is rê + 
rt — r? — 1 = 0 with roots rı = 1, ra l, r34 = i, T56 i. The general solution can be given in 
two forms: 


(C3 + Cyx) cosx + (C5 + Cox) sina, or 

yee F p1) + Az cos(z + vs). 
3. Hurwitz Theorem 
In different applications, e.g., in vibration theory, it is important to know whether a solution of a given 
homogeneous differential equation with constant coefficients tend to zero for x — --oo or not. It tends 
to zero, obviously, if the real parts of the roots of the characteristic equation (9.40b) are negative. 
According to the Hurwitz theorem an equation 

αμα" απ. az" d +--+ ax + a9 =0 (9.41a) 
has only roots with negative real part if and only if all the determinants 


αι αρ 0 0 
αι a iy ao D i i a 0 
D,=a, Ώο] ! 9|, D= ]αφαραι],..., D,2| 9274 
a3 02 μ.μ... 
mm 00 0 ...a, 
(with a, — 0 for mn) (9.415) 
are positive. The determinants Dj; have on their diagonal the coefficients a1, a2, ..., ay (k =1,2,...,n), 


and the cocfficient-indices are decreasing from left to right. Coefficients with negative indices and also 
with indices larger than n are all put to 0. 
W For a cubic polynomial the determinants have in accordance to (9.41b) the following form: 


"T αι αρ 0 
Dia, Ίος i MF 3 = | 43 Q2 a1}. 
a3 02 
0 0 ag 


4. Solution of Inhomogeneous Differential Equations with Constant Coefficients 

These differential equations can be solved by the method variation of constants, or by the method of 
Cauchy, or with the operator method (see 9.2.2.3, 5., p. 588). If the right-hand side of the inhomo- 
geneous differential equation (9.32) has a special form, then a particular solution can be determined 
easily. 


1. Form: F(a) = Ae**, P,(a) #0 (9.42a) 
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A particular solution is 


Pee . 9.42b 
Pla) (9.42b) 
If a is a root of the characteristic equation of multiplicity m, i.e., if 
P,(a) = B, (a) =... = P, "79 (a) = 0, (9.42c) 
Ag n eoc 
then y = ΓΣΠ is a particular solution. These formulas can also be used by applying the decompo- 
να 
sition theorem, if the right side is 
F(x) = Ae™ coswa or Ae?*sinwz. (9.42d) 


The corresponding particular solutions are the real or the imaginary part of the solution of the same 
differential equation for 

F(x) = Ae*(coswa +isinwa) = Ac (9.42e) 
on the right-hand side. 
ΒΛ: For the differential equation y" — 6y' + 8y = ο””, the characteristic polynomial is P(D) = 
D? — 6D +8 with P(2) = 0 and P'(D) = 2D —6 with P'(2) = 2-2—6 = —2, so the particular solution 


. xe? 
is y = — 
i 2 
E B: The differential equation y" + y! + y = e” sin z results in the equation (D? + D + 1)y = e+, 
f . ο ο e*(cosx + isinz) . i 
From its solution y — - - one gets a particular solution y; = 
(1+i)?+(1+i)4+1 2+ οἱ 


at 
5e sin x — 3 cos x). Here yi is the imaginary part of y. 


2. Form: F(x) = Q,(x)e^*, Qn(x) is a polynomial of degree n (9.43) 
A particular solution can always be found in the same form, i.e., as an expression y = R(r)e^*. R(x) is 
a polynomial of degree n multiplied by x” if a is a root of the characteristic equation with a multiplicity 
m. Considering the coefficients of the polynomial R(x) as unknowns and substituting the expression 
into the inhomogeneous differential equation a linear system of equations is obtained for the coeffi- 
cients, and this system of equations always has a unique solution. 

This method is very useful especially in the cases of F (x) = Q, (x) for a = 0and F(x) = Q,(x)e'* coswax 
or F(x) = Q,(x)e'* sinwz for a = r + iw. There is a solution in the form y = z"e'*(M, (x) cos wax + 
Νι(ο) sin wa]. 


E The roots of the characteristic equation associated to the differential equation y + 2" + y" = 
6x + 2rsinz are ky = kp = 0, k3 = ky = —1. Because of the superposition principle (see 9.1.2.3, 4., 
p. 554), one can calculate the particular solutions of the inhomogeneous differential equation for the 
summands of the right-hand side separately. For the first summand the substitution of the given form 
yi = a? (ax +b) results in a right-hand side 12a +2b+6ax = 62, and so: a = 1 and b = —6. For the sec- 
ond summand one substitutes yy = (cx-4-d) sin x-- (f x-4-g) cos x. One gets the coefficients by coefficient 
comparison from (2g +2f — 6c--2 f x) sinx—(2c+2d+6f+2cx) cos c = 2rsinz,soc = 0, d = —3, f = 
1, g = —1. Therefore, the general solutionis y = οι Γωα--θα” 4-1? + (c3a-+¢4)e—* —3 sin x+(x—1) cos x. 


3. Euler Differential Equation 
'The Euler differential equation 
n 
Y arles + d)"y™® = F(x) (9.44a) 
k=0 
can be transformed with the substitution 


cr+d=e' (9.44b) 
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into a linear differential equation with constant coeficients: 


E The differential equation z?y" — ὅαγ' + 8y = a? is a special case of the Euler differential equation 
for n = 2. With the substitution x = εἴ it becomes the differential equation discussed earlier in A, 
dy dy Uo 


p 
dB x +8y . The general solution is y = Ce?! + Cre" σε’ Cia? το, 7 πω. 


p. 557: 


9.1.2.5 Systems of Linear Differential Equations with Constant 
Coefficients 


1. Normal Form 


The following simple case of a system of first-order linear differential equations with constant coeffi- 
cients is called a normal system or a normal form: 


1 

νι = ai + 122 ἡ F d1nln 

Ua! 4 , fi fi 4 

p 211) 0221: ας n; 

y2 2181 T 02292 1 T Q2nUn; (9.454) 
1 

Yn GniY1 + Qn2U2 + `+ + annYn- 


To find the general solution of such a system, one has to find first the roots of the characteristic equation 


11 — T A12 e Ain 
21 193 — T ... Gan =D (9 45b) 
απι απο ann — T 


Το every single root r; of this equation there is a system of particular solutions 


= Aye", y» — Aye", Den = Age i, (9.45c 

whose coefficients Αμ (k = 1,2,...,n) are determined from the homogeneous linear equation system 

(ai. — ri) A1 ου ο +*+ + ainAn = 0, 

LITE (9.45d 

απ Ay + Gn2Ag ++: + (ann — ri) As = 0. 
This system gives the relations between the values of the coefficients Αχ (see Trivial Solution and Fun- 
damental System in 4.5.2.1, 2., p. 309). For every r;, the particular solutions determined this way wil 
contain an arbitrary constant. If all the roots of the characteristic equation are different, the sum o 


these particular solutions contains n independent arbitrary constants, so in this way one gets the gen- 
eral solution. If a root r; has a multiplicity m in the characteristic equation, the system of particular 
solutions corresponding to this root has the form 


= Ai(x)e"*, = Asx(x)e"*,..., = An(x)e"*, (9.45e) 
ΜΗ a 1(2)...., A, G "n are polynomials of HA at most m — 1. After substituting these expressions 
with unknown coefficients of the polynomials A; (x) into the differential equation system one first can 


cancel the factor e'*, then one compares the coefficients of the different powers of x to have linear 
equations for the unknown coefficients of the polynomials, and among them m can be chosen freely. In 
this way, one gets a part of the solution with m arbitrary constants. The degree of the polynomials can 
be less than m — 1. 

In the special case when the system (9.45a) is symmetric, i.e., when aip = api, then it is sufficient to 
substitute A;(r) = const. For complex roots of the characteristic equation, the general solution can be 
transformed into a real form in the same way as has been shown for the case of a differential equation 
with constant coefficients (see 9.1.2.4, p. 555). 


W For the system yi! = 2yi + 2yo — ya, yo’ 2yi + 4y2+ ya, ya! = —3yi + 8y2 + 2y3 the characteristic 
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equation has the form 


3-τ 2 -1 
-2 4-r 1 
-3 8 2-r 


For the simple root rı = 6 one gets 441-245 — A; = 0, 244 — 2A +A; = 0, 3414-845 — 4A; = 0. 


= -(r- 6)(r - 1)? = 0. 


1 : ; 
From this system one has A; = 0, Ag 5349 C, y1 = 0, y2 = Cre®, ys = 201οὔ", For the multiple 


root τὸ = 1 one gets y; = (Pia + Qi )e*, yo = (Pox + Qo)e*, ya = (Psx + Q3)e”. Substitution into the 
differential equations yields 


Piz + (Pi + Qi) = (2P, + 2P, — P3)a + (2Q1 + 2Q2 — Qs), 
Poa + (P) + Q5) = (-2P, + AP, + P3)a + (—2Q1 + 4Q2 + Q3), 
Psx + (P3 + Q3) = (--3Ρι + 87) + 2P3)a + (—3Q1 + 8Q2 + 2Q3), 


which implies that P, = 5C5, Py = Co, P3 = 7C2, Qi 5C3 6C5, Q2 = C3, Q3 = TC3 11C. The 
general solution is yj = (ὅσα + 5C3 — 6C2)e”, y2 = Cre® + (Cox + C3)e”, ya = 2C469* + (TCox + 
7C3 — 11C3)e*. 
2. Homogeneous Systems of First-Order Linear Differential Equations 

with Constant Coefficients 
have the general form 


» αι μὴν 1ο. πο οἳ (9.46a) 
k-i k=l 


If the determinant ἠοϊ(α;μ) does not disappear, i.e., 

ἀοί(αικ) A 0, (9.46b) 
then the system (9.46a) can be transformed into the normal form (9.45a). 
In the case of det(a;,) = 0 further investigations are necessary (see [9.15]). 
The solution can be determined from the general form in the same way as shown for the normal form. 
The characteristic equation has the form 

det (ainr + bin) = 0. (9.46c) 
The coefficients A; in the solution (9.456) corresponding to a single root r; are determined from the 
equation system 


M (aar; T bir) Ak =f (i = d οὐκ n). (9.464) 


k=1 
Otherwise the solution method follows the same ideas as in the case of the normal form. 


W The characteristic equation of the two differential equations 5yj' 4-44 —2y' —y» = 0, yi 4-8y1—3y» = 

0 is: 

5br+4 —2r—-1 
r+8 —3 

The coefficients A, and A» for τι = 1 can be got from the equations 94, — 34; = 0, 94; — 345 = 0 

so Ay = 34, = 3601. For rg = —2 one gets analogously A; = 2A; = 2C5. The general solution is 

yı = Cie + Coe ?*, yo = 90ιεῖ + 2056 77, 

3. Inhomogeneous Systems of First-Order Linear Differential Equations 

have the general form 


X Gk, + Σ bye = Fi(x) (ἰ--1,3,...,τι). (9.47) 
k-i k=l 


2r:2r— 420, τι 1, το 2. 
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1. Superposition Principle If y; and y;? (j = 1,2,...,n) are solutions of inhomogeneous sys- 
tems which differ from each other only in their right-hand sides F;® and F;?), then the sum Yj = y; + 
yj (j — 1,2,...,n) is a solution of this system with the right-hand side F(x) = F;® (x) + F;® (x). 
Because of this, to get the general solution of an inhomogeneous system it is enough to add a particular 
solution to the general solution of the corresponding homogeneous system. 

2. The Variation of Constants can be used to get a particular solution of the inhomogeneous 
differential equation system. To do this one uses the general solution of the homogeneous system, and 
considers the constants C4, Co,..., C, as unknown functions Ci(x), Co(x),..., C,(x). Then it is to 
be substituted into the inhomogeneous system. In the expressions of the derivatives of y; there is the 
derivative of the new unknown functions C(x). Because yj, y», .. . , y, are solutions of the homogeneous 
system, the terms containing the new unknown functions will be canceled; only their derivatives remain 
in the equations. This gives for the functions C; (x) an inhomogeneous linear algebraic equation system 
which always has a unique solution. After n integrations one gets the functions C3 (x), Co(x), ... , C, (a). 
Substituting them into the solution of the homogeneous system instead of the constants results in the 
particular solution of the inhomogeneous system. 

E For the system of two inhomogeneous differential equations 5y” + 4γι — 2y? — yo = €, yi! + 
δνι — 3y2 = δε 3 the general solution of the homogeneous system is (see p. 559) γι = Cie” + Coe ?", 
yo = 3016" + 25e 7". Considering the constants ΟἹ and Cù as functions of x and substituting into 
the original equations gives 5Cj'e? + 5Cs'e-?" — 6C,'e* — 403'e? = e-*, Ce? + Cy’? = 5e-* or 
Cxe? — Q,e-* = e, Cy'e® Οὐ ευ = 5e-*. Therefore, 201'e? = 4e-*, Οι = —e7?* + const, 
2(0 ο = 6e~*, Cy = 3e” + const. Since a particular solution is searched for, one can replace every 
constant by zero and the result is y; = 2e *, y) = 3e *. The general solution is finally γι = 2e^* + 
Cie” + Coe 7", yo = 3e? + 3C,€* + 2056 77. 

3. The Method of Unknown Coefficients is especially useful if on the right-hand side there 
are special functions in the form Q,(x)e^*. The application is similar to the one, used for differential 
equations of n-th order (see 9.1.2.5, p. 558). 


4. Second-Order Systems 
'The methods introduced above can also be used for differential equations of higher order. For the 
system 


n n n 
» αι +Y biy +Y ciry =O (i= 1,2,...,n) (9.48) 
[ms k=1 k=1 
one can determine particular solutions in the form y; = A;e"*. To do this, one gets r; from the char- 


acteristic equation det(ajr? + bir + Cik) = 0, and A; from the corresponding linear homogeneous 
algebraic equations. 


9.1.2.6 Linear Second-Order Differential Equations 
Many special differential equations belong to this class, which often occur in practical applications. 
Several of them are discussed in this paragraph. For more details of representation, properties and 
solution methods see [9.15]. 
1. General Methods 
1. Solving the Inhomogeneous Differential Equation by the Help of the Superposition 
Principle 
y^ + p(a)y! + a(x)y = Fx). (9.49a) 
To get the general solution of an inhomogeneous differential equation it is enough to add a particu- 
lar solution of the inhomogeneous equation to the general solution of the corresponding homogeneous 
equation. 
a) The general solution of the corresponding homogeneous differential equation, i.e., with F(x) = 0, is 
y = Cry + Coy. (9.49b) 
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Here yı and y» are two linearly independent particular solutions of (9.49a) (see 9.1.2.3, 2., p. 553). I 
a particular solution y; is already known, then the second one y; can be determined by the equation 
(9.34) of Liouville. From (9.34) follows: 
/ 
y: aye —yn'yo ο [yp i 
be | = nw = n'y = EE 7 25 y? (2) = Aexp ( fro) dv) (9.49¢ 
92 yi 
giving 
» = Ir 
yo = Ayı I expl J pde) dx (9.49d 
Yı 


where A can be chosen arbitrarily. 
b) A particular solution of the inhomogeneous equation can be determined by the formula 


y= 3 J PO πο ο (9.490) 


where y; and y» are two particular solutions of the corresponding homogeneous differential equation. 
C) A particular solution of the inhomogeneous differential equation can be determined also by variation 
of constants (see 9.1.2.3, 6., p. 554). 
2. Solving the Inhomogeneous Differential Equation by the Method of Undetermined Co- 
efficients 

s(x)y" + p(x)y + a(x)y = F(x) (9.50a) 
If the functions s(x), p(x), q(x) and F(x) are polynomials or functions which can be expanded into a 
convergent power series around zy in a certain domain, where s(xo) 4 0, then the solutions of this dif- 
ferential equation can also be expanded into a similar series, and these series are convergent in the same 
domain. Here they should be determined by the method of undetermined coefficients: The solution to 
be looking for as a series has the form 


y = ag + a (x — xo) + as(z — zo? +, (9.50b) 
and it has to be substituted into the differential equation (9.50a). Equating corresponding coefficients 
(of the same powers of (x — xo)) results in equations to determine the coefficients ag, a1, 2, .... 


Β To solve the differential equation y” + zy = 0 one substitutes y = ag + αι + aga? + aga? ++, 
y! = a, + 2agr + 3aga? +--+, and y” = 2as + 6a3x +--+ getting 2ag = 0, 6a3 + a) = 0,.... 
a a 
The solution of these equations is a = 0, a3 9 x «ΘᾺ 3 d , a5 = 0,..., so the solution is 
4 
T 


αὖ fi z^ } αἱ ^ at 
.; 3:3 2-3-5-6 FOTOS i324 ae eo er 


3. The Homogeneous Differential Equation 

xy” + αρ(α)γ' + q(z)y -- 0 (9.51a) 
can be solved by the method of undetermined coefficients if the functions p(x) and q(x) can be expanded 
as a convergent power series of r. The solutions have the form 


y = x" (ag + aya + aoa? +--+), (9.51b) 
whose exponent r can be determined from the defining equation 
r(r — 1) + p(0)r + q(0) = 0. (9.51c) 


Ifthe roots of this equation are different and their difference is not an integer number, then one gets two 
linearly independent solutions of (9.51a). Otherwise the method of undetermined coefficients results 
only one solution. Then with the help of (9.49b) one can get a second solution or at least one can find 
a form which gives a second solution with the method of undetermined coefficients. 


W For the Bessel differential equation (9.52a) one gets only one solution with the method of the un- 
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66 
determined coefficients in the form y = Σ αμ ον (ao # 0), which coincides with J,(x) up to a 
3 0 


constant factor. Since exp (- / pdr) == = — one finds a second solution by using formula (9.494) 
x 


da y Ck xE 


co 
a k=0 P ? vs .—n mnm 
y2 = Ayı / ror? (X αμα)” Au f "m dr = By Ine +a Σ. dix 


The determination of the unknown coefficients cj, and d; is difficult from the a;,’s. But this last expres- 
sion can be used to get the solution with the method of undetermined coefficients. Obviously this form 
is a series expansion of the function Y, (x) (9.53c). 


2. Bessel Differential Equation 


xy” + ay! + (a? — n?)y -- 0. (9.52a 
1. The Defining Equation is in this case 

r(r-1)+r-n =r -n =0, (9.52b 
so, r = +n. Substituting 

y = 2" (αρ + aix 4 -) (9.52c 


into this equation and equating the coefficients of z"** 


k(2n + kjak + ay 3 = 0. (9.52d 
For k = 1 follows (2n + 1)αι = 0. For the values k = 2,3,... one obtains 


to zero gives 


ao 
2m = =1,2,...), = =a η αν d 
amy =0 (m=1,2,...), az 2(2n + 2) 
as “o es, 09 is arbitrary. (9.520) 
2-4. (2n +2) (2n - 4) ` 
1 
2. Bessel or Cylindrical Functions The series obtained above for ay = —————_., where T is 


2n l'(n. 4- 1)' 
the gamma function (see 8.2.5, 6., p. 514), is a particular solution of the Bessel differential equation 
(9.52a) for integer values of n. It defines the Bessel or cylindrical function of the first kind of index n 


an 2 4 
In(x) = > pet od Z τ 
n 2^ l'(n + 1) 2(2n +2) 2:4: (2n 4 2)(2n + 4) 


-X C (2) (9.53a) 


£x kn d kr) 


The graphs of functions Jp and J, are shown in Fig. 9.12. 
The general solution of the Bessel differential equation for non-integer n has the form 
y = CJ. (x) + CoJ_n (x), (9.53b) 
where J_,() is defined by the infinite series obtained from the series representation of J, (x) by replac- 
ing n with —n. For integer n, holds J µ(α) = (—1)"J, (x). In this case, the term J_,,(a) in the general 
solution should be replaced with the Bessel function of the second kind 
Y,(2) = lim Ja (x) cos mr — Jmm) 


mon sin mm 


(9.53c) 


which is also called the Weber function. For the series expansion of Y, (x) see, e.g., [9.15]. The graphs 
of the functions Yo and Y; are shown in Fig. 9.13. 
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A 
05-4 Y γ-γ 
\ ο... 


[αυ] 


© 


Figure 9.12 Figure 9.13 


3. Bessel Functions with Imaginary Variables In some applications one uses Bessel functions 
with pure imaginary variables. In this case it is to be considered the product i^" J, (ir) which will be 


denoted by 1, (x): 
(3) n (3) n+2 (5) n+4 
2 2 MV T (9.54a 


T(n-1) Γης») Mints)” 
The functions 7, (x) are solutions of the differential equation 
xy" + ay! — (a? + n?)y -- 0. (9.54b 
A second solution of this differential equation is the MacDonald function 
Ti-ar) — L(x 
Κα(α) == nf ) nf ) 
2 sin nm 
If n converges to an integer number, this expression also converges. 
The functions J (x) and K„(x) are called modified Bessel functions. The graphs of functions Jo and Π 


are shown in Fig. 9.14; the graphs of functions Ko and Κι are illustrated in Fig. 9.15. The values o 
functions Jo(x), Ji(x), Yo(x), Υ (x), Io(x), L(x), Ko(x), Kı (x) are given in Table 21.11, p. 1106. 


Τη) = i” Jalie) = 


(9.54c 


^ 
Z, X 
T T T T > 
05 10 15 20 x ^" 0.5 1.0 
Figure 9.14 Figure 9.15 Figure 9.16 
4. Important Formulas for the Bessel Functions Jn (a) 
2n dJ, (x n 
hale) + Aa) = Pasa), POLL, a) (9.55) 
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The formulas (9.55a) are also valid for the Weber functions Y, (a). 


ant, (x ας n 

In—1(@) -- Iny (T) i ) ( ) La(z) — -L(x), (9.55b) 
1 dx g 
2nK, (x dx : n 

Καμι(α) — Kn- (£) = a ) a ) = —K,-1(2) -Κπ(α). (9.550) 


For integer numbers n the following formulas are valid: 


π/2 


2 
Jon (11) = — if cos(x sin y) cos 2ny dy, (9.55d) 
π 
0 
9 π/2 
οι μι(α) = = / sin(x sin y) sin(2n + 1)p dy (9.55e) 
T 


0 


or, in complex form, 
In(x) = z0" J ezes? cos ny dip. (9.55f) 
T 4 
0 


The functions Jn+1/2(x) can be expressed by using elementary functions. In particular, 


2 2 
Ijal) = 4] τα ne (9.56a) Jal) = EI (9.56b) 


By applying the recursion formulas (9.55a)-(9.55f) the expression for Jn+1/2(x) for arbitrary integer n 
can be given. For large values of x the following asymptotic formulas are valid: 


Jun) ‘2 [cos (s - z) +O (2)] ; (9.57a 
= Ps | +0 (2) (9.57b 
Y, (2) = a [sin (s E 7) +O (2) ; (9.57c 
K,(r)-— (ze | +O (5) , (9.57d 


1 1 
The expression O (9) means an infinitesimal quantity of the same order as — (see the Landau symbol, 
x x 


2.1.4.9, p. 57). 
For further properties of the Bessel functions see [21.1]. 


5. Important Formulas for the Spherical Bessel Functions Spherical Bessel functions of 
the first and second kind j;(z) and nj(z) follow from the Bessel functions of the first and second kind 


Jn(z) (9.53a)and Y,(z) (9.536) for half odd order index n = τ ᾽ ...asin ji(z) = Va; and 


T 
ni(z) = Va; m (z) with | — 0,1,2,.... They occur as regular or singular solutions of the potential 
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nk? 
free radial Schroedinger equation (see 9.2.4.6,3., (9.137b), p. 599) with V(r) = 0, E = 5 „Z= kr 
m 
and s;(z) = Πι(τ): 
d? , f 
em, [z si(z)] + [2 -- l(L-1)) (2) =0, si(z) = (z) or (z). (9.58a 


They also occur in the quantum mechanical scattering theory, where the n;(z) are called the spherica 
von Neumann functions. By the help of the Rayleigh formulas 


l sin z ! lOS 2 
ποια a =) m niet (5) (a (9.58b 


zdz 2 2 


follows, so 


sin 2 . sin z — Z cos z 


jo(2) τι jl) E pts (9.58c 
COS Z cos z + zsinz 
no(z) —, πια) = E pi (9.58d 


Complex spherical functions are used with @,,(y) = e"? in the form Yr(&) = O?"(90)9,, (o), e.g. in 
9.2.4.6, (9.1360), p. 599. In the combined index L = (I, m) l= 0,1,2,... denotes the quantum number 
of the orbital angular momentum. The magnetic quantum number m is restricted to the 2/ + 1 values 
m = —l, —l + ντ , 4H. With the abbreviations 


Julk) = j(kr)YL(&,). ni(k) = n(kr)Y;L(&,), & = E (9.59a 
" 
one gets the Kasterinian formulas 
ος (Ὁ BUM. ο ο ο λος. (9.59b 
ik/ kr ik kr 


where V denotes the nabla operator (s. 13.2.6.1, S. 715). The expansion of a plane wave in terms o 
spherical or Bessel functions gives 


" ο... ET 
eK ag V rij(kr)YT(à), & = P Ysy Doe (9.59c 
L L 1:50 πι----ἶ 
There are the following addition theorems 
ijn (K(E1 + 19)) 2 4 X Crni” jr ({π|}}}, (kr), r12 = arbitrarily, (9.59d 
LiLa 
i'nz(k(fi +r2)) 2 4π M Crni” σπιν a) (ks). rir (9.59e 
Li,La 
with the Clebsch-Gordan coefficients (see 5.3.4.7, p. 345) 
Cinta = | αεΥι(ΘΥ:, (Yz, (9). (9.591) 


For further details see [21.1], [9.28] until [9.31]. 
3. Legendre Differential Equation 


Restricting the investigations in this book to the case of real variables and integer parameters n — 
0, 1,2,... the Legendre differential equation has the form 


(1— z2)y" --2 - n(n-1)y 20 or ((1—a2)y - n(n 4 1)y -- 0. (9.60a) 
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1. Legendre Polynomials or Spherical Harmonics of the First Kind are the particular solu- 
tions of the Legendre differential equation for integer n, which can be expanded into the power series 


y= Σ a,x”. The method of undetermined coefficients yields the polynomials 
v=0 
δή -- (2n)! E n(n — 1) gn n(n — 1)(n — 2)(n E 3) nd TNR 
2n (nl? 2(2n — 1) 2: 4(2n — 1)(2n — 3) 


(|x| < oo; n = 0,1,2,...). (9.60b) 


NE 3 1 d"(zi?—1) (9.006) 


Ῥι(α) = F (n + 1,—n, 1; 5 E am 


where F denotes the hypergeometric series (see 4., p. 567). The first eight polynomials have the fol- 
lowing simple form (see 21.12, p. 1108): 


Py(x)—1, (9.60d) Ρι(α) τα, (9.60e 
NR NR 

P(x) = PS -1), (9.60f) P(x) = Pid — 3x), (9.60g 
1 4 2 1 5 3 ; 

Pr) = «δα! -- 30x? +3), (9.60h) Ρα) = = (6325 — 70x? + 159), (9.60i 


1 
16 
The graphs of P, (x) for the values from n = 1 to n = 7 are represented in Fig. 9.16. The numerica 
values can be calculated easily by pocket calculators or from function tables. 


2. Properties of the Legendre Polynomials of the First Kind 
a) Integral Representation: 


Ps x) 


; 1 " 
(291α5--8315α΄-Ε105α7--5) (9.601) Ρη(α) το 429" 693x° +315a3 —35x) .(9.60k 


π π 


1) = 1 le S 2 n 1 dy . 
Pix) = T fe + eos ova! — 1)" dp π / (a cos pyx? — 1)nt ` (9.61a) 


The signs can be chosen arbitrarily in both equations. 
b) Recursion Formulas: 


(n + 1) P441(x) = (2n + Ί)σΡη(α)-- πι (£) (n > 1; P(x) 21, Pi(x) — x), (9.61b) 
(z? — η) = n[rP,(x) - Pra(x)| (n> 1). (9.61c) 
da: 
c) Orthogonality Relation: 
: 0 form £n, 
f PP v) dx = 2 form=n. (9.61d) 
ES 2n 1 


d) Root Theorem: All the n roots of P, (x) are real and single and are in the interval (—1, 1). 


e) Generating Function: The Legendre polynomial of the first kind can be represented as the power 
series expansion of the function 


1 oo 
——— ο 9.6le 
V1 — 2rz t τὸ È d ( 


For further properties of the Legendre polynomials of the first kind see [21.1]. 
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3. Legendre Functions or Spherical Harmonics of the Second Kind A second particular 
solution Q,,(x) can be got, which is valid for |x| > 1 and linearly independent of P, (x), see (9.61a), by 


—(n+1) 
the power series expansion 5 bya”: 
Qula) = 2"(n!)? ΜΉΤ ea mE 1 ) 
(2n +1)! 2 2 2. y 
2^ (n)? MOT (n+ 1)(n + 2) μ.ο. 
(2n + 1)! 2(2n + 3) 
(n : Dus 2)n*3)n-4) qs 
| gp 05 m .62a 
4- (2n + e +5) i (2.621) 
The representation of Q, (x) valid for : | < Lis: 
1 1+ 
Qala) = 5 P) Jing — x. Ip 1(2) P4 (x). (9.62b) 


The spherical harmonics of the τ ο. second kind are also called the associated Legendre functions 
(see also 9.2.4.6, 4., (9.138c), p. 600). 


4. Hypergeometric Differential Equation 
The hypergeometric differential equation is the equation 


x - 23 [y (o +B + al 


where a, 3, y are parameters. It contains several important special cases. 


oy — 0, (9.63a) 


1-2 
a) Fora=n-+1, 8 n,y=1, anda 7 it is the Legendre differential equation. 


b) If y £ 0 or η is not a negative integer, it has the hypergeometric series or hypergeometric function 
as a particular solution : 
| B ν 1)8(8 +1) . 
F(o,B,y;z) 21-4 iis £+ ala + 1)2(8 + ) 92 Fees 
ly de τὴ 

1)...(o - n)B(B +1)... n 

a(a 4 1)... (α 4- n)B(B 4- 1) (8n) «a bos; 9.63b) 
1:2...(n- 1)- q(y - 1)...(y - m) 

which is absolutely convergent for |x| < 1. The convergence for x = +1 depends on the value of 
ô = y — a — B. For x = 1 it is convergent if ô > 0, it is divergent if ô < 0. For x = —1 it is absolutely 
convergent if ô < 0, it is conditionally convergent for —1 < ὁ < 0, and it is divergent for ὁ < —1. 
c) For 2 — y Z 0 or not equal to a negative integer it has a particular solution 


y =r IF(at+1—7,68+1-7,2- για). 9.63c 
d) In some special cases the hypergeometric series can be reduced to elementary functions, e.g., 


F(1,8,6;2) = F(o;1,0;2) = , (9.64a) | F(—n,B,B;—x) = (14-α)", 9.64b 
=e 
ον. πα +2) ; e d 3: 2 . arcsinz } 
F(1,1,2;—2) =, (494) Ε(πις,.μο)-----, 9.64d 


lim F (ns 1; z) =e". (9.64e 
Boo 
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5. Laguerre Differential Equation 


Restricting the investigation to integer parameters (n = 0,1,2,...) and real variables, the Laguerre 


differential equation has the form 
zy" +(a+1—2)y' 4 ny — 0. 
Particular solutions are the Laguerre polynomials 
ep d" (ea ae) _ y πα (-α) l 
n! dx” ει kj k! 


The recursion formula for n > 1 is: 


(n+ 1)LE) (2) = (~r + 2n +a + 1) LG (2) — (n + o) EE (2), 


L(x) = 


n 


LO (x) =1, Lo Ξ]--α--. 
An orthogonality relation for a > —1 holds: 

oo 0 form zn, 
etre a) (x )L (α) (a) dz = i +a 


m 


Jra +a) form=n. 


0 n 


I' denotes the gamma function (see 8.2.5, 6., p. 514). 
6. Hermite Differential Equation 
Two defining equations are often used in the literature: 
a) Defining Equation of Type 1: 

y” — xy +ny=0 (n=0,1,2,...). 
b) Defining Equation of Type 2: 

y' — 2zy +ny=0 (n=0,1,2,...). 


(9.65a 


(9.65b 


(9.65c 


(9.65d 


(9.65e 


(9.664) 


(9.665) 


Particular solutions are the Hermite polynomials, He, (x) for the defining equation of type 1, and H, (x) 


for the defining equation of type 2. 
a) Hermite Polynomials for Defining Equation of Type 1: 


- 4?N d^ x 
He,(x) = (—1)" exp ( 7 ) d P ( 7 ) 
me ι΄. n an-2 i n QJR-4A 1. Í n ,n—6 Ls 
=f (2: +1 (le 1-3 ο + (n € IN). 


For n > 1 the following recursion formulas are valid: 


He,j,1(x) = «He,(x) — nHe, (1). (9.664) Heg(r) 21, Ηει(α) = m. 


The orthogonality relation is: 


+00 "E form zn, 
23 TA Hem ; Hen a) dx = - 
" e ( τ) , i nlv2m form — n. 


b) Hermite Polynomials for Defining Equation of Type 2: 


Βπία) = (-1)* exp (οἱ ; E z exp (~: 2?) (n € N). 


The relation with the Hermite polynomials for defining equation of type 1 is the following: 


Henle) = 27"? Hy, (5) (n € N). 


(9.66c 


(9.66e 


(9.66f 


(9.66g 


(9.66h 
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9.1.3 Boundary Value Problems 
9.1.3.1 Problem Formulation 
1. Notion of the Boundary Value Problem 


In different applications, e.g., in mathematical physics, differential equations must be solved as so- 
called boundary value problems (see 9.2.3, p. 589), where the required solution must satisfy previously 
given relations at the endpoints of an interval of the independent variable. A special case is the linear 
boundary value problem, where a solution of a linear differential equation should satisfy linear boundary 
value conditions. In the following section the discussion is restricted to second-order linear differential 
equations with linear boundary values. 

2. Self-Adjoint Differential Equation 

Self-adjoint differential equations are important special second-order differential equations of the form 


[py] — ay + λου = f. (9.67 
The linear boundary values are the homogeneous conditions 
Agy(a) + Boy'(a) — 0, Aiy(b) + Biy'(b) = 0. (9.67b 


The functions p(x), p'(x), q(x), o(x), and f(a) are supposed to be continuous in the finite interva 
a € x € b. In the case of an infinite interval the results change considerably (see [9.5]). Furthermore, 
it is supposed that p(x) > po > 0, o(x) > ου > 0. The quantity A, a parameter of the differentia 
equation, is a constant. For f = 0, it is called the homogeneous boundary value problem associated to 
the inhomogeneous boundary value problem. 
Every second-order differential equation of the form 

Ay" + By' + Cy + ARy = F (9.67c 
can be reduced to the self-adjoint equation (9.67a) by multiplying it by p/A if in [a,b], A z 0, and 
performing the following substitutions 


R pC pR 
= — dz), =-—, = 67d 
p «φ(/ 3 η) q A ΟΞ (9.67 
To find a solution satisfying the inhomogeneous conditions 
Aoy(a) + Boy'(a) = Co, Ary(b) + Biy'(b) = Οι (9.67e 


one returns to the problem with homogeneous boundary conditions, but the right-hand side f(x 
changes and y = z + u is substituted where u is an arbitrary twice differentiable function satisfying 
the inhomogeneous boundary conditions and z is a new unknown function satisfying the corresponding 
homogeneous conditions. 

3. Sturm-Liouville Problem 

For a given value of the parameter A there are two cases: 

1. Either the inhomogeneous boundary value problem has a unique solution for arbitrary f(x), while 
the corresponding homogeneous problem has only the trivial, identically zero solution, or, 

2. The corresponding homogeneous problem also has non-trivial, i.e., not identically zero solutions, 
but in this case the inhomogeneous problem does not have a solution for arbitrary right-hand side; and 
if a solution exists, it is not unique. 


The values of the parameter A, for which the second case occurs, i.e., the homogeneous problem has a 

non-trivial solution, are called the eigenvalues of the boundary value problem, the corresponding non- 

trivial solutions are called the eigenfunctions. The problem of determining the eigenvalues and eigen- 

functions of a differential equation (9.67a) is called the Sturm-Liouville problem. 

9.1.3.2 Fundamental Properties of Eigenfunctions and Eigenvalues 

1. The eigenvalues of a boundary value problem form a monotone increasing sequence of real numbers 
No < Ap < À2€ XA Xr, (9.68a) 

tending to infinity. 
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2. The eigenfunction associated to the eigenvalue A; has exactly n roots in the interval a < x < b. 
3. If y(x) and z(z) are two eigenfunctions belonging to the same eigenvalue A, they differ only in a 
constant multiplier c, i.e., 

2(x) = cy(x). (9.68b) 
4. Two eigenfunctions y; (x) and yo(x), associated to different eigenvalues λι and A», are orthogonal to 
each other with the weight function o(x) 


fu yo(x) o(x) dx = 0. (9.68c) 


5. If in (9.67a) the coefficients p(x) and q(x) are replaced by p(x) > p(x) and q(r) > g(x), then the 
eigenvalues will not decrease, i.e., Jai > An, where An and A, are the n-th eigenvalues of the modified 
and the original equations respectively. But if the coefficient g(x) is replaced by O(a) > o(x), then 
the eigenvalues will not increase, i.e., Xn € àn. The n-th eigenvalue depends continuously on the 
coefficients of the equation, i.e., small changes in the coefficients will result in small variations of the 
n-th eigenvalue. 

6. Reduction of the interval [a, b] into a smaller one does not result in smaller eigenvalues. 


9.1.3.3 Expansion in Eigenfunctions 


1. Normalization of the Eigenfunction 
For every A, an eigenfunction o, (x) is chosen such that 


f ρω) οί) dz = 1. (9.69a) 


It is called a normalized eigenfunction. 
2. Fourier Expansion 
To every function g(x) defined in the interval [a, b], one can assign its Fourier series 
ss b 
g(x) ~ ΝΣ Enna) ον EO n(x) o(x) dx (9.69b) 
n=0 a 

with the eigenfunctions of the corresponding boundary value problem, if the integrals in (9.69b) exist. 
3. Expansion Theorem 
If the function g(x) has a continuous derivative and satisfies the boundary conditions of the given prob- 
lem, then the Fourier series of g(x) (in the eigenfunctions of this boundary value problem) is absolutely 
and uniformly convergent to g(x). 
4. Parseval Equation 
If the integral on the left-hand side exists, then 


b oo 

[eco ota) dx = Me (9.69c) 
a n=0 

is always valid. The Fourier series of the function g(x) converges in this case to g(x) in mean, that is 

b N 2 

lim ate) — X` envn(x)] οα) da = 0. (9.69d) 
οκ, a n=0 

9.1.3.4 Singular Cases 


Boundary value problems of the above type occur very often in solving problems of theoretical physics 
by the Fourier method, however at the endpoints of the interval [a, b] some singularities of the differen- 
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tial equation may occur, e.g., p(x) vanishes. At such singular points some restrictions are imposed on 
the solutions, e.g., continuity or being finite or unlimited growth with a bounded order. These condi- 
tions play the role of homogeneous boundary conditions (see 9.2.3.3, p. 591). In addition, often occurs 
the case where in certain boundary value problems homogeneous boundary conditions should be con- 
sidered, such that they connect the values of the function or its derivative at different endpoints of the 
interval. Often occur the relations 

y(a) = y(b),  p(a)u'(a) = (όν (0), (9.70) 
which represent periodicity in the case of p(a) = p(b). For such boundary value problems everything 
being introduced above remains valid, except statement (9.68b). For further discussion of this topic 


see [9.5]. 


9.2 Partial Differential Equations 


9.2.1 First-Order Partial Differential Equations 
9.2.1.1 Linear First-Order Partial Differential Equations 


1. Linear and Quasilinear Partial Differential Equations 
The equation 


X E bt yee =¥ (9.71a) 
is called a linear first-order partial differential equation. Here z is an unknown function of the inde- 
pendent variables z4,...,:,, and X4,..., Xn, Y are given functions of these variables. If functions 
Xj,..., Xn, Y depend also on z, the equation is called a quasilinear partial differential equation. In the 
case of 

Y a0, (9.71b) 
the equation is called homogeneous. 

2. Solution of a Homogeneous Partial Linear Differential Equation 
The solution of a homogeneous partial linear differential equation and the solution of the so-called 


characteristic system 

dx, dx az, 

XX, X, 
are equivalent. This system can be solved in two different ways: 
1. Any £p, for which X, 4 0, can be chosen as an independent variable, so the system is transformed 
into the form 


(9.72a) 


dx; Xj 
——— (41Ξ 1... ντ). 9.72b 
dr, Xz G n) ( 
2. A more convenient way is to keep symmetry and to introduce a new variable ¢ getting 
dx; . 
de XS PS δι μα). (9.72c 


Every first integral of the system (9.72a) is a solution of the homogeneous linear partial differentia 
equation (9.72a,b), and conversely, every solution of (9.72a,b) is a first integral of (9.72a) (see 9.1.2.1, 
2., p. 551). If the n — 1 first integrals 


Qi(z1,..., 2.) =0 (6—1,2,...,n— 1) (9.72d 
are independent (see 9.1.2.3, 2., p. 553), then the general solution is 
z = (pi... na). (9.72e 


Here Φ is an arbitrary function of the n — 1 arguments y; and a general solution of the homogeneous 
linear differential equation. 
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3. Solution of Inhomogeneous Linear and Quasilinear 
Partial Differential Equations 


To solve an inhomogeneous linear and quasilinear partial differential equation (9.71a) one can try to 


find the solution z in the implicit form V(zi,...,c,,z2) = C. The function V is a solution of the 
homogeneous linear differential equation with n 4- 1 independent variables 
OV OV OV OV 
X TX fees +X, ΕΥ =0, 9.73a 
Oa 1 κ Ότο ' "Or,  ϑὲ ] ( ) 

whose characteristic system 

dx, — dx dz, dz 

ism 9.73h 
Χι X, X. Y PM 


is called the characteristic system of the original equation (9.71a). 


4. Geometrical Representation and Characteristics of the System 
In the case of the equation 
Oz Oz 
P(z,y, z) z— + Q(z, y, z) 
(zy, 2) τε -Ω(α,ν, By 
with two independent variables x; = Ὁ and 2» = y, a solution z = f(x,y) is a surface in x, y, z space, 
and it is called the integral surface of the differential equation. Equation (9.74a) means that at every 


= R(x, y,z) (9.74a) 


Oz Oz 

point of the integral surface z = f(x,y) the normal vector (= TL 3 is orthogonal to the vector 
x’ Oy 
(P,Q, R) given at that point. Here the system (9.73b) has the form 


dx u dy - dz 

P(r,y.z) Q(nwz)  R(a,y,2) 
It follows (see 13.1.3.5, p. 708) that the integral curves of this system, the so-called characteristics, are 
tangent to the vector (P, Q, R). Therefore, a characteristic having a common point with the integral 
surface z = f(x,y) lies completely on this surface. Since the conditions for the existence theorem 
13.1.3.5, 1., p. 551 hold, there is an integral curve of the characteristic system passing through every 
point of space, so the integral surface consists of characteristics. 
5. Cauchy Problem 


(9.74b) 


There are given n functions of n — 1 independent variables t4, t5, ..., t4 4: 

T1 = Tihti, t2, ... tn 1), 22 ΠΟ iaa). (9.75a) 
The Cauchy problem for the differential equation (9.71a) is to find a solution 

z — (zi, οι... 4) (9.75b) 
such that if one substitutes (9.75a), the result is a previously given function v(t4, t5, ..., t4 1): 

φ[αι (ti te, ses stni), v(t, to, EE stazi); ERE] vy (t4, 15, κ) t, 1)] = wth, to, ase iini). (9.75c) 


Inthe case of two independent variables, the problem reduces to find an integral surface passing through 
the given curve. Ifthis curve hasa tangent depending continuously on a point and it is not tangent to the 
characteristics at any point, then the Cauchy problem has a unique solution in a certain neighborhood 
of this curve. Here the integral surface consists of the set of all characteristics intersecting the given 
curve. For more mathematical discussion on theorems about the existence of the solution of the Cauchy 
problem see [9.15]. 


W A: For the linear first-order inhomogeneous partial differential equation (mz — ny) >, + (πα -- 
ba 
Oz : ET da: dy 
lz) — = ly — mz (l,m, n are constants), the equations of the characteristics are = = 
Oy mz—my na—lz 
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dz . . : ‘ 9 . 
. The integrals of this system are lz + my + nz Οι, a? + y? + 2? = Cy. One gets circles 


ly — max 
as characteristics, whose centers are on a line passing through the origin, and this line has direction 
cosines proportional to l,m, n. The integral surfaces are rotation surfaces with this line as an axis. 


; , : : : ς OZ 
W B: Determine the integral surface of the first-order linear inhomogeneous differential equation Jz + 
z 


ðz dx 
ο z, which passes through the curve 2; = 0, z = y(y). The equations of characteristics are I~ 
y 
dy dz -— : : Pare 
Πο The characteristics passing through the point (xo, yo, zo) are y = £ — zo + yo, Z = we". 
z 


A parametric representation of the required integral surface is y = «+ yo, z = e”p(yo), if we substitute 
20 = 0, zo = p(yo). The elimination of yo results in z = e"y(y — x). 


9.2.1.2 Non-Linear First-Order Partial Differential Equations 
1. General Form of First-Order Partial Differential Equation 


is the implicit equation 


Oz Oz 
κ... pac =) = 0: (9.76a) 


1. Complete Integral is the solution 
αξρίσι,... ὄμται,:-. ἀπ), (9.76b) 
depending on n parameters @),...,@, if at the considered values of 21,..., &n, 2 the functional deter- 
minant (or Jacobian determinant, see 2.18.2.6, 3., p. 123) is non-zero: 
O (ρου). Pen) 
O(ai,..., an) 


2. Characteristic Strip The solution of (9.76a) is reduced to the solution of the characteristic 
system 


#0. (9.76c) 


dx, - dEn dz —dpı - - dpa (9.764) 
P Pa pPrt = τη XrtpZ Xn pA 
with 
Z > X; £, pi E P, m (i=1,...,n). (9.166) 
The solutions of the characteristic system satisfying the additional condition 
F(zi...,25,Z, pu... p.) = 0 (9.76f) 


are called the characteristic strips. 
2. Canonical Systems of Differential Equations 
Sometimes it is more convenient to consider an equation not involving explicitly the unknown function 
z. Such an equation can be obtained by introducing an additional independent variable x,,; = 5 
and an unknown function V (x1, ..., 2n, $541), which defines the function z(x1, £2, ..., 2n) with the 
equation 

V (235552488, £) SC. (9.77a) 


oV Oz 
At the same time, one substitutes the functions —— i — l...m) for 5 in (9.764). 


zi / Ora xi 


Then one solves the differential equation (9.76a) for an arbitrary partial derivative of the function V. 
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The corresponding independent variable will be denoted by x after a suitable renumbering of the other 
variables. Finally, one gets the equation (9.76a) in the form 


oV oV T 
--» Pi i 
Ox’ Ps Ox; 

The system of characteristic differential equations is transformed into the system 


dr; OH dp; OH 


p+ H(z1,..., £n, £, P1,- -, Pn) =0, p 1,..εγ). (9.77b) 


dr Op; dx Ox; (i=1,...,n) and (9.77c) 
dV OH OH dp OH 

ή H, ττά 
dx Pap, 5 ODn dx Ox (9.77) 


Equations (9.77c) represent a system of 2n ordinary differential equations, which corresponds to an 
arbitrary function H (z1,..., 24,2, pi, ..., Pn) with 2n + 1 variables. It is called a canonical system or 
a normal system of differential equations. 

Many problems of mechanics and theoretical physics lead to equations of this form. Knowing a complete 
integral 


V = p(21,..-,2n,2,1,---,An) +a (9.77e) 

of the equation (9.77b) one can find the general solution of the canonical system (9.77c), since the 
ὃ ὃς 

equations a bi, a pi (i = 1,2,...,n) with 2n arbitrary parameters a; and b; determine a 


Oa; Ox; 
2n-parameter solution of the canonical system (9.77c). 

3. Clairaut Differential Equation 

If the given differential equation can be transformed into the form 


— (i=1,...,n), 9.78a 
Fe Ga hem), (9.78a) 
it is called a Clairaut differential equation. The determination of the complete integral is particularly 
simple, because a complete integral with the arbitrary parameters a4, a2, . . . , @n is 


Z = mpi + Lop. + `+: + LnPn + f (P1, P2,- --; Pn), pi = 


z = 421 + da23 +++» + as, + f (01,02, . . απ). (9.78b) 
E Two-Body Problem with Hamilton Function: Consider two particles moving in a plane under 
their mutual gravitational attraction according to the Newton field (see also 13.4.3.2, p. 728). Choosing 
the origin as the initial position of one of the particles, the equations of motion have the form 


Px ὃν d'y ὃν. k? 


' dë d ; i 9.79a 
dt? Ox’ d? Oy’ Very ( B 
Introducing the Hamiltonian function 
Tos , [2 
κα 9.79} 
αν τα} JAT (9.79 


the system (9.79a) is transformed into the normal system (into the system of canonical differentia 
equations) 


dx OH dy OH dp OH dq OH 


dt Op’ dt ðq’? dt Ox’ dt dy (3:79 
with variables 
dx d 
2,y,P=—, q=2. (9.79d 
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Now, the partial differential equation has the form 


Oz 1 Oz ? Oz 2 k2 
δε 2 (3) | (5) πρεπε (0.709) 


Introducing the polar coordinates p, p in (9.79e) one obtains a new differential equation having the 
solution 


2k? b 
z=-at—bpt+ec f 2a +4 zdr (9.79f 
po y- m 
with the parameters a, b, c. The general solution of the system (9.79c) follows from the equations 
Oz Oz 
— = to = = e. j 
3a ^ 3b Po (9.79g 


4. First-Order Differential Equation in Two Independent Variables 
Forz, = zr, πο = y, pı = p, pa = q the characteristic strip (see 9.2.1.2, 1., p. 573) can be geometrically 
interpreted as a curve at every point (x, y, ο) of which a plane p(£ — x) -- q(rj — y) = ¢ — z being tangen 
to the curve is prescribed. So, the problem of finding an integral surface of the equation 


r(e, oo 5) E (9.80 


passing through a given curve, i.e., to solve the Cauchy problem (see 9.2.1.1, 5., p. 572), is transformed 
into another problem: To find the characteristic strips passing through the points of the initial curve 
such that the corresponding tangent plane to each strip is tangent to that curve. One gets the values 
p and q at the points of the initial curve from the equations F(x,y, z, p,q) = 0 and pdx + qdy = dz. 
There can be several solutions in the case of non-linear differential equations. 


"Therefore, under the formulation of the Cauchy problem, in order to obtain a unique solution one can 
assume two continuous functions p and q satisfying the above relations along the initial curve. 
For the existence of solutions of the Cauchy problem see [9.15]. 
Β For the partial differential equation pq = 1 and the initial curve y = αὖ, z = 2x?, one can choose 
p = v and q = 1/z along the curve. The characteristic system has the form 

dr _ dy _ dz — 5 dp _ 0 

d T a P a PL {ἃ 
The characteristic strip with initial values zo, yo, zo, po and qo for t = 0 satisfies the equations x = 
Xo qot, y = Yo + pot, z = 2poqot + zo, p = Po, q = qo. For the case of po = xo, qo = 1/το the equation 
of the curve belonging to the characteristic strip that passes through the point (xo, yo, 20) of the initial 
curve is 


3 2 
2 = To + > Y =T Γίσο, z= 2t + 2r". 


Eliminating the parameters zo and t gives 27 = 4zy. For other chosen values of p and q along the initial 
curve one can get different solutions. 

Remark: The envelope of a one-parameter family of integral surfaces is also an integral surface. Con- 
sidering this fact one can solve the Cauchy problem with a complete integral. One finds a one-parameter 
family of solutions tangent to the planes given at the points of the initial curve. Then one determines 
the envelope of this family. 

W Determine the integral surface for the Clairaut differential equation z — pr — qy + pq = 0 passing 
through the curve y = x, z = z?. The complete integral of the differential equation is z = ax + by — ab. 
Since along the initial curve p = q = x, one determines the one-parameter family of integral surfaces 


1 : 
by the condition a = b. When the envelope of this family is found then one gets z = 4€ 4 y). 


576 9. Differential Equations 


5. Linear First-Order Partial Differential Equations in Total Differentials 
Equations of this kind have the form 

dz = fidzi + fada +--+ + frd&n, (9.81a) 
where fi, fo,..., fn are given functions of the variables x1, 22,..., Up, Z. The equation is called a com- 
pletely integrable or exact differential equation when there exists a unique relation between x1, 19, .. . , En, 
z with one arbitrary constant, which leads to equation (9.81a). Then there exists a unique solution 
z = 2(X1,22,...,%pn) of (9.81a), which has a given value zo for the initial values z1°,..., n? of the 
independent variables. Therefore, for n = 2, πι = x, 20 = y a unique integral surface passes through 
every point of space. 


-1 
The differential equation (0.814) is completely integrable if and only if the "555 equalities 
Of Of; Ofk εδ ,. 
+ fr + fi i,k=1,..., 9.811 
Or}, fr Oz Ox; f Oz (i | n) ( i 


in all variables £1, £2, . . . , &n, 2 are identically satisfied. 
If the differential equation is given in symmetric form 
fidzı +-+: + fado, = 0, (9.81c 
then the condition for complete integrability is 
Of, 3} Of;  Ofx Of; Of; 
i F f; + fi : Ξ0 9.814 
f (2 Oxy, fi Ox, — OX; fi Ox;  Omj ( 


for all possible combinations of the indices i, j, k. If the equation is completely integrable, then the 
solution of the differential equation (9.814) can be reduced to the solution of an ordinary differentia 
equation with n — 1 parameters. 


9.2.2 Linear Second-Order Partial Differential Equations 


9.2.2.1 Classification and Properties of Second-Order Differential 
Equations with Two Independent Variables 


1. General Form 
of a linear second-order partial differential equation with two independent variables x,y and an un- 
known function u is an equation in the form 
ὅτι Pu Ou Qu Qu 
Aa, + 2B— +C- +a +b Fcu- f, 9.824, 
Oa? | OxOy Oy? Or ὃν ` f ( ) 
where the coefficients A, B, C, a, b, c and f on the right-hand side are known functions of x and y. 
The form of the solution of this differential equation depends on the sign of the discriminant 
ó—-AC-B? (9.82b) 
in a considered domain. The following cases should be distinguished. 
1.6 « 0: Hyperbolic type. 
2. ó — 0: Parabolic type. 
3.6 > 0: Elliptic type. 
4. 6 changes its sign: Mixed type. 
An important property of the discriminant ô is that its sign is invariant with respect to arbitrary trans- 
formation of the independent variables, e.g., to introduction new coordinates in the x, y plane. There- 
fore, the type of the differential equation is invariant with respect to the choice of the independent 
variables. 
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2. Characteristics 

of linear second-order partial differential equations are the integral curves of the differential equation 
dy Bt V/-6 
da A i 
For the characteristics of the above three types of differential equations the following statements are 
valid: 

1. Hyperbolic type: There exist two families of real characteristics. 

2. Parabolic type: There exists only one family of real characteristics. 

3. Elliptic type: There exists no real characteristic. 

4. A differential equation obtained by coordinate transformation from (9.82a) has the same character- 
istics as (9.82a). 

5. If a family of characteristics coincides with a family of coordinate lines, then the term with the 
second derivative of the unknown function with respect to the corresponding independent variable is 
missing in (9.82a). In the case of a parabolic differential equation, the mixed derivative term is also 
missing. 


Ady” — 2Bdxdy + Cda? =0 or 


(9.83) 


3. Normal Form or Canonical Form 

One has the following possibilities to transform (9.82a) into the normal form of linear second-order 
partial differential equations. 

1. Transformation into Normal Form: The differential equation (9.82a) can be transformed into 
normal form by introducing the new independent variables 


E= (x,y) and n=Y(2,y), (9.84a 
which according to the sign of the discriminant (9.82b) belongs to one of the three considered types: 

a EL 

pa = T +---=0, <0, hyperbolic type; (9.84b 
Pu : : 
δη +---=0, 0-0, parabolic type; (9.84c 

Qu Pu m 

ag + δη +---=0, 020, elliptic type. (9.84d 


The terms not containing second-order partial derivatives of the unknown function are denoted by dots. 
2. Reduction of a Hyperbolic Type Equation to Canonical Form (9.84b): If, in the hyper- 
bolic case, one chooses two families of characteristics as the coordinate lines of the new coordinate 
system (9.84a), i.e., if substituting & = ία, y), m = v(r, y), where φ(α, y) = constant, y(x, y) = 
constant are the equations of the characteristics, then (9.82a) becomes the form 
ὅτι " 

060r 
This form is also called the canonical form of a hyperbolic type differential equation. From here one gets 
the canonical form (9.84b) by the substitution 

£—&tm n-&-m. (9.84f) 
3. Reduction of a Parabolic Type Equation to Canonical Form (9.84c): The only family of 
characteristics given in this case is selected for the family ἕξ = const, where an arbitrary function of x 
and y can be chosen for η, which must not be dependent on €. 
4. Reduction of an Elliptic Type Equation to Canonical Form (9.84d): If the coefficients 
A(x, y), B(x, y), C(x, y) are analytic functions (see 14.1.2.1, p. 732) in the elliptic case, then the char- 
acteristics define two complex conjugate families of curves y(x, y) = constant, w(x, y) = constant. By 
substituting € = y + v and η = i(y — v), the equation becomes of the form (9.84d). 


-=0. (9.84e) 
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4. Generalized Form 

Every statement for the classification and reduction to canonical forms remains valid for equations 
given in a more general form 

ὅτι Pu Ou Ou Ou 

—;5 + 2B(x, — +C(z, y) +F | x,y,u, 

Ox? (7,9) τὸν (x,y) Oy? d 
where F is a non-linear function of the unknown function u and its first-order partial derivatives Qu/Ox 
and ðu/ðy, in contrast to (9.824). 


A(x, y) =0, (9.85) 


9.2.2.2 Classification and Properties of Linear Second-Order Differential 
Equations with More than Two Independent Variables 


1. General Form 
A differential equation of this kind for u = u(x1, £2, . . - , &n) has the form 


Pu 
2 tik Fe D, +- =0, (9.86) 


where aj, are given functions of the independent variables and the dots in (9.86) mean terms not con- 
taining second-order derivatives of the unknown function. 

In general, the differential equation (9.86) cannot be reduced to asimple canonical form by transforming 
the independent variables. However, there is an important classification, similar to the one introduced 
above in 9.2.2.1, p. 576 (see [9.5]). 


2. Linear Second-Order Partial Differential Equations with Constant Coefficients 
If all coefficients a;; in (9.86) are constants, then the equation can be reduced by a linear homogeneous 
transformation of the independent variables into a simpler canonical form 


ὅτι 
Teg ντο (9.87) 


where the coefficients x; are +1 or 0. Several characteristic cases have to be distinguished. 

1. Elliptic Differential Equation If all coefficients «; are different from zero, and they have the same 
sign, then it is the case of an elliptic differential equation. 

2. Hyperbolic and Ultra-Hyperbolic Differential Equation If all coefficients κ; are different 
from zero, but one has a sign different from the other’s, then it is the case of a hyperbolic differential 
equation. If both types of signs occur at least twice, then it is an ultra-hyperbolic differential equation. 
3. Parabolic Differential Equation If one of the coefficients &; is equal to zero, the others are dif- 
ferent from zero and they have the same sign, then it is the case of a parabolic differential equation. 

4. Simple Case for Elliptic and Hyperbolic Differential Equations If not only the coefficients 
of the second order derivatives of the unknown function are constants, but also those of the first order 
derivatives, then it is possible to eliminate the terms of the first order derivatives, for which &; 4 0, by 
substitution. For this purpose is 


u = v exp (-ix za) ; (9.88) 


Kk 
: ; : Ou. -- 
substituted where b; is the coefficient of po (9.87) and the summation is performed for all x; 4 0. 
m 


In this way, every elliptic and hyperbolic differential equation with constant coefficients can be reduced 
to a simple form: 
2 


Ov 
δρ Av kv — g. (9.90) 


a) Elliptic Case: Av-- kv — g. (9.89) b) Hyperbolic Case: 8 
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Here A denotes the Laplace operator (see 13.2.6.5, p. 716). 


9.2.2.3 Integration Methods for Linear Second-Order Partial Differential 
Equations 


1. Method of Separation of Variables 

Certain solutions of several differential equations of physics can be determined by special substitutions, 
and although these are not general solutions, one gets a family of solutions depending on arbitrary 
parameters. Linear differential equations, especially those of second order, can often be solved if looking 
for a solution in the form of a product 


u(zi, ..., 24) = qai(zi)ga(2) . . qon (24). (9.91) 
Next, one tries to separate the functions qp, (cj), i.e., for each of them one wants to determine an or- 
dinary differential equation containing only one variable x}. This separation of variables is successful 
in many cases when the trial solution in the form of a product (9.91) is substituted into the given 
differential equation. In order to guarantee that the solution of the original equation satisfies the re- 
quired homogeneous boundary conditions, it may appear to be sufficient that some of functions qi (x1), 
(13), ... , Qu (v5) satisfy certain boundary conditions. 

By means of summation, differentiation and integration, new solutions can be acquired from the ob- 
tained ones; the parameters should be chosen so that the remaining boundary and initial conditions 
are satisfied (see examples). 

Finally, don't forget that the solutions obtained in this way, often infinite series and improper inte- 
grals, are only formal solutions. 'That is, one has to check whether the solution makes a physical sense, 
e.g., whether it is convergent, satisfies the original differential equation and the boundary conditions, 
whether it is differentiable termwise and whether the limit at the boundary exists. 

'The infinite series and improper integrals in the examples of this paragraph are convergent if the func- 
tions defining the boundary conditions satisfy the required conditions, e.g., the continuity assumption 
for the second derivatives in the first and the second examples. 


W A: Equation of the Vibrating String is a linear second-order partial differential equation o 
hyperbolic type 
Ou Pu 
esa c 9.92a 
Ot? Ox? ( 
It describes the vibration of a spanned string. The boundary and the initial conditions are: 
Ou 
u = f(x), =| = (2), uleso =0, uļz= = 0. (9.92b 
t=0 Ot [i-o 
Seeking a solution in the form 
u = X(v)T(t), (9.92c 
and after substituting it into the given equation (9.92a) follows 
qu xe 
— =L, (9.92d 
wT Xx 
The variables are separated, the right side depends on only x and the left side depends on only t, so 
each of them is a constant quantity. The constant must be negative, otherwise the boundary conditions 
cannot be satisfied, i.e., non-negative values give the trivial solution u(x,t) = 0. This negative constan 


is denoted by —A?. The result is an ordinary linear second-order differential equation with constant 
coefficients for both variables. For the general solution see 9.1.2.4, p. 555. The results are the linear 
differential equations 


X" +X =0, (9.929) and T"--a?X?T — 0. (9.928) 


From the boundary conditions follows X(0) = X(/) = 0. Hence X(x) is an eigenfunction of the 
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Sturm-Liouville boundary value problem and A? is the corresponding eigenvalue (see 9.1.3.1, 3., p. 569). 
Solving the differential equation (9.92e) for X with the corresponding boundary conditions one gets 

X(r)- Csinàr with sinM=0, ie,wih = > s. dass. (9.92g) 
Solving equation (9.92f) for T' yields a particular solution of the original differential equation (9.92a) 
for every eigenvalue A,: 


na 
Un(x, t) (o. cos 


τι + b, sin it) sin ται (9.921) 


Requiring that for t = 0, 


ul = M uj(r,0) is equal to f(x), and (9.92i) 
t=0 n=1 
i x δι, : ; 
or » = 2. = (x,0) is equal to y(x), (9.925) 
one gets with a Fourier series expansion in sines (see 7.4.1.1, 1., p. 474) 
2p ma 2 p nee 
ds = T / f(a) sin dx, by: = d (x) sin TT dr. (9.92k) 


W B: Equation of Longitudinal Vibration of a Bar is a linear second-order partial differential 
equation of hyperbolic type, which describes the longitudinal vibration of a bar with one end free and a 
constant force p affecting the fixed end. Here is to solve the same differential equation as in W A (p. 579), 
i.e., 


Ou Pu 
B "ns (9.93a 
with the same initial but different boundary conditions: 
Ou Ou 
[ —f(r, =| =pl), 9: a =0 (free end), 9.93 
Un f(x). BE s p(x),  (9.93b) δει, (free end), (9.93c 
Ou 
| =kp. 9.93d 
Ox |... p ( 
The conditions (9.93c,d) can be replaced by the homogeneous conditions 
Oz Oz 
= =| = 9.93e 
ὄσι-ο OX ( i 
where instead of u is introduced a new unknown function 
kpa? 
z=u- αι" (9.93f 
The differential equation becomes inhomogeneous: 
Oz 0? 2k 
E A (9.93g 


πα τα 7 
ot? Ox? 1 
Looking for the solution in the form z = v 4- w, where v satisfies the homogeneous differential equation 
with the initial and boundary conditions for z, i.e., 
kpa? Oz 
2 x | Fs vx), 
t=0 ) 2 Ot li—o (x) 


(9.93h) 
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and w satisfies the inhomogeneous differential equation with zero initial and boundary conditions. This 


k 2, t 

gives w = D . Substituting the product form of the unknown function v(x, t) into the differentia 
equation (9.93a) 

v = X(x)T(t) (9.93i 
gives the separated ordinary differential equations as in W A (p. 579) 

X" "pu I 

----π---λ᾽. 9.93] 

X^ T (9.93) 


Integrating the differential equation for X with the boundary conditions X'(0) = X'(I) = 0 one finds 
the eigenfunctions 


na 
Xn = cos En (9.93k 
and the corresponding eigenvalues 
96 
ο nr 
An? = "EC (n =0,1,2,...). (9.931 
Proceeding as in W A. (p. 579) one finally obtains 
ka?pt? | kpa? απ a annt bn annt nuc 
u H + ag + bot 4 Qn COS | sin cos " 9.93m 
2l op we ης 2 : η hn [ l ( 
where αν and b, (n = 0,1,2,...) are the coefficients of the Fourier series expansion in cosines of the 
kpa? 


functions f(a) — 


l 
and — (x) in the interval (0, [) (see 7.4.1.1, 1., p. 474). 
am 


W C: Equation of a Vibrating Round Membrane fixed along the boundary: 
'The differential equation is linear, partial and it is of hyperbolic type. It has the form in Cartesian and 
in polar coordinates (see 3.5.3.2,3., p. 211) 


Ou Pu 1 Pu Ou 1δι  10?u 1 u 

os πα τω 94a p= I : 94: 

δι; oy a OP’ ae) Op pOp PoP a? δ (3.94b 
The initial and boundary conditions are 

uli-o — f(p. v), (9.94c) = F(p, p), (9.944) u|p-g = 0. (9.94e 


dt io 
The substitution of the product form 

u = U(p)®(y)T(t) (9.94f 
with three variables into the differential equation in polar coordinates yields 

U" U' p" 1 Tq : 

| $= = 5 =-)’. (9.94g 

U pU P8 dT 
Three ordinary differential equations are obtained for the separated variables analogously to examples 
A (p. 579) and B (p. 580): 


2ΤΤΗ y 
PU toU Lic. aub (994i) 


T" a? XT — 0, (9.94h) 
U Φ 


Q" +P -- 0. (9.941) 
From the conditions Φ(0) = &(27), Φ'(0) = &'(27) it follows that: 
Φ(ᾳ) =a, cosnp + b, sinng, w=n? (n=0,1,2,...). (9.94k) 
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2 
U and A will be determined from the equations [pU"]' — 2s —\*pU and U(R) = 0. Considering 
p 


the obvious condition of boundedness of U (p) at p — 0 and substituting Ap — z gives 


2U" +20 + (2 nh) —0, ie, Ul) = Jal) = Ja (ως) (9.941) 
where J, are the Bessel functions (see 9.1.2.6, 2., p. 562) with \ = E and J,(u) = 0. The system of 
functions 

Όμι(ϱ) = Jn (ως) (k=1,2,...) (9.94m) 


with Hng as the k-th positive root of the function J,,(z) is a complete system of eigenfunctions of the 
self-adjoint Sturm-Liouville problem which are orthogonal with the weight function p. 
The solution of the problem can have the form of a double series: 


nkt 
U = τ = k ank COS NY + b, sin ny) cos ——— Glink 
n=0k=1 R 
b 
(ens cos n + dnk sin ng) sin -- | In (ως) li (9.94n 
From the initial conditions at t — 0 one obtains 
oo oo 
f(p,v) = b» aus cos nq + b, sinny) Jn (n) , (9.940 
n=0 k=1 
SES Ollnk . p 
F(p,g)- 3; (Cae cos nip + dne sin ng)Js [pk ]) . (9.94p 
— P R 
where 
2 2" p l 
US RERUMS (uai) ) cosngJs | Hnr ) Pdp, 9.94 
MET REJ? (Hink) / ae f" fp. p) cosmos (nies) pdp (9.94q 
i J d 9.94 
nk = στα πω, * A "Flo, y)sinnyJn (ως) o p. (9.94r 


In the case of n = 0, the numerator 2 should be changed to 1. To determine the coefficients cng and dnk 

the function f(p, ϱ) is replaced by F(p, p) in the formulas for anx and bng and finely it is multiplied by 
R 

mm 

B D: Dirichlet Problem (see 13.5.1, p. 729) for the rectangle 0 € x < a, 0 € y € b (Fig. 9.17): 


y Find a function u(x, y) satisfying the elliptic type Laplace differen- 
tial equation 
b Au =0 (9.95a 


and the boundary conditions 
u(0,y) = gily), ula, y) = ely), 


0 ax u(x,0) = φι(α). u(x, b) = p(x). (9.95b 
First there is to determine a particular solution for the boundary 
Figure 9.17 conditions qi(y) = v»(y) = 0. Substituting the product form 


u = X(x)Y (y) (9.95c 
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into (9.95a) gives the separated differential equations 


XU ye " 
vu ee E 954 
X Y (9.95d) 
with the eigenvalue A analogously to examples A (p. 579) through C (p. 581). Since X(0) = X(a) = 0, 
X=Csinàr, AUS =A, (n=1,2...). (9.956) 
a 


In the second step the general solution of the differential equation is obtained: 


Y =0 (9.95f) inthe form Y =a, sinh το — y) +b, sinh d (9.95g) 


From these equations one gets a particular solution of (9.95a) satisfying the boundary conditions u(0, y) — 
u(a, y) — 0, which has the form 


Un = la. sinh Bn — y) + b, sinh my sin Zr. (9.95h 
a a a 


In the third step one considers the general solution as a series 


e ds (9.95i 


n=1 
so from the boundary conditions for y = 0 and y = b 


oo š 
w= y (an sinh Z (b — y) + b, sinh y) sin Zr (9.951 
a a a 


n=1 


follows with the coefficients 
2 a . nm 
dpt αι A ψι(α) sin P gdt, b, - Um 
asinh — asinh — 
a a 
The problem with the boundary conditions (x) = v»(x) = 0 can be solved in a similar manner, and 
taking the series (9.95j) one gets the general solution of (9.95a) and (9.95b). 


2 


Di W(x) sin PT ede. (9.95k 
0 a 


W E: Heat Conduction Equation Heat conduction in a homogeneous bar with one end at infin- 
ity and the other end kept at a constant temperature is described by the linear second-order partia 
differential equation of parabolic type 


Ou Pu 

a4 σον (9.96a 
which satisfies the initial and boundary conditions 

uko = f(x),  u|,-o = 0 (9.96b 


in the domain 0 € x < +00, t > 0. It is also to be supposed that the temperature tends to zero a 
infinity. Substituting 


u = X(x)T(t) (9.96c 
into (9.96a) one obtains the ordinary differential equations 
T' x" : 
=^ =), .96d 
T X i κ 


whose parameter A is introduced analogously to the previous examples A (p. 579) through D (p. 582). 
One gets 


T equae (9.960) 
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as a solution for T(t). Using the boundary condition X (0) = 0, gives 


X(x) = Csin Ax (9.96f) andso uy Ce sin Az, (9.96g 
where λ is an arbitrary real number. The solution can be obtained in the form 
ulz, t) = f 7 Ce sin λα dA. (9.96h 
0 
From the initial condition αμ. ο = f(x) follows (9.961) with (9.961) for the constant (see 7.4.1.1, 1., 
Κα) = F C) sin λα dÀ , (996) | CQ)- e f(s)sinAsds. (9.96j 
p. 474). g 
Combining (9.96j) and (9.96h) gives 
2 [e go 2,2 
u(a,t) = — [ f(s) (f e^? tt gin Assin Ax dX) ds (9.96k 
π Jo Jo 


or after replacing the product of the two sines with one half of the difference of two cosines ((2.122), 
p. 83) and using formula (21.27), in Table 21.8.2, p. 1100, it follows that 


eo r—s) x4 s)? 
u(z,t) = [ fo e ( e) esp ( m λα. (9.961) 


2. Riemann Method for Solving Cauchy’s Problem for the Hyperbolic 
Differential Equation 
ὅτι Ou ,2u 


» toe FI δη tou=F (9.97a) 


1. Riemann Function is a function v(x, y; €, n), where € and η are considered as parameters, sat- 
isfying the homogeneous equation 


Ov OA(av) (bw) 
Oxdy Ox Oy 
which is the adjoint of (9.97a) and the conditions 


v(x, ης ἕξ, η) = exp Ü b(s,7) « , υ(ξ,ψ;ξ, m) = exp (/«« s) a) : (9.97c) 
£ 


η 


+cv=0 (9.97b) 


In general, linear second-order differential equations and their adjoint differential equations have the 
form 


O? (αυ) O(b;v) 


Yos; DE Xs + cu = f (9.97d) and Σ διε, 2 δα, ου — 0. (9.97e) 


2. Riemann Formula is the integral formula which is used to 
determine function u(£, η) satisfying the given differential equation 
(9.97a) and taking the previously given values along the previously 
given curve I’ (Fig. 9.18) together with its derivative in the direc- 
tion of the curve normal (see 3.6.1.2, 2., p. 244): 


1 1 1 / Qu ðv 
ul, η) = σ(αυ)ρ } gra / [w tg (vas e) dx 
Figure 9.18 SP 
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7 I 2 (s "5. dy ἡ " Fv dz dy. (9.971) 


PMQ 


The smooth curve I’ (Fig. 9.18) must not have tangents parallel to the coordinate axes, i.e., the curve 
must not be tangent to the characteristics. The line integral in this formula can be calculated, since the 
values of both partial derivatives can be determined from the function values and from its derivatives 
in a non-tangential direction along the curve arc. 

; — , Ou 
In the Cauchy problem, the values of the partial derivatives of the unknown function, e.g., δη are often 

y 

given instead of the normal derivative along the curve. Then another form of the Riemann formula is 
used: 


u(£, m) = (uv)p / ο 79 dx (ων | a) dy + "i Ευ dx dy. (9.97g) 
A PMQ 


QP 


BW Telegraph Equation (Telegrapher’s Equation) is a linear second-order partial differential equation 
of hyperbolic type 


a? ὃ Or 
am + 2b— + cu = = (9.98a) 


where a > 0, b, and c are constants. The equation describes the current flow in wires. It is a general- 
ization of the differential equation of a vibrating e 
Replacing the unknown function u(x,t) by u = z exp(— (b/a)t), so (9.98a) is reduced to the form 


932 poz ο. 1 ο U-ac 

og" Bp 4n (m x =e : (9.98b 
POMA the independent variables by 

E= Zim +2), n= L (mt — x) (9.98c 

m 
finally one je the canonical form 
Oz g 
mr) 9.98d 
ὄέδη 4 ( 


of a hyperbolic type linear partial differential equation (see 9.2.2.1, 1., p. 577). The Riemann function 
υ(ξ, η; £o, Mo) should satisfy this equation with unit value at € = £j and η = no. Choosing the form 


w = (£— &)(n = m) (9.980) 
for win v = f(w), then f(w) is a solution of the differential equation 
df df 1 


πα ae a - 0 (9.98f) 
with initial condition f(0) = 1. The substitution w = a? reduces this differential equation to Bessel’s 
differential equation of order zero (see 9.1.2.6, 2., p. 562) 

P ld 

EILIS aug (9.98g) 


da? ada 


hence the solution is 


v= Ig | (E — ξο)(η — n) ; (9.98h) 
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A solution of the original differential equation (9.98a) satisfying the boundary conditions 

Oz 

z = f(x) = 

t=0 ) Ot 

can be obtained substituting the found value of v into the Riemann formula and then returning to the 
original variables: 


z(a t= We mt) + f(x + mt)] 


yom lo e mt? — (s — 2) 
G m 
+5 g(s) 


= g(x) (9.981) 


t=0 


(9.98j) 


m 


3. Green’s Method of Solving the Boundary Value Problem for Elliptic 
Differential Equations with Two Independent Variables 
This method is very similar to the Riemann method of solving the Cauchy problem for hyperbolic dif- 
ferential equations. 
If one wants to find a function u(x, y) satisfying the elliptic type of linear second-order partial differen- 
tial equation 
Fu Pu ðu ðu 
-+5 +a +b cu = 9.99a 
δα Oy) ^ Or ὃν | f ( ) 
in a given domain and taking the prescribed values on its boundary, first the Green function G(x, y, €, η) 
has to be determined for this domain, where € and r are regarded as parameters. The Green function 
must satisfy the following conditions: 


1. The function G(x, y; €, η) satisfies the homogeneous adjoint differential equation 
PG | OG O(aG) O(bG) 
Ox? ^ Oy? Ox Oy 
everywhere in the given domain except at the point x = €, y = m. 
2. The function G(x, y; £, n) has the form 


+cG=0 (9.99b) 


1 
Uln = +V (9.996) with Τ--γ(α-ἑ)}η-(υ-η), (9.994) 


where U has unit value at the point x = €, y = η and U and V are continuous functions in the entire 
domain together with their second derivatives. 

3. The function G(x, y; €, n) is equal to zero on the boundary of the given domain. 

The second step is to give the solution of the boundary value problem with the Green function by the 
formula 


£, ARG (a, y €, m) d z flo, y)G(a,y; €, η) da dy, (9.090) 


where D is the considered domain, S is its boundary on which the function is assumed to be known and 


denotes the normal derivative directed toward the interior of D. 


On 
Condition 3 depends on the formulation of the problem. For instance, if instead of the function values 
the values of the derivative of the unknown function are given in the direction normal to the boundary 
of the domain, then in 3 the condition 


mc (a cos a + bcos 8)G = 0 (9.998 
n 
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holds on the boundary. α and f denote here the angles between the interior normal to the boundary of 
the domain and the coordinate axes. In this case, the solution is given by the formula 
1 [Ou 


1 
ulén) =-5- [ -Gds zS fG dz dy. (9.99g) 
S D 


4. Green's Method for the Solution of Boundary Value Problems with Three 
Independent Variables 
The solution of the differential equation 
ðu δι Qu 
Au+a +b Fc Feu 9.100a 
Or Oy Oz f ( 
should take the given values on the boundary of the considered domain. As the first step, one constructs 
again the Green function, but now it depends on three parameters €, 7, and ¢. The adjoint differentia. 
equation satisfied by the Green function has the form 
O(aG) AdA(bG)  O(cG) 
Ox Oy Oz 


As in condition 2, the function G(x, y, z; £, η, C) has the form 


AG +eG= (9.100b 


ut +V (9.100c) with r= γία € + (y n + (2 -- CY. (9.100d 


The solution of the pa is: 


u(&, η, Q) 3 [m an 4 s- « ff, fG dz dy dz. (9.100e 
p 


Both methods, Riemann's and Green's, have the common idea first to determine a special solution of the 
differential equation, which can then be used to obtain a solution with arbitrary boundary conditions. 
An essential difference between the Riemann and the Green function is that the first one depends only 
on the form of the left-hand side of the differential equation, while the second one depends also on the 
considered domain. Finding the Green function is, in practice, an extremely difficult problem, even if 
it is known to exist; therefore, Green's method is used mostly in theoretical research. 


W A: Construction of the Green function for the Dirichlet problem 
of the Laplace differential equation (see 13.5.1, p. 729) 

Au —0 (9.101a) 
for the case, when the considered domain is a circle (Fig. 9.19). 
The Green function is 


G(x, ψ;ξ. η) = E jd E E (9.101b) 


wherer — MP, p ou Ti M,P and R is the radius of the 
considered circle (Fig. 9.19). The points M and M, are symmet- 
ric with respect to the circle, i.e., both points are on the same ray 
starting from the center and 

OM ΟΛΠ = R°. (9.101c) 
The formula (0.99) for a solution of Dirichlet’s problem, after sub- 
stituting the normal derivative of the Green function and after cer- 
Figure 9.19 tain calculations, yields the so-called Poisson integral 


1 on R2 = p? 
2m Jo  R?-4 p? — 2Rpcos(v — p 


ul, n) = 11) dy. (9.101d) 
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The notation is the same as above. The known values of u are given on the boundary of the circle by 
u(y). For the coordinates of the point M (£, η) follows € = pcos v, η = psin v. 
W B: Construction of the Green function for the Dirichlet problem of the Laplace differential equation 
(see 13.5.1, p. 729) 

Au — 0, (9.102a 
for the case when the considered domain is a sphere with radius R. The Green function now has the 
form 


1 R 
G(x,y,z;£, m, Q) = -= — — (9.102b 

T np! 
with p = VE +n? + Q? as the distance of the point (£, η, C) from the center, r as the distance between 
the points (a, y, 2) and (€,7,¢), and rı as the distance of the point (x, y, z) from the symmetric poin 


RE a RG 


of (£, η, C) according to (9.101c), i.e., from the point (= =) . In this case, the Poisson integra: 
pp’ p 


has the form = the same notation as in Bl A (p. 587)) 
m- 
υ(ξ.η, C) dt m ds. (9.102c 
απ 


5. Operational re 

Operational methods can be used not only to solve ordinary differential equations but also for partia 
differential equations (see 15.1.6, p. 769). They are based on transition from the unknown function 
to its transform (see 15.1, p. 767). In this process the unknown function is regarded as a function ο 
only one variable and the transformation is performed with respect to this variable. The remaining 
variables are considered as parameters. The differential equation to determine the transform of the 
unknown function contains one less independent variable than the original equation. In particular, i 
the original equation is a partial differential equation of two independent variables, then one obtains 
an ordinary differential equation for the transform. If the transform of the unknown function can be 
found from the obtained equation, then the original function is obtained either from the formula for 
the inverse function or from the table of transforms. 


6. Approximation Methods 
In order to solve practical problems with partial differential equations, different approximation methods 
are used. They can be divided into analytical and numerical methods. 

1. Analytical Methods make possible the determination of approximate analytical expressions for 
the unknown function. 

2. Numerical Methods result in approximate values of the unknown function for certain values of 
the independent variables. Here the following methods (see 19.5, p. 976) are used: 

a) Finite Difference Method, or Lattice-Point Method: The derivatives are replaced by divided dif- 
ferences, so the differential equation including the initial and boundary conditions becomes an algebraic 
equation system. A linear differential equation with linear initial and boundary conditions becomes a 
linear equation system. 
b) Finite Element Method, or briefly FEM (see 19.5.3, p. 978), for boundary value problems: Here 
a variational problem is assigned to the boundary value problem. The approximation of the unknown 
function is performed by a spline approach, whose coefficients should be chosen to get the best possible 
solution. The domain of the boundary value problem is decomposed into regular sub-domains. The 
coefficients are determined by solving an extreme value problem. 

c) Integral Equation Method (along a Closed Curve) for special boundary problems: The bound- 
ary value problem is formulated as an equivalent integral equation problem along the boundary of the 
domain of the boundary value problem. To do this, one applies the theorems of vector analysis (see 
13.3.3, p. 724, and followings), e.g., Green formulas. The remaining integrals along the closed curve 
are to be determined numerically by a suitable quadrature formula. 
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3. Physical Solutions of differential equations can be given by experimental methods. This is based 
on the fact that various physical phenomena can be described by the same differential equation. To 
solve a given equation, first a model is constructed by which the given problem is simulated and the 
values of the unknown function are obtained directly from this model. Since such models are often 
known and can be constructed by varying the parameters in a wide range, the differential equation can 
also be applied in a wide domain of the variables. 


9.2.3 Some further Partial Differential Equations from Natural 
Sciences and Engineering 


9.2.3.1 Formulation of the Problem and the Boundary Conditions 


1. Problem Formulation 

'The modeling and the mathematical treatment of different physical phenomena in classical theoretical 
physics, especially in modeling media considered structureless or continuously changing, such as gases, 
fluids, solids, the fields of classical physics, leads to the introduction of partial differential equations. 
Examples are the wave (see 9.2.3.2, p. 590) and the heat equations (see 9.2.3.3, p. 591). Many problems 
in non-classical theoretical physics are also governed by partial differential equations. An important 
area is quantum mechanics, which is based on the recognition that media and fields are discontinu- 
ous. The most famous relation is the Schroedinger equation. Linear second-order partial differential 
equations occur most frequently and they have special importance in today's natural sciences. 

2. Initial and Boundary Conditions 

The solution of the problems of physics, engineering, and the natural sciences must usually fulfill two 
basic requirements: 

1. The solution must satisfy not only the differential equation, but also certain initial and/or boundary 
conditions. There are problems with only initial condition or only with boundary conditions or with 
both. All the conditions together must determine the unique solution of the differential equation. 

2. The solution must be stable with respect to small changes in the initial and boundary conditions, 
i.e., its change should be arbitrarily small if the perturbations of these conditions are small enough. 
Then a correct problem formulation is given. 
One can assume that the mathematical model of the given problem to describe the real situation is 
adequate only in cases when these conditions are fulfilled. 

For instance, the Cauchy problem (see 9.2.1.1, 5., p. 572) is correctly defined with a differential equation 
of hyperbolic type for investigating vibration processes in continuous media. This means that the values 
of the required function, and the values of its derivatives in a non-tangential (mostly in a normal) 
direction are given on an initial manifold, i.e., on a curve or on a surface. 

In the case of differential equations of elliptic type, which occur in investigations of steady state and 
equilibrium problems in continuous media, the formulation of the boundary value problem is correct. If 
the considered domain is unbounded, then the unknown function must satisfy certain given properties 
with unlimited increase of the independent variables. 


3. Inhomogeneous Conditions and Inhomogeneous Differential Equations 

The solution of homogeneous or inhomogeneous linear partial differential equations with inhomoge- 
neous initial or boundary conditions can be reduced to the solution of an equation which differs from 
the original one only by a free term not containing the unknown function, and which has homogeneous 
conditions. It is sufficient to replace the original function by its difference from an arbitrary twice dif- 
ferentiable function satisfying the given inhomogeneous conditions. 

In general, one uses the fact that the solution of a linear inhomogeneous partial differential equation 
with given inhomogeneous initial or boundary conditions is the sum of the solutions of the same dif- 
ferential equation with zero conditions and the solution of the corresponding homogeneous differential 
equation with the given conditions. 
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To reduce the solution of the linear inhomogeneous partial differential equation 


Ou 


δρ L[u] = g(x. t) (9.103a) 
with homogeneous initial conditions 
Ou 
[ =0, —- = 9.103k 
: t=0 "Ot i-o ( ) 


to the solution of the Cauchy problem for the corresponding homogeneous differential equation, one 
substitutes 


t 
u = ic t; 7) dr. (9.103c 
0 
Here y(x, t; τ) is the solution of the differential equation 
P L[u] =0 (9.103d 
— L[u] = 0, j 
Ot? 
which satisfies the boundary conditions 
Ou 
u =0, — e gd, T). 9.103e 
t=r Ot i ~ ( ) ( 
In this equation, x represents symbolically all the n variables £1, £2, . . . , x, of the n-dimensional prob- 
: : | i : i 2 Qu Ou | 
lem. L[u] denotes a linear differential expression, which may contain the derivative E but not higher- 


order derivatives with respect to t. 


9.2.3.2 Wave Equation 


The extension of oscillations in a homogeneous media is described by the wave equation 


ὅτι T 
ag ^ Au = Q(r,t), (9.104a) 
whose right-hand side Q(x, t) vanishes when there is no perturbation. The symbol x represents the n 
variables x1,... , &n of the n-dimensional problem. The Laplace operator A (see also 13.2.6.5, 716,) is 
defined in the following way: 
Ou Pu ὅτι 
Au dnp | Dare? pes (9.104b) 


B Or? 
The solution of the wave equation is the wave function u. The differential equation (9.104a) is of hy- 
perbolic type. 
1. Homogeneous Problem 
The solution of the homogeneous problem with Q(x, t) = 0 and with the initial conditions 

Ou 


ul Ξφία). = 
EU ^ ot 


t=0 


is given for the cases n = 1, 2,3 by the following integrals. 
Case n = 3 (Kirchhoff Formula): 


1 w(a1, a2, 03) ,9 φίαι, a2, 03) ; 
4ra? / t - Ot JI t do); (2.1058) 
(Sat) (Sat) 


where the integration is performed over the spherical surface Sa: given by the equation (αι — x1)? + 
229.4 4.2 9242 
(αο — x2)? + (a3 — v3) art’. 


u(x, σον 73,0) = 
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Case n = 2 nn Formula): 


u(z1, £9, t) al, I 
IN ve? -( (αι — 21)? — (ag — x3)? 


φίαι. 03) daydag 
| (9.106b) 
E7 7 ποσο» — 


ib (01, αὐ) daydag 


(ag — x2) 
where the integration is ic n along the circle Cat given by the equation (a — c 1)? + (a9 — πο)» < 
2,2 
üt. 
Case n — 1 (d'Alembert formula): 


; i cim 1 zı+at 
u(t) - pla ta ) H en a ) F / Yla) da. (9.106c) 
i1—at 


2. Inhomogeneous Problem 

In the case, when Q(a,t) # 0, one has to add to the right-hand sides of (9.106a,b,c) the correcting 
terms: 

Casen — 3 — Potential): Fora domain K given by r « at with 


r= γί ξι — 21)? + (ξο — x3)? + (3 — v3)?, the correction term is 


L3 ή πα iin =) dé dE ads . (9.107a) 


Q(&, £2, T) ἀξιαξοᾶτ 
© s , 9.107: 
ase n 2πα a j vet T)? — (& — 21)? — (& — 2)? ; ( ») 


where K is a domain of ξι, 2,7 space defined by the inequalities 0 € 7 < t, (£j — 1)? + (£g — 23)? < 


a?(t — r^. 
Casen=1: = al / Q(£, τ) d£dr, (9.1070) 


where T' is the Ws 0c T €t,|£ — σι] € a|t — τ]. a denotes the wave velocity of the perturbation. 


9.2.3.3. Heat Conduction and Diffusion Equation for Homogeneous Media 


1. Three-Dimensional Heat Conduction Equation 
The propagation of heat in a homogeneous medium is described by a linear second-order partial differ- 
ential equation of parabolic type 

Ou ; 
δι a? Au = Q(x, t), (9.1082) 
where A is the three-dimensional Laplace operator defined in three directions of propagation x1, 20, 
x3, determined by the position vector r. If the heat flow has neither source nor sink, the right-hand side 
vanishes since Q(x, t) = 0. 
The Cauchy problem can be posed in the following way: It is to determine a bounded solution u(x, t) for 


592 9. Differential Equations 


t > 0, where u[izo = f(x). The requirement of boundedness guarantees the uniqueness of the solution. 
For the homogeneous differential equation with Q(x, t) = 0, one gets the wave function 


-Foo +00 +00 
u(x1, x2, 23,0) = mE / a J Nonono) 
E a 2? 4 (a: NE 
οφ Γη m = (s αν) ) daidasdag. (9.108b) 
a?t 


In the case of an inhomogeneous differential equation with Q(x, t) # 0, one has to add to the right-hand 
side of (9.108b) the following expression: 


E = 


—00 --οο --οο 
(x1 — αι)” + (3; — a2)? + (zs — αἲ)” 
: exp 
4a?(t — τ) 
The problem of determining u(x,t) for t < 0, if the values u(x, 0) are given, cannot be solved in this 
way, since the Cauchy problem is not correctly formulated in this case. 
Since the temperature difference is proportional to the heat, one often introduces u = T (r,t) (temper- 
ature field) and a? = Dy (heat diffusion constant or thermal conductivity) to get 
OT 
ot 
2. Three-Dimensional Diffusion Equation 
In analogy to the heat equation, the propagation of a concentration C in a homogeneous medium is 
described by the same linear partial differential equation (9.108a) and (9.108d), where Dw is replaced 
by the three-dimensional diffusion coefficient Dc. The diffusion equation is: 
OC 
a Dc AC = Qc (F, t). (9.109) 
One gets the solutions by changing the symbols in the wave equations (9.108b) and (9.108c). 


9.2.3.4 Potential Equation 
The linear second-order partial differential equation 

Au = --4πο (9.110a) 
is called the potential equation or Poisson differential equation (see 13.5.2, p. 729), which makes the 
determination of the potential u(x) of a scalar field determined by a scalar point function o(x) possible, 
where x has the coordinates x1, £2, 73 and A is the Laplace operator. The solution, the potential 
Uy (11, 2, 13) at the point M, is discussed in 13.5.2, p. 729. 
One gets the Laplace differential equation (see 13.5.1, p. 729) for the homogeneous differential equation 
with o = 0: 

Au = 0. (9.110b) 
The differential equations (9.110a) and (9.110b) are of elliptic type. 


) ο dr. (9.108c) 


- DwAT = Qwif, 1). (9.108d) 


9.2.4 Schroedinger's Equation 
9.2.4.1 Notion of the Schroedinger Equation 


1. Determination and Dependencies 
The solutions of the Schroedinger equation, the wave functions v, describe the properties of a quantum 
mechanical system, i.e., the properties ofthe states of a particle. The Schroedinger equation is a second- 
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order partial differential equation with the second-order derivatives of the wave function with respect 
to the space coordinates and first-order with respect to the time coordinate: 


pt R ay + U (21, £2, 23, t) Y = É Y (9.111a) 

i im £1, £2, 23; = Alla 
;2 

b=" ogg t? 2jw (9.111b) 
m i or i 


: hk, F - i . A 

Here, A is the Laplace operator, A = — is the reduced Planck’s constant, i is the imaginary unit and 
π 

V is the nabla operator. The relation between the impulse p of a free particle with mass m and wave 
length \ is \ = h/p. 
2. Remarks: 
a) In quantum mechanics, an operator is assigned to every measurable quantity. The operator occur- 
ring in (9.111a) and (9.111b) is called the Hamilton operator H (“Hamiltonian”) . It has the same role 
as the Hamilton function of classical mechanical systems (see, e.g., the example on Two-Body Problem 
on p. 574). It represents the total energy of the system which is divided into kinetic and potential en- 
ergy. The first term in H is the operator for the kinetic energy, the second one for the potential energy. 
b) The imaginary unit appears explicitly in the Schroedinger equation. Consequently, the wave func- 
tions are complex functions. Both real functions occurring in v) + iq?) are needed to calculate the 
observable quantities. The square |W|? of the wave function, describing the probability dw of the par- 
ticle being in an arbitrary volume element dV of the observed domain, must satisfy special further 
conditions. 


c) Besides the potential of the interaction, every special solution depends also on the initial and bound- 
ary conditions of the given problem. In general, there is a linear second-order boundary value problem, 
whose solutions have physical meaning only for the eigenvalues. The squares of the absolute value of 
meaningful solutions are everywhere unique and regular, and tend to zero at infinity. 

d) The micro-particles also have wave and particle properties based on the wave-particle duality, so 
the Schroedinger equation is a wave equation (see 9.2.3.2, p. 590) for the De Broglie matter waves. 

e) The restriction to the non-relativistic case means that the velocity v of the particle is very small 
with respect to the velocity of light c (v < c). 


'The application of the Schroedinger equations is discussed in detail in the literature of theoretical 
physics (see, e.g., [9.15], (9.7].[9.10], [22.15]). In this chapter only some most important examples are 
demonstrated. 

9.2.4.2. Time-Dependent Schroedinger Equation 

The time-dependent Schroedinger equation (9.111a) describes the general non-relativistic case of a 
spin-less particle with mass m in a position-dependent and time-dependent potential field U (a; , £2, v3, t). 
The special conditions, which must be satisfied by the wave function, are: 

a) The function 7 must be bounded and continuous. 

b) The partial derivatives 0y)/0x1, Ov/Ox», and Ov/Oxa must be continuous. 

c) The function |x|? must be integrable, i.e., 


I lb (x4, £2, £3, t)? dV < oc. (9.112a) 
P 


According to the normalization condition, the probability that the particle is in the considered domain 
must be equal to one. (9.112a) is sufficient to guarantee the condition, since multiplying by an ap- 
propriate constant the value of the integral becomes one. 

A solution of the time-dependent Schroedinger equation has the form 


ψί(αι, x2, £3, t) = V (x1, £2, £3) exp (-1ξι) i (9.112b) 
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The state of the particle is described by a periodic function of time with angular frequency w = E/h. 1 
the energy of the particle has the fixed value E = const, then the probability dw of finding the particle 
in a space element dV is independent of time: 


dw = ||? dV = vw* dV. (9.112c 
Then one speaks about a stationary state of the particle. 
9.2.4.3 Time-Independent Schroedinger Equation 


If the potential U does not depend on time, i.e., U = U (z1, £2, %3), then it is the time-independen 
Schroedinger equation and the wave function V (x1, £2, x3) is sufficient to describe the state. Reducing 
it from the time-dependent Schroedinger equation (9.111a) with the solution (9.112b) gives 


2m 


AV + qi E — U)v — 0. (9.113a. 
i 
In this non-relativistic case, the energy of the particle is 
2 
p h 
=— (ος. h=2rħ). 1131 
Eni (p y^ vh) (9.113b 


The wave functions V satisfying this differential equation are the eigenfunctions; they exist only for 
certain energy values E, which are given for the considered problem of the special boundary conditions. 
The union of the eigenvalues forms the energy spectrum of the particle. If U is a potential of finite depth 
and it tends to zero at infinity, then the negative eigenvalues form a discrete spectrum. 


If the considered domain is the entire space, then it can be required as a boundary condition that V is 
quadratically integrable in the entire space in the Lebesgue sense (see 12.9.3.2, p. 696 and [8.5]). If the 
domain is finite, e.g., a sphere or a cylinder, then one can require, e.g., V = 0 for the boundary as the 
first boundary condition problem. 

This gives the Helmholtz differential equation in the special case of U (x) = 0: 


AV 4- AV =0 (9.1142) with the eigenvalue A= (9.114b) 
W = 0 is often required here as a boundary condition. (9.114a) represents the initial mathematical 
equation for acoustic oscillation in a finite domain. 


9.2.4.4 Statistical Interpretation of the Wave Function 


'The quantum mechanics postulates that the complete description of a regarded single-particle system 
in the time { is to be performed by the complex wave function w(r, t) as a state function and normalized 
solution of the Schroedinger equation. So, the wave function contains all possible experimental infor- 
mation, which can be got by measurements on this system. There exist no hidden sub-structures of the 
theory and no hidden parameters which could eliminate the principal statistical character of quantum 
mechanics, as it contains the connection of state function and t and measurement results. 


1. Observable and Probability Amplitude 


A physical expression (position, momentum, angular momentum, energy), which can be determined 
by a suitable measuring instrument, is called an observable. In quantum mechanics every observable A 


is represented by a linear, hermitian operator A with At = A, which interacts on the wave function. 
At the same time the operator of the quantum mechanics takes over the structure of the classical ex- 
pression. 


W For the operator 1 of the angular momentum, where r is the position operator p the momentum 
operator: 


"I SOs. so aa ae fh ὃ o 
1= (ls, ly l) =x P ie. ως Up; ih - t o a) (9.115a) 
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" h ὃ ὃ ^ h ὃ ὃ 
ly = Zp, — Ep, = z 3 , b=ap jp, =- |£ ‘ 1151 
v = 2ps — 2f = 7 ( Jz 5.) ; tPy — YP i e νο) (9.115b) 


In general, it is not possible to assign a certain numerical value to an observable by determining the 
wave function, but first only as the result of a measurement. The only possible measurement values A 


are the real eigenvalues a; of A; the associated eigenfunctions y; form a complete orthogonal system: 


Άφιξαιφι (i,k =1,2,...), / φῖφκ αγ = ὃ; κ. (9.116 
v 


If the system is in an arbitrarily general state 7), the result of a single experiment, i.e. the occurrence 
of a certain measure value a; in a single measurement can not be predicted. If imaging to perform the 
measurement on N — oo identical systems being in the same state v, then among the measuremen 
results every possible result a; can be found with a frequency N;. The probability W; to find the value 
à; in a single measurement can be determined: 


Ni 
W; = lim x SON =N. (9.117 


N-—oo 


To determine this probability from the wave function v, one performs an expansion of y as a series o 
eigenfunctions οι: 


ὁ-δαμ, affair "T 
i Uy 


The coefficient of expansion c; is the probability to find the system v in its characteristic state q;, i.e., 
to obtain the measuring value a;. From the absolute square of c; one gets the probability W; for the 
measuring value a;: 


2 W= f[[ vva =1. (9.119) 
i "AR 


Because in every measurement it is sure to find one of the possible measuring values a;, the sum of the 
B LE 
probabilities W; fulfills the condition of normalization for the wave function w . 


W; = 


Ci 


If two states /, ψο of a physical system are known, then from the linearity of the Schroedinger equation 
follows, that the superposition 

V — cV» (9.120) 
also represents a possible physical state. This fundamental superposition principle of quantum mechan- 
ics is the reason why at the determination of probabilities with the state function v, e.g., 

WWI? = [vn + vel? = Wa + da? + 2Βο(ψι v) (9.121) 
besides the single probabilities ||? , |v»|? occurs an additional term with sign. This explains the sur- 
prising interference effects of quantum mechanics, e.g. (wave-particle duality). 

2. Expectation Value and Uncertainty 
The quantum mechanical expectation value A is defined as the mean value of the measurement results 
obtained from measurements with N — oo identical systems: 


A= lim 1 Ma N; = Στα W; = // y* AddV. (9.122) 
U i 5 v 


N-00 N 


The expectation value usually is not identical to a possible measurement result. 


E The calculation of the expectation value F = (7, y, Z) of a position measurement for a particle in the 
state v(r, t) , e.g. 


om il z|o(z, t) dV 
/ 
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shows, that the wave function v(r,t) is to be interpreted as a probability amplitude. The absolute 
square |v[? then is a probability density. The expression 


dW = |v(z, t)? aV , faw = [| YE, tf? αν =1 
M 


is to be understood as a probability, to find the particle at the time t in the volume element dV in the 
position T (probability of the position). 

As a measure of the distribution of the measured results for an observable A, given for a general state in 
some measurements, can be defined by the help of the so-called uncertainty A A near the expectation 
value A, which is to be introduced via the standard error: 


Ν-λοο 


: 1 : 9 
(A A)? = lim N YN; (a; — AY? = Σ W; (a; — A)’. (9.123) 


By the help of the wave function v one can determine the uncertainty A A of a observable as expectation 
value of the quadratic deviation from the mean value A: 


(Δ.Α) =U- = 41 - 47 = "i v'(À — Ay dV . (9.124) 


If the system is in an eigenstate i; of A, then all measurements give the same measuring value a;: 
Ά --αι, AA=0. (9.125) 

A distribution near the expectation value A does not appear. 

3. Uncertainty Relation 

Considering two observable A , B , whose operators commutate (see Lie brackets in 5.3.6.4, 2., p. 356), 
ĉ = [À, B] = 8 — BÀ = 0, (9.126 


then (and only then) exists a simultaneous system of eigenfunctions p;p (i, v = 1,2,...) 


Api, = Qi Piw Boi, = by Piw- (9.127 
In this case exist physical states, in which the expected values of both operators are eigenvalues, so tha 
the uncertainties A A, A B simultaneously disappear: 

Ἄ--αι, B=b,, AA=AB=0. (9.128 
Performing in this system a measurement of the observable A, which leads with the measured value a; 
to the state Φιν, then a following measurement of B gives the measured value b, , without interfering 
the state generated in the first measurement (compatible observable, tolerance measurement). 


For two observable A and B , which are represented by non-commutative operators there does not exis 
a simultaneous system of eigenfunctions. In this case it is impossible to find a physical state, for which 
the uncertainties A A, A B can be simultaneously arbitrarily small. For the product of the uncertainties 


exists a lower bound, defined by the expectation value of the commutator C: 
1a 
AAAB > |z; (AB : (9.129) 
i 


This relation is called the uncertainty relation. The commutation relation (9.130) (see also 5.3.6.4, 
2., p. 356) between the components, e.g., of the position and the momentum operator into the same 
direction 

h 


h 
[Prt] = i (9.130) Ap, Ax > g* (9.131) 


leads to the Heisenberg uncertainty relation (9.131). In other words: there is a fundamental limitation 
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on how precisely both the position and the momentum of a particle can be simultaneously known. 
9.2.4.5 Force-Free Motion of a Particle in a Block 


1. Formulation of the Problem 
A particle with a mass m is moving freely in a block with impenetrable walls of edge lengths a, b, c, 
therefore, it is in a potential box which is infinitely high in all three directions because of the impene- 
trability of the walls. That is, the probability of the presence of the particle, and also the wave function 
V. vanishes outside the box. The Schroedinger equation and the boundary conditions for this problem 
are 

ο LANI ME (91322) W=0 fo J ; 
t 3 au j ; a = r y=, 
ðr? Oy δρ ρα 


μα 
y — b, (9.132b) 
E 


2. Solution Approach 
Separating the variables 

V (x, y, z) = V, (r) P (y) V.(z) (9.133a 
and substituting into (9.132a) gives 


1 d?v, 1 Y, 1 d?v, 2m 

id 1 4 ----φΕ--Β. (9.133b 
V, dx? W, dy  w,dz h^ 
Every term on the left-hand side depends only on one independent variable. Their sum can be a constant 
—B for arbitrary x, y, z only if every single term is a constant. In this case the partial differentia 
equation is reduced to three ordinary differential equations: 


PY, Pw, du, κ 


.2 E | 2 2 Ν 
dx? ks V, d ky Wy, dz ka vz. (9.133c 
The relation for the separation constants —k,”, —k,?, —k,? is 
: f Moan ; : 
kc key? +k? =B, (9.133d) consequently E = 5, Ue ΕΕ). (9.1390) 
m 


3. Solutions 
of the three equations (9.133c) are the functions 

W, = Asin krx, W, = Aysinkyy, V. = A;sink.z (9.134a) 
with the constants A,, Ay, Az. With these functions W satisfies the boundary conditions V = 0 for 
x= 0, y = 0 and z = 0. 


sink, a = sin ky b = sink; c = 0 (9.134b) 
must be valid to satisfy also the relation V = 0 for x = a, y = b and z = ç, i.e., the relations 
ko = T ee (9.134c) 
c 


must be satisfied, where ng, ny, and n; are integers. 
One gets for the total energy 


n? ns M? Ny \? nz\? 
En Ne E 1 :) 1 ( 2) la» Ny, Nz = 
Bees 2m ( 4) ( b ο (na, ny m 


It follows from this formula that the changes of energy of a particle by interchange with the neigh- 
borhood is not continuous, which is possible only in quantum systems. The numbers ng, ny, and nz, 
belonging to the eigenvalues of the energy, are called the quantum numbers. 


m 


Αη. (9.134d) 
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After calculating the product of constants Αν A,A, from the normalization condition 


lg o ThyU . 9 Whee 
sin? —” sin? 


abc 
(A, A,A.)? / (| f sin? e 
a 


000 


dx dy dz = 1 (9.134e) 


6G 


one gets the complete eigenfunctions of the states characterized by the three quantum numbers 


8 , πηι , mnyy . mnzz 
cya sin sin —“ sin . (9.134f) 
: abc a b ë 


The eigenfunctions vanish at the walls since one of the three sine functions is equal to zero. This is 
always the case outside the walls if the following relations are valid 

a 2a (nz — 1)a b 2b (n, — 1)b c 2c (η; — 1)c 

y ; n i z : 


, 2ttt3 ra ag 1 5 η ο... 


ng Ng Ne, Ny ny o Ny n, n, Ne 


(9.134g) 
So, there are n, — 1 and n, — 1 and n; — 1 planes perpendicular to the x- or y- or z-axis, in which V 
vanishes. These planes are called the nodal planes. 

4. Special Case of a Cube, Degeneracy 

In the special case of a cube with a = b = c, a particle can be in different states which are described 
by different linearly independent eigenfunctions and they have the same energy. This is the case when 
the sum ng? + ny? + n}? has the same value in different states. They are called degenerate states, and 
if there are i states with the same energy, they are called i-fold degeneracy. 

The quantum numbers ng, n, and n, can run through all real numbers, except zero. This last case 
would mean that the wave function is identically zero, i.e., the particle does not exist at any place in 
the box. The particle energy must remain finite, even if the temperature reaches absolute zero. This 
zero-point translational energy for a block is 


h? /1 1.1 
h= 5 (5 tg 5) (9.134h) 
9.2.4.6 Particle Movement ina Symmetric Central Field (sce 13.1.2.2, p. 702) 


1. Formulation of the Problem 

The considered particle moves in a central symmetric potential V (r). This model reproduces the move- 
ment of an electron in the electrostatic field of a positively charged nucleus. Since this is a spherically 
symmetric problem, it is reasonable to use spherical coordinates (Fig. 9.20). The following relations 
hold: 


1 e+y+2, v —rsincosc, 
z : : 
Ü = arccos — , y=rsinvsiny, (9.135a) 
» 
y 
p = arctan ^, z=rcosv, 
xv 


where r is the absolute value of the radius vector, Ὁ is the angle be- 
tween the radius vector and the z-axis (polar angle) and q is the 

j angle between the projection of the radius vector onto the x, y plane 
Figure 9.20 and the x-axis (azimuthal angle). For the Laplace operator 


Qu 20W 19 cosü OW 1 oy , 
ed Or? Or  r00  rsn0O0 rs d AG?’ ΕΘ 


holds, so the time-independent Schroedinger equation is: 


10 / ,ow 1 ὃ. οὐ 1 δν 2m 
f | sin à 5 =+- |E r)wv = 0. NR 
r? Or ( ο) r? sin 0 90 (sn 55) T r2sin? v Oy? R? [ Vi Q (9.135) 
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2. Solution 
Looking for a solution in the form 

Wr, 0, ϱ) = Ri(r)Y (0,9), (9.136a) 
where F; is the radial wave function depending only on r, and Y" (0, p) is the wave function depending 
on both angles. Substituting (9.136a) in (9.135c) gives 


1 ὃ f ,OR\ Vm , 2m m 
( E ΕΠΣΕ - V(r)]RY, 


r? Or Or 
1 9 oy” d. ory 
= sinv— | : £ Rip. 1360] 
i sin Ü QU (sn OV ) R r2sin? 9 Op? R) (9.1360) 
Dividing by RY” and multiplying by r? gives 
1 d {ναι 2mr? 1 1 af. Y" 1 OMS 
i z |E 3] = sin V + : d 

R, dr ( dr ) H h? [ ve) Yi" (sind ov ary) sin? 9 Oy? (9:1360) 


Equation (9.136c) can be satisfied if the expression on the left-hand side depending only on r and 
expression on the right-hand side depending only on Ὁ and ᾧ are equal to a constant, i.e., both sides 
being independent of each other are equal to the same constant. From the partial differential equation 
two ordinary differential equations follow. If the constant is chosen equal to I(l + 1), then the so-called 
radial equation results depending only on r and the potential V (r): 


1 d [4dR;\ 2m KL ος 1h? 
+t — E-V( : 0. 9.136d 
Ir? dr (: dr ) h? | (r) 2mr? ( 
'To find a solution for the angle-dependent part also in the separated form 
Y" (9, ϱ) = Θ(ϑ)Φ{ᾳ) (9.136e 
one substitutes (9.13606) into (9.136c) giving 
; 1 d dO 1d? 
àn? d } | -- 
sin Aer (sino 53) + {({ 4 D} Baer (9.136 
If the separation constant is chosen as m? in a reasonable way, then the so-called polar equation is 
1 d dO m? 
sind FEL 1 1966 
O sin dd (sa 2) νυ: α (9:1368 
and the azimuthal equation is 
2D ; 
+ mo — 0. (9.136h 


dy? 
Both equations are potential-independent, so they are valid for every central symmetric potential. 
There are three requirements for (9.136a): It should tend to zero for r — oo, it should be one-valued 
and quadratically integrable on the surface of the sphere. 
3. Solution of the Radial Equation 
Beside the potential V(r) the radial equation (9.136d) also contains the separation constant {1 + 1). 
Substituting 

u(r) 2 τ: Ri(r), (9.1372) 
since the square of the function u(r) gives the last required probability |u;(r)|?dr = |R)(r)|?r2dr of 
the presence of the particle in a spherical shell between r and r + dr. The substitution leads to the 
one-dimensional Schroedinger equation 
ulr) 2m πμ + 1)? 

" 


2mr? 


uj(r) — 0. (9.137b) 
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This one contains the effective potential 
Ver = V(r) + VD, (9.137c 
which has two parts. The rotation energy 


(0+1)? 


Vi(l) = Voor (l) = (9.137d 


2mr? 

is called the centrifugal potential. 
'The physical meaning of | as the orbital angular momentum follows from analogy with the classica 
rotation energy 


i 22 22 
1,4 (θῶ; P d 
Erot p 20 20  2mr? oe 


a rotating particle with moment of inertia O = yr? and orbital angular momentum | = Od: 


i? =1l+ 0Η, =ħyl(l +1). (9.137f 


4. Solution of the polar equation 

The polar equation (9.136), containing both separation constants l(1 -- 1) and γη”, is a Legendre differ- 
ential equation (9.60a), p. 565. Its solution is denoted by OP (0), and it can be determined by a power 
series expansion. Finite, single-valued and continuous solutions exist only for I(l + 1) = 0,2,6,12,.... 
One gets for | and m: 


22 
1 


[—0,152,.., πι €. (9.1382) 
So, m can take the (21 + 1) values 
-l, (--ἶ + 1), (-1+2),..., (0-2), (ἱ -- 1), l. (9.138b) 


For m Æ 0 one gets the corresponding Legendre polynomials, which are defined in the following way: 


κ. del) 2 οπου (cos? 8 — 1)! 

Pr" (cos ὃ) = ZI (1 — cos* 2) — (deosd)Hm - 

As a special case (l = 0, m = n, cos = Ὁ) follow the Legendre polynomials of the first kind (9.60c), 
p. 566. Their normalization results in the equation 


mg Atl (Lm) ome cy o NT PP (cos: 
Or (0) = 3 (xm) Pr" (cos 0) = Nj" Pr" (cos 0). (9.138d) 


5. Solution of the Azimuthal Equation 

Since the motion of the particle in the potential field V (r) is independent of the azimuthal angle even 
in the case of the physical assignment of a space direction, e.g., by a magnetic field, the general solution 
P = aci" ? + 8e" * can be specified by fixing 


(9.138c) 


mly) = Ae", (9.139 
because in this case |@,,|? is independent of p. The requirement for uniqueness is 

$, (p + 27) = Only), (9.139b 
so m can take on only the values 0, +1, +2, . 
It follows from the normalization 

2π 

2π 

[974 = Ια’ f dy = 2n| A}? (9.139c 

0 es 
that 

Í s 
$4(p)- ——e"* ~(m=0,4+1,+2,...). (9.139d 


^U 
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The quantum number m is called the magnetic quantum number. 


6. Complete Solution for the Dependency of the Angles 


In accordance with (9.136e), the solutions for the polar and the azimuthal equations should be multi- 
plied by each other: 


1 
σι ϱ) = OW) 8o) = NP (os vem’. (9.140a) 


The functions Y” (V, y) are the so-called surface spherical harmonics. 
When the radius vector F is reflected with respect to the origin (F — ~r), the angle Ὁ becomes 7 — Ὁ 
and ᾧ becomes y + 7, so the sign of Y,” may change: 


Υπ — 9, p +n) = (1) Y (9, 9). (9.140b) 
Then for the parity of the considered wave function holds 

P=(-ly. (9.141a) 
7. Parity 


The parity property serves the characterization of the behavior of the wave function under space in- 
version Y — -- (see 4.3.5.1, 1., p. 287). It is performed by the inversion or parity operator P: 
Pv(r,t) = v(—r,t). Denoting the eigenvalue of the operator by P, then applying P twice it must 
yield P P v(r,t) = P Pv(r.t) = Y (r,t), the original wave function. So: 

Pai = +1. (9.141b) 
It is called an even wave function if its sign does not change under space inversion, and it is called an 
odd wave function if its sign changes. 


9.2.4.7 Linear Harmonic Oscillator 


1. Formulation of the Problem 

Harmonic oscillation occurs when the drag forces in the oscillator satisfy Hooke’s law F = —kx. For 
the frequency of the oscillation, for the frequency of the oscillation circuit and for the potential energy 
the following formulas are valid: 


v= A (9.142a) be (9.142b) Epot = ake? = E μι (9.142c 
2πγ m ^ m’ E ΠΠ 2 
Substituting into (9.114a), the Schroedinger equation becomes 
dw 2m we ο ; 
de | 2 | 9 μα V --0. (9.143a 
With the substitutions 
y= nf, (9.143b) A= ls (9.143c 
᾽ h hw’ 


where λ is a parameter and not the wavelength, (9.143a) can be transformed into the simpler form o 
the Weber differential equation 


— + (A- y» = 0. (9.143d 


2. Solution 
A solution of the Weber differential equation can be got in the form 


V(y) = e"? H(y). (9.1442) 
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Differentiation shows that 


Py dH dH i 
2e 2 + (y? -1H|. 144k 
αρ ^ a 7 (y^ — 1) (9.144b 
Substitution into (9.143d) yields 
dH dH 
i +(à—1)H : .144c 
ar Ua ( )H=0 (9.144c 
The determination of a solution is convenient in the form of a series: 
oo . : PH oo 
H= Σαμ with = z Σα y a i(i — αν, 3. (9.145α 
i=0 dy dy i=2 
Substitution of (9.145a) into μιάς, results in 
i(i — 1)a -Σ Qiay’ + Sun — 0. (9.145b 
i—2 i=0 
Comparing the coefficients of y’ leads to the recursion formula 
6-2) + 1)aj+2 = [23 (^—1)]a; (j =0,1,2,...). (9.145c 


The coefficients a; for even powers of y begin from ag, the coefficients for odd powers begin from ay. 
So, ao and a, can be chosen arbitrarily. 


3. Physical Solutions 

The determination of the probability of the presence of a particle in the different states can be performed 
by a quadratically integrable wave function V (1) and by an eigenfunction which has physical meaning, 
i.e., normalizable and for αν values of y it tends to zero. 

The exponential function exp(—y?/2) in (9.144a) guarantees that the solution W(y) tends to zero for 
y — oo if the function H(y) is a polynomial. To get a polynomial, the coefficients a; in (9.145a), 


starting from a certain n, must vanish for every j > n: an # 0, Qn41 = àn42 = an+3 =... = 0. The 
recursion formula (9.145c) with j = n is 
2n-(A-1 
ο, ) (9.146a) 


(nt+2)(n+1)" 
an+2 = 0 can be satisfied for a, 4 0 only if 


2n-(A-1)=0, A= Se 2n +1. (9.146b) 


The coefficients an+2, @p44,... vanish for this choice of A. Also an-ı = 0 must hold to make the coeffi- 
cients ἄπει, 0543, ... equal to zero. 

One gets the Hermite polynomials from the second defining equation (see 9.1.2.6, 6., p. 568) for the 
special choice of a, = 2", a, 1 = 0. The first six of them are: 


Ho(y) = 1, Ha(y) = —12y + By", 
Ay(y) = 2y, Hi(y) = 12 — 48y? + 164, (9.146c 
Ha(y) = —2 + Ay?, Hs(y) = 120y — 160y? + 32^. 
The solution W(y) for the vibration quantum number n is 
V, = Nae” H, (y), (9.147a 


where N, is the normalizing factor. One gets it from the normalization condition / Wn’ dy = 1 as 


N= sv with va= Z = Va (see (9.143b), p. 601). (9.147b 


2^5! 
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From the terminating condition of the series (9.143c) 
1 
En = hw (n m (n-— 0,1,2,.:.) (9.147c) 


follows for the eigenvalues of the vibration energy. The spectrum of the energy levels is equidistant. 
The sum- 

mand +1/2 in the parentheses means that in contrast to the 

classical case the quantum mechanical oscillator has energy V69^ E(x) 

even in the deepest energetic level with n = 0, which is known 
as the zero-point vibration energy. 


Fig. 9.21 shows a graphical representation of the equidistant EA Bu 
spectra of the energy states, the corresponding wave functions 4 
from V, to V; and also the function of the potential energy \ 


(9.142c). The points of the parabola of the potential energy BA Z| 


represent the reversal points of the classical oscillator, which 2 
1 2.3 
are calculated from the energy E = =mw?a? as the amplitude κ- 1 
fiv 
1 [28 à ET ; *-0 
a = —4| —. The quantum mechanical probability of finding a " 
w\ m x 
particle in the interval (x, x -- da) is given by dwg, = |Y (x)|? dx. 
It is different from zero also outside of these points. So for, e.g., Figure 9.21 


JA χο δ - 
n = 1, hence for E = (3/2)hw, according to dw,, = 2 Dae dx, the maximum of the probability o. 
T 


presence is at 


Æl h 
maz qu = —= = 4| —. .147d 
Tmaz,q JX A oni (9.14 Τε 


For a corresponding classical oscillator, this is 


2E 3h. 
Tmazx,kl a 27+ s . (9.147e 
πιω πιω 


For large quantum numbers n the quantum mechanical probability density function approaches the 
classical one in its mean value. 


9.2.5 Non-Linear Partial Differential Equations: Solitons, Periodic 


Patterns and Chaos 
9.2.5.1 Formulation of the Physical-Mathematical Problem 
1. Notion of Solitons 


Solitons, also called solitary waves, from the viewpoint of physics, are pulses, or also localized distur- 
bances of a non-linear medium or field; the energy related to such propagating pulses or disturbances 
is concentrated in a narrow spatial region. They occur: 

ein solids, e.g., ininharmonic lattices, in Josephson contacts, in glass fibres and in quasi-one-dimensional 
conductors, 
e in fluids as surface waves or spin waves, 

ο in plasmas as Langmuir solitons, 

e in linear molecules, 

e in classical and quantum field theory. 

Solitons have both particle and wave properties; they are localized during their evolution, and the do- 
main of the localization, or the point around which the wave is localized, travels as a free particle; in 
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particular it can also be at rest. A soliton has a permanent wave structure: based on a balance between 
nonlinearity and dispersion, the form of this structure does not change. 

Mathematically, solitons are special solutions of certain non-linear partial differential equations occur- 
ring in physics, engineering and applied mathematics. Their special features are the absence of any 
dissipation and also that the non-linear terms cannot be handled by perturbation theory. Dissipative 
solitons are in non-conservative systems. 


2. Important Examples of Equations with Soliton Solutions 


a) Korteweg de Vries (KdV) Equation Ut + Cutty + User = 0, (9.148) 
b) Non-Linear Schroedinger (NLS) Equation iu; + tex + 2|u[?u = 0, (9.149) 
c) Sine-Gordon (SG) Equation Utt — Ure + sinu = 0. (9.150) 


The subscripts x and t denote partial derivatives, e.g., Ure = 0?u/Ox". 
In these equations the one-dimensional case is considered, i.e., u has the form u = u(x,t), where x is the 
spatial coordinate and t is the time. The equations are given in a scaled form, i.e., the two independent 
variables 2; and t are here dimensionless quantities. In practical applications, they must be multiplied 
by quantities having the corresponding dimensions and being characteristic of the given problem. The 
same holds for the velocity. 

3. Interaction between Solitons 

If two solitons, moving with different velocities, collide, they appear again after the interaction as if 
they had not collided. Every soliton asymptotically keeps its form and velocity; there is only a phase 
shift. Two solitons can interact without disturbing each other asymptotically. This is called an elastic 
interaction which is equivalent to the existence of an N-soliton solution, where N (N — 1,2,3,...) is 
the number of solitons. Solving an initial value problem with a given initial pulse u(x,0) that disag- 
gregate into solitons, the number of solitons does not depend on the shape of the pulse but on its total 
amount [72° u(x, 0) da. 

4. Periodic Patterns and Non-Linear Waves 
Such non-linear phenomena occur in several classic dissipative systems (i.e. friction or damping sys- 
tems), when an external impact or force is sufficiently large. E.g., if there is a layer of fluid in the 
gravitational field, and it is heated from below, the difference of temperature between the upper and 
lower surface corresponds to an external force. The higher temperature of the lower layer reduces its 
density and makes it lighter than the upper part, so the layering becomes unstable. At a sufficiently 
large temperature difference this unstable layering turns spontaneously into periodically arranged con- 
vection cells. It is called the bifurcation from the state of thermal conductivity (without convection 
into the well ordered Rayleigh-Bénard convection. Taking away the external force results because o: 
dissipation into damping of the waves (here the cellular convection). Strengthening of the externa 
force drives the ordered convection into a turbulent convection and into chaos (see 17.3, p. 892). Also 
in chemical reactions similar phenomena can occur. Important examples for equations describing such 
phenomena are: 
a) Ginsburg-Landau (GL) Equation uy u— (14+ ib)u, + (1 + ic)|ul?u = 0, (9.151 


b) Kuramoto-Sivashinsky (KS) Equation τι + Urr + Uerer + u2 = 0. (9.152 


In contrast to the dissipation-less KdV, NLS, SG, equations, the equations (9.151) and (9.152) are non- 
linear dissipative equations, which have, besides spatiotemporal periodic solutions, also spatiotempora. 
disordered (chaotic) solutions. Appearance of spatiotemporal patterns or structures is characteristic 
which turn into chaos. 

5. Dissipative Solitons 

Solitary (isolated) wave phenomena in non-conservative systems often are called dissipative solitons. 
Contrary to the conservative systems, in which the solitons usually form families of solutions with at 
least one continuously changing parameter, dissipative solitons can be found at single points of the 
parameter space, at which a balance is formed between dispersion and nonlinearity from one side and 
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energy or particle flow and dissipation from the other side. This property leads to a special kind of sta- 
bility of dissipative solitons, although they are not solutions of integrable wave equations. Dissipative 
solitons are described among others by the complex Ginsburg-Landau equation. They occur ,e.g., in 
non-linear optic cavitations, in optic semiconductor amplifiers and in reaction-diffusion systems (see 
also [9.16]). 

6. Non-Linear Evolution Equations 
Evolution equations describe the evolution of a physical quantity in time. Examples are the wave equa- 
tion (see 9.2.3.2, p. 590), the heat equation (see 9.2.3.3, p. 591) and the Schroedinger equation (see 
9.2.4.1, 1., p. 592). The solutions of the evolution equations are called evolution functions. 
In contrast to linear evolution equations, the non-linear evolution equations (9.148), (9.149), and (9.150) 
contain non-linear terms like uQu/Oc, 


u|?u, sinu and u2. These equations are (with the exception of 
(9.151)) parameter-free. From the viewpoint of physics non-linear evolution equations describe struc- 
ture formations like solitons (dispersive structures) as well as periodic patterns and non-linear waves 
(dissipative structures). 


9.2.5.2 Korteweg de Vries Equation 
(KdV) 


1. Occurrence 

The KdV equation is used in the discussion of 

e surface waves in shallow water, 

e inharmonic vibrations in non-linear lattices, 

e problems of plasma physics and 

e non-linear electric networks. 

2. Equation and Solutions 

The KdV equation for the evolution function u is 


Up + 6uus + User = 0. (9.153) 
It has the soliton solution 0 
; -vt- 
u(x,t) i (9.154) E 


'osh? | i IT στ UU -- 
2cosh* |5 /v(z — vt e) Figure 9.22 


This KdV soliton is uniquely defined by the two dimensionless parameters v (v > 0) and y. In Fig. 
9.22 v = 1 is chosen. A typical non-linear effect is that the velocity of the soliton v determines the 
amplitude and the width of the soliton: KdV solitons with larger amplitude and smaller width move 
faster than those with smaller amplitude and larger width (taller waves travel faster than shorter ones). 
The soliton phase y describes the position of the maximum of the soliton at time t = 0. 


Equation (9.153) also has N-solitons solutions. Such an N-soliton solution can be represented asymp- 
totically for t + -Eoo by the linear superposition of one-soliton solutions: 
N 


u(a,t) ~ M us (n, t). (9.155) 


n=1 


Here every evolution function u,(a,t) is characterized by a velocity v, and a phase y*. The initial 
phases y} before the interaction or collision differ from the final phases after the collision v7 , while the 
velocities v1, v2,...,Un have no changes, i.e., it is an elastic interaction. 

For N — 2, (9.153) has a two-soliton solution. It cannot be represented for a finite time by a linear 


1 1 
superposition, and with kp = aV". and a, = 3 Un(T — Unt — Yn) (n = 1,2) it has the form: 
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i ; 1 : ͵ 
klee + kde? + (ky — ο) εαν ται) |2 + — — — (kle + kee) 
E (kı + ka)? 
u(x,t) =8 A 3 . (9.156) 
ky — kg" 
1 209 20: ο(αι αρ) 
ΠΣ 3 ? 


Equation (9.156) describes two non-interacting solitons for £ — —oo asymptotically with velocities 
vi = 41 and vz = 4k3, which transform after their mutual interaction again into two non-interacting 
solitons with the same velocities for t + +00 asymptotically. 

The non-linear evolution equation 


wy + 6(w,)? + uns, = 0 (9.1572) 
Fr z . 
where w = T has the following properties: 
1 
a) For F(a,t)=1+e*%, a= ονυία — vt — q) (9.157b) 
it has a soliton solution and 
k — ον 
b) for F(a,t)=1+e%+e%4 εἰθι 122) 9.157c 
) for F(x,t) e e (οτε) € ( c) 


it has a two-soliton solution. With 2w, — u the KdV equation (9.153) follows from (9.157a). Equation 
(9.156) and the expression w following from (9.157c) are examples of a non-linear superposition. 

If the term +6uu, is replaced by —6uu, in (9.153), then the right-hand side of (9.154) has to be multi- 
plied by (—1). In this case the notation antisoliton is used. 


9.2.5.3 Non-Linear Schroedinger Equation (NLS) 

1. Occurrence 

The NLS equation occurs 

e in non-linear optics, where the refractive index n depends on the electric field strength E, as, e.g., for 
the Kerr effect, where n(E) = no + n| E|? with no, no = constant holds, and 

e in the hydrodynamics of self-gravitating discs which allow us to describe galactic spiral arms. 

2. Equation and Solution 

The NLS equation for the evolution function u and its solution are: 


op XP (-ἱ[2ξα + AE? — *)t — x]) 
cosh[2n(x + 4ξί — ϱ)] : 


itt + Ure + 2\ul?u=0, (9.158) u(x,t) 


(9.159) 


Here u(x, t) is complex. The NLS soliton is characterized by the four dimensionless parameters η, E, o, 
and y. The envelope of the wave packet moves 

with the velocity v = —4£; the phase velocity ^Re u(x,t) 

of the wave packet is 2(1? — €?) /£. 1 
In contrast to the KdV soliton (9.154), the 
amplitude and the velocity can be chosen in- 
dependently of each other. Fig. 9.23 displays 
the real part of (9.159) with η = 1/2 and 
£ —4. 
The solutions of the form (9.159) often are 
called light solitons; they solve the focusing 
NLS equation (9.158) for the case ”+”. The 
defocusing NLS equation (case "—") gives 
solitons, for which |u|? at the position of the Figure 9.23 
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solitons is reduced in comparison to a constant background [u(x — +oo)| = η. Such dark solitons have 
the form 


u(a,t) = iZ + y" - i tanh je - Eu — vt)| | - exp [-i (2η! + χ)] : (9.160) 
| 2 4 4 j 


They depend on the three parameters η, v and x and propagate with the velocity v < 27 on a back- 
ground with trivial (flat) phase (see [9.26], [9.23]). 

A general solution has in addition a phase gradient, which can be interpreted as velocity c of the back- 
ground, relative to which the soliton is moving. Then the solution has the following form: 


(æt) = [iz + Jm? (ΠΠ "TE τ t— ct) e i | 277 B dta 
u(x,t) = |i- n° — — | tanh p= =j =t =n ex 1| 27 t - : 
: 2 ] 1 ] 1 p ο μή 


(9.161) 


Beside of these exponential positioned soliton waves also periodic solutions of the NLS equation exist, 
which can be interpreted as wave packets of solitons. Such solutions can be find demanding stationarity 
and by integration of the remaining ordinary differential equation. Generally such solutions are elliptic 
Jacobian-functions (see 14.6.2, p. 763). Some relevant solutions see [9.17]. 

In the case of N interacting solitons, one can characterize them by 4N arbitrary chosen parameters: 
Nn, Eni Qn;Xn. (n=1,2,...,N). 

If the solitons have different velocities, the N-soliton solution splits asymptotically for t + +00 into a 
sum of N individual solitons of the form (9.159). 


9.2.5.4 Sine-Gordon Equation (SG) 

1. Occurrence 

The SG equation is obtained from the Bloch equation for spatially inhomogeneous quantum mechani- 
cal two-level systems. It describes the propagation of 

e ultra-short pulses in resonant laser media (self-induced transparency), 

e the magnetic flux in large surface Josephson contacts, i.e., in tunnel contacts between two supercon- 
ductors and 

ο spin waves in superfluid helium 3 (Πο). 

The soliton solution of the SG equation can be illustrated by a mechanical model of pendula and springs. 
The evolution function goes continuously from 0 to a constant value c. The SG solitons are often called 
kink solitons. If the evolution function changes from the constant value c to 0, it describes a so-called 
antikink soliton. Walls of domain structures can be described with this type of solutions. 


2. Equation and Solution 
The SG equation for the evolution function u is 
Ug — Uge + sinu = 0. (9.162) 
It has the following soliton solutions: 
1. Kink Soliton 
u(z,t) = 4arctan αγ πο ο (9.163) 


^ u(x,t) 


1 
ore Y = ; 
where 7 VIT and —1 < v < +1. 
The kink soliton (9.163) for v = 1/2 is given in Fig. 9.24. 
The kink soliton is determined by two dimensionless pa- X-X,-vt 
rameters v and zo. The velocity is independent of the am- 
plitude. The time and the position derivatives are ordi- Figure 9.24 
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nary localized solitons: 


Ut 2y 


z= . 9.164 
v cosh g(x — πρ — vt) ( ) 


2. Antikink Soliton 
u(x,t) ο... (9.165) 
3. Kink-Antikink Soliton One gets a static kink-antikink soliton from (9.163, 9.165) with v — 0: 
u(x, t) = 4arctan et», (9.166) 
Further solutions of (9.162) are: 
4. Kink-Kink Collision 


u(x,t) = 4arctan |v sint | (9.167 
| cosh yut 
5. Kink-Antikink Collision 
u(x,t) = 4arctan TUM (9.168 
| v cosh yx 
6. Double or Breather Soliton, also called Kink-Antikink Doublet 
[/1—u? sin wt 
u(x,t) = 4arcta i 9.169 
ώς κ. w — coshV1-— wx ( 


Equation (9.169) represents a stationary wave, whose envelope is modulated by the frequency w. 
7. Local Periodic Kink Lattice 


Coa md NS (9.1702) 
u(x,t) = 2arcsin | +sn j Fm. 170a 
| ky1— v? 
The relation between the wavelength A and the lattice constant k is 

à = AK(k)kv 1 — v?. (9.170b) 


For k = 1, i.e., for A — oo, one gets 


which is the kink soliton (9.163) and the antikink soliton (9.165) again, with xo = 0. 


Remark: snz is a Jacobian elliptic function with parameter k and quarter-period K (see 14.6.2, 
p. 763): 


u(x,t) = 4arctan οὐ αἱ) (9.170c) 


sng = sin y(x, k), (9.171a) 
sin y(x,k) 7/2 
d 
z “4g amp) κω- f EX fae 
0 (αλα — kg?) Ὁ Vl- k’sin O 


Equation (9.171b) comes from (14.104a), p. 763, by the substitution of sin = q. The series expansion 
of the complete elliptic integral is given as equation (8.104), in 8.2.5, 7., p. 515. 


9.2.5.5 Further Non-linear Evolution Equations with Soliton Solutions 


1. Modified KdV Equation 
u, + θά u, + Urre = 0. (9.172) 


The even more general equation (9.173) has the soliton (9.174) as solution. 
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jdesDos«2 |” 


Up + uPu, + Urse =0, (9.173) u(x,t) = Τ (9.174) 
^ 3 2 T vi — 
cosh (sov — vt ο)] 
2. Sinh-Gordon Equation 
Utt — Ure + sinh u = 0. (9.175) 


3. Boussinesq Equation 
Ure ο + usus = 0. (9.176) 


This equation occurs in the description of non-linear electric networks as a continuous approximation 
of the charge-voltage relation. 
4. Hirota Equation 

uj; + i3o|u|?u, + Bus, + igUerr + δι] τι = 0, aß = có. (9.177) 


5. Burgers Equation 
Ut — Ung + uus = 0. (9.178) 
This equation occurs when modeling turbulence. With the Hopf-Cole transformation it is transformed 
into the diffusion equation, i.e., into a linear differential equation. 
6. Kadomzev-Pedviashwili Equation 
The equation 
(te + θα + Urex)a = Uyy (9.179a) 
has the soliton 


ulr 8 1 Lm tik k2 2 l 
ulz, y, t) = 25; nla | je +i y — 3k ‘|| (9.179b) 


as its solution. The equation (9.179a) is an example of a soliton equation with a higher number of in- 
dependent variables, e.g., with two spatial variables. 

Remark: The CD-ROM to the 7th, 8th and 9th German editions of this handbook (see [22.8]) con- 
tains more non-linear evolution equations. Furthermore there is shown the application of the Fourier 
transformation and of the inverse scattering theory to solve linear partial differential equations. 


10 Calculus of Variations 
10.1 Defining the Problem 


1. Extremum of an Integral Expression 

A very important problem of the differential calculus is to determine for which x values the given func- 
tion y(r) has extreme values. The calculus of variations discusses the following problem: For which 
functions has a certain integral, whose integrand depends also on the unknown function and its deriva- 
tives, an extremum value? The calculus of variations concerns itself with determining all the functions 
y(x) for which the integral expression 


I|y] = / F(x, y(x), y (z),..., y? (z))dz (10.1) 


has an extremum, if the functions y(x) are from a previously given class of functions. Here, it ist possible 
to define some boundary and side conditions for y(x) and for its derivatives. 

2. Integral Expressions of Variational Calculus 

There can also be several variables instead of x in (10.1). In this case, the occurring derivatives are 
partial derivatives and the integral in (10.1) is a multiple integral. In the calculus of variations, mainly 
the following types of integral expressions are discussed: 


I|y] = [ Fe iG). G2) ae, (10.2 
Ts au] = | FG). ο. ο (10.3 
I|y] = [ Fo, ylz), y (x), ... y (a) dz, (10.4 


~= 


ul = / F(x, y, U, Uz, Uy) dx dy. (10.5 
Q 


Here the unknown function is u = u(x, y), and Q represents a plane domain of integration. 


Πα] = Jff F X,Y, σα, πα. Uy, Uz) dx dy dz. (10.6 
Z2 


The unknown function is u = u(x, y, ο), and R represents a space region of integration. Additionally, 
boundary values can be given for the solution of a variational problem, at the endpoints of the interva 
a and b in the one-dimensional case, and at the boundary of the domain of integration Q in the two- 
dimensional case. Besides, various further side conditions can be defined, e.g., in integral form or as a 
differential equation. 
A variational problem is called first-order or higher-order depending whether the integrand F contains 
only the first derivative y' or higher derivatives y? (n > 1) of the function y. 

3. Parametric Representation of the Variational Problem 

A variational problem can also be posed in parametric form. Considering a curve in parametric form 
x= x(t), y = y(t) (a € t € B), then, e.g., the integral expression (10.2) has the form 


Tey) [ FEO ult). (0.90) dt (10.7) 
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10.2 Historical Problems 


10.2.1 Isoperimetric Problem 
The general isoperimetric problem is to determine the plane region with the largest area among the 
plane regions with a given perimeter. The solution of this problem, a circle with a given perimeter, 
originates from queen Dido, who was allowed, as legend has it, to take such an area for the foundation 
of Carthago which she could be surround by one bull’s leather. She cut the leather into fine stripes, 
and formed a circle with them. 

A special case of the isoperimetric problem is to find the equation of 

the curve y = y(x) in a Cartesian coordinate system connecting the y y(x) 
points A(a, 0) and B(b,0) and having the given length l, for which 
the area determined by the line segment AB and the curve is the 
largest possible (Fig. 10.1). For the mathematical formalization a 
once continuously differentiable function y(x) is to be determined, 
such that 


0 A(a,0)  B(bO) x 
Figure 10.1 


b 
I|y] = EO dx = max (10.8a) 


a 


holds, where the side condition (10.8b) and the boundary conditions (10.56) are satisfied: 
b 
Gly] = [vi + y?(a) dx =l (10.8b) y(a) = y(b) =0 (10.8c) 


10.2.2 Brachistochrone Problem 


The brachistochrone problem was formulated in 1696 by J. Bernoulli, and it is the following: A point 
mass descends from the point Po(xo, yo) to the origin in the vertical plane x, y only under the influence 
of gravity. The curve y = y(x) along which the point reaches the origin in the shortest possible time 


from P» (Fig. 10.2) is to be determined. With the formula for the time of fall T, (see 11.5.1, p. 648), the 
following mathematical description is possible to determine a once continuously differentiable function 
y = y(x) for which 
VI +y? Yt 
i=j dx = min (10.9) dope i Po(Xo, yo) 
γ2ο(νο — v) | 
holds, (g is the acceleration due to gravity). The boundary value 
conditions are ys) | 
y(0) =0, y(xo) = yo- (10.10) 0 Χο x 
For x = xo in (10.9) there is a singularity. Figure 10.2 


10.3 Variational Problems of One Variable 


10.3.1 Simple Variational Problems and Extremal Curves 


A simple variational problem is to determine the extreme value of the integral expression given in the 
form 


I|y] = [Fe y(x), y (x)) dx, (10.11) 
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where y(x) is a twice continuously differentiable function satisfying the boundary conditions y(a) = A 
and y(b) = B. The values a,b and A, B, and the function F are given. 

The integral expression (10.11) is an example of a so-called functional. A functional assigns a real 
number to every function y(x) from a certain class of functions. 

If the functional I [y] in (10.11) takes, e.g., its relative maximum for a function yo(x), then 


Iyo] > Πν] (10.12) 


for every twice continuously differentiable function y satisfying the boundary conditions. The curve 
y = yo(x) is called an extremal curve. Sometimes all the solutions of the Euler differential equation of 
the variational calculus are called extremal curves. 


10.3.2 Euler Differential Equation of the Variational Calculus 


A necessary condition for the solution of the variational problem can be constructed by the help of an 
auxiliary curve or comparable curve for the extremal yo(x) characterized by (10.12) 


y(x) = ψο() + €n(x) (10.13) 
with a twice continuously differentiable function n(x) satisfying the special boundary conditions η(α) = 
n(b) = 0; eis a real parameter. Substituting (10.13) into (10.11) there is a function depending on € 
instead of the functional 7 |y] 

b 
1(ε) = | FG + ene) απ. (10.14) 
a 
The functional 7 [y] has an extreme value for yo(x) if the function (c), as a function of e, has an extreme 


value for ε = 0. Deducing the variational problem to an extreme value problem with the necessary 
condition 


τ =0 for «=0 (10.15) 
de 


and supposing that the function F, as a function of three independent variables, is differentiable as 
many times as needed, by the help of the Taylor expansion (see 7.3.3.3, p. 471) follows 


b 
OF OF ; 
1{ε) = / [βία ο + Dy ου F ay ^ yo, yo)e tr + O(€)) | d. (10.16) 
The necessary condition (10.15) results in the equation 
b b 
OF OF 
: dx + {ή dx = 0. 10.17 
145 πα T ay! ó ( ) 


Partial integration of this equation and considering the boundary conditions for n(x), gives 


b 
8F d (8F 
ας a (=) dx = 0. (10.18) 


From the assumption of continuity and because the integral in (10.18) must vanish for any considerable 


(x), 

OF d (OF 

E aser ασ rens 10. 
Oy dx (5) a (10:19) 


must hold. The equation (10.19) gives a necessary condition for the simple variational problem and it 
is called the Euler differential equation of the calculus of variations. The differential equation (10.19) 
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can be written in the form 


OF OF OF, OF 
Oy  OxOy — OyOy' ; Oy” d 


It is an ordinary second-order differential equation if Fy 7 0 holds. 
The Euler differential equation has a simpler form in the following special cases: 
Case 1: F(x,y,y') = F(y'), i.e., x and y do not appear explicitly. Then instead of (10.19) holds 
OF d (OF 
a 0 (10.21a) and T (55) =0. (10.21b) 
Case 2: F(x, y,y') = F(y, y), i.e., x does not appear explicitly. From 


d F- 98 OF ΠΝ OF Hn ,9F ,d (OF - 1) ar ο. (0 22a) 
dz "Oy, ay” Oy" "oy "deNOy) ” (ay de Vy di 


and because of (10.19) follows 


2 (r - v) =0, (10.22b) ie, Ε- T =c (c const) (10.22c) 


as a necessary condition for the solution of the simple variational problem in the case F = F(y, y). 


W A: The functional to determine the shortest curve connecting the points P; (a, A) and P»(b, B) in 
the z, y plane is: 


b 
Ify] = / V1l+y? dx = min. (10.23a) 


It follows from (10.21b) for F = F(y') = v1 + y? that 


d (55) — =Ù; (10.23b) 
dx \ Oy (vrxy?) 


so y” = 0, i.e., the shortest curve is the straight line. 
WB B: Connecting the points P; (a, A) and P»(b, B) by a curve y(x), and rotating it around the x-axis 
the surface area is 


b 
I[y] = 2r f υγ! 1- y? dz. (10.244) 


Which curve y(x) creates the smallest surface area? From (10.220) with F = F(y, y) = 27y/1 + y? 


: y\2 : 
follows y — zv +y? or y? = (2) — 1, where ει = = . This differential equation is separable 
(see 9.2.2.3, 1., p. 579), and its solution is 


y = & cosh (= + ca) (ει. C2 const), (10.24b) 
σι 


the equation of the so-called catenary curve (see 2.15.1, p. 107). The constants c; and c» can be de- 
termined from the boundary values y(a) — A and y(b) — B. So, it is to solve a system of non-linear 
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equations (see 19.2.2, p. 961), which cannot be solved for every boundary value. 


10.3.3 Variational Problems with Side Conditions 


These problems are usually isoperimetric problems (see 10.2.1, p. 611): The simple variational problem 
(see 10.2.1, p. 611), given by the functional (10.11), is completed by a further side condition in the form 


b 
i G(x, υ(α). ψ(α)) dz =1 (1 const) (10.25) 


where the constant / and the function G are given. A method to solve this problem is given by Lagrange 
(extreme values with side conditions in equation form, see 6.2.5.6, p. 456). Considering the expression 


A(x, y(x), y (x), X) = Εἴα, yle), y ()) + AG(@, y(x), y (2)) , (10.26) 


where λ is a parameter, and solving the problem 


b 
n (x, y(x), y (x), A)dx = extreme!, (10.27) 
a 

as an extreme value problem without side condition. The corresponding Euler differential equation is: 
OH d (OH 
u— — — — 0. (10.28) 
Oy | dr VOy 


The solution y = y(x, A) depends on the parameter A, which must be determined by substituting y(x, A) 
into the side condition (10.25). 
W For the isoperimetric problem 10.2.1, p. 611, one gets 


H (v, y(x), y (x), ) 2 y - AJ1- y^. (10.294) 
Because the variable x does not appear in H, instead of the Euler differential equation (10.28), analo- 
gously to (10.22c), one gets the differential equation 


f Bleau 
FAV y? poy? = ΕΞΩ (ci const), (10.29b) 
/ 14 a €c1— y 
whose solution is a family of circles 
(z=)? + (y-a) =A? (c&,c2, AÀ const). (10.299) 
The values ει, c» and À are determined from the conditions y(a) = 0, y(b) = 0 and from the requirement 


that the arclength between A and B should be l. The result is a non-linear equation for A, which should 
be solved by an appropriate iterative method. 


10.3.4 Variational Problems with Higher-Order Derivatives 
There are two types of problems to be considered here. 
1. F= F(x,y, y’, y”) 
The variational problem is: 
b 
I|y] = [ Fe y. y, y") dx = extreme! (10.304) 
with the boundary values 


yla)=A, y(b)=B, y()-A, y()- B, (10.30b) 
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where the numbers a,b, A, B, A’, and B’, and the function F are given. Similarly as in 10.3.2, p. 612, 
introducing comparable curves y(x) = yo(x) + en(x) with η(α) = n(b) = n'(a) = (b) = 0, yields the 
Euler differential equation 
OF d (OF d (OF 
; + > 0 (10.31) 
Oy | dx \ dy! dz? V Oy" 
as a necessary condition for the solution of the variational problem (10.30a). The differential equation 


(10.31) represents a fourth-order differential equation. Its general solution contains four arbitrary con- 
stants which can be determined by the boundary values (10.30b). 


W Consider the problem 


I|y] = fw ay” — By?) dx = extreme! (10.32a) 
with the iven constants a and 8 for F = F(y,y', y") = y? — ay? — By?. Then: E, = —28y, 
Fy = —2ay', Fy = 2y", E (Fy) = -2oy", ia (Fy) = 2y, and the Euler differential equation is 

y + ay" — By — 0. (10.32b) 


This is a fourth-order linear differential equation with constant coefficients (see 9.1.2.3, p. 553). 


2. F m F(z,y y... y) 
In this general case, when the functional I [y] of the variational problem depends on the derivatives of 
the unknown function y up to order n (n > 1), the corresponding Euler differential equation is 


OF d OF d? OF απ OF 
t eed pe ] 10. 
Oy απ 7») dx? (25) ;. da^ (zs) 0 (10.33) 


whose solution should satisfy the boundary conditions analogously to (10.30b) up to order n — 1. 


10.3.5 Variational Problem with Several Unknown Functions 
Suppose the functional of the variational problem has the form 


b 

I|yi. Y2,- - -, Yn] = [ Fe Vis Ρατ. (10.34) 
where the unknown functions yi(x), yo(x), ..., ya (x) should take given values at x = a and x = b. 
Considering n twice continuously differentiable comparable functions 

yi(x) = yio(x) + eimm(v) (i — 1,2,...,m), (10.35 
where the functions 7;(x) should vanish at the endpoints, (10.34) becomes I (e1, €», .. . , €n) with (10.35). 
From the necessary conditions 

OI 

—=0 (¢=1,2,...,n) (10.36 

δε; 
for the extreme values of a function of several variables, follow the n Euler differential equations 

OF d [δε OF d (OF OF 1 (OF 

a | =0, πη] =0, 2a x—-— = 0, (10.37 

Oy, dx VOyi Oy; dx \ dys Oy, dx VOy, 


whose solutions yi (x), yo(”),..., yn (x) must satisfy the given boundary conditions. 


10.3.6 Variational Problems using Parametric Representation 


For some variational problems it is useful to determine the extremal, not in the explicit form y — y(x), 
but in the parametric form 
c=x(t), y=y(t) (t €t&t5), (10.38) 
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where {1 and {2 are the parameter values corresponding to the points (a, A) and (b, B). Then the simple 
variational problem (see 10.3.1, p. 611) is 
t2 
Ile, y] = J F (x(t), y(t), i(t), g(t))dt = extreme! (10.39a 
tı 

with the boundary conditions 

z(tı) =a, απο) =b, y(ti)=A, y(te) — B. (10.39b 
Here i and ἡ denote the derivatives of x and y with respect to the parameter t, as usual in the parametric 
representation. 
The variational problem (10.39a) makes sense only if the value of the integral is independent of the 
parametric representation of the extremal curve. Το ensure the integral in (10.39a) is independent ο 
the parametric representation of the curve connecting the points (a, A) and (b, B), F must be a positive 
homogeneous function of first order, i.e., 

F(x,y, uà, uj) = pF (x,y, 5.) (μ 0) (10.40 
must hold. 
Because the variational problem (10.39a) can be considered as a special case of (10.34), the correspond- 
ing Euler differential equations are 


OF d (OF OF d (OF 
ania -|-0. 10.41 

Or dt ( Oi ) 0 ὃν dt ( Oy ) a ae 
They are not independent of each other, but they are equivalent to the so-called Weierstrass form o 
the Euler differential equation: 


OF BF 
&wp 24, ο. =0 [0558 
with 
92 2 2 
mel OP 1 OF 19F Hane 


jy? Ox? ry Ordy i? ϑῇλ᾽ 
Analogous to the calculation of the radius of curvature R of a curve given in parametric representation 
(see 3.6.1.1, 1., p. 243), the calculation of the radius of curvature of the extremal curve is to be made, 
considering (10.42a), with 
(e ny 
Tj — 20 


M (i? | py? 


τ Fiy = Fry 


(10.42c) 


W The isoperimetric problem (10.8a to 10.8c) (see 10.2.1, p. 611) has the form in parametric represen- 
tation: 
t2 


" 
Παιν] = y(t)a(t)dt = max! (10.43a) with Γ V à? (t) + y?(t) dt — LL. (10.43b) 
Jti 


Jt 


This variational problem with the side condition becomes an unconstrained variational problem ac- 
cording to (10.26) with 


H = H(xz,y, t, ἢ) = yt + Ay 32 + 02. (10.43c) 
H satisfies the condition (10.40), so it is a positive homogeneous function of first degree. Furthermore, 


M = a = A: 
(+P) 


j? 
holds, so (10.426) yields the radius of curvature R = |A|. Since A is a constant, the extremals are circles. 


πρι Hiy=1, Ha = 0, (10.43d) 
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10.4 Variational Problems with Functions of Several Variables 
10.4.1 Simple Variational Problem 


One of the simplest problems with a function of several variables is the following variational problem 
for a double integral: 


Πα] = / F(x,y, u(x, y), Ux, uy) dx dy = extreme! . (10.44) 
(c) 
The unknown function u = u(x,y) should take given values on the boundary I of the domain G. 
Analogously to 10.3.2, p. 612, a comparable function is to be introduced in the form 
u(x, y) = uo(x, y) + εη(α, y). (10.45 
where uo(z, y) is a solution of the variational problem (10.44), which takes the given boundary values, 
while n(x, y) satisfies the condition 
"n(x,y) Ξ 0 onthe boundary I’. (10.46 
n(x, y) together with uo(x, y) both are differentiable as many times as needed. The quantity e is a 
parameter. 
Next, a surface is to be determined by u = u(x, y), which is close to the solution surface uo(z, y). Πα 
becomes J(e) with (10.45), i.e., the variational problem (10.44) becomes an extreme value problem 
which must satisfy the necessary conditions 
dI 
ΠΠ 
From this follows the Euler differential equation 
OF ὃ 9Ρ ὃ (OF 
- ; =0 (10.48 
Ou Ox \ ður Oy \Ouy 
as a necessary condition for the solution of the variational problem (10.44). 
W A free membrane, fixed at the perimeter 7" of a domain G in the x, y plane, covers a surface with 
area 


0 fo ε-0. (10.47 


he / / dx dy. (10.49a) 
(G) 


If the membrane is deformed by a load so that every point has an elongation u = u(x,y) in the z- 
direction, then its area is calculated by the formula 


h= / ν1!-ω2-- u? dz dy. (10.49b) 
(G) 


Linearizing the integrand in (10.49b) and using Taylor series (see 6.2.2.3, p. 449), one gets the relation 
1 j : 
hxh+z Í if (u2 +u) de dy. (10.196) 
(G) 


For the potential energy U of the deformed membrane holds 


ζ -- σ(12 -- Γι) -- = If (u + us) dz dy, (10.49d) 
(G) 


where the constant σ denotes the tension of the membrane. In this way arises the so-called Dirichlet 
variational problem: The function u = u(x, y) is to be determined so that the functional 


Πα] = / (u? + u?) dx dy (10.49e) 


(c) 
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should have an extremum, and u vanishes on the boundary I’ of the plane domain G. The corresponding 
Euler differential equation is 


9 9 
+ ui =0; (10.49f) 


It is the Laplace differential equation for functions of two variables (see 13.5.1, p. 729). 


10.4.2 More General Variational Problems 

Two generalizations of the simple variational problem are to be considered here. 

1. F= F(x, Y, u(x, y), Ug, Wys Ίσα» Unys Uyy) 

The functional depends on higher-order partial derivatives of the unknown function u(x, y). If the 
partial derivatives occur up to second order, then the Euler differential equation is: 


OF @ (aF ὃ (OF δ᾽ ( OF a ( aF δ᾽ ( aF 
7 7 ERR: | dee s =0. (10.50) 
Ou Ox \ ður Oy \Ouy Ox? VOuss ὅσϑῃ VOus, ὅν” NOuy, 


2. PSF (ti 92 in (iy sss Ba) pa sees Mas) 
In the case of a variational problem with n independent variables z1, z2,...,:,, the Euler differential 
equation is: 


ο P» (το (10.51) 


Ou <= Ory \ Our, 


10.5 Numerical Solution of Variational Problems 

Most often two ways are used to solve variational problems in practice. 

1. Solution of the Euler Differential Equation and Fitting the Found Solution to 
the Boundary Conditions 

Usually, an exact solution of the Euler differential equation is possible only in the simplest cases, so 

numerical methods are to be used solving the boundary value problem for ordinary or for partial dif- 

ferential equations (see 19.5, p. 976 or 20.3.4, p. 10424). 

2. Direct Methods 

The direct methods start directly from the variational problem and do not use the Euler differential 

equation. The most popular and probably the oldest procedure is the Ritz method. It belongs to the so- 

called approximation methods which are also used to get approximate solutions of differential equations 

(see 19.4.2.2, p. 974 and 19.5.2, p. 977). The following example demonstrates this method. 

W Solve numerically the isoperimetric problem 


1 1 
[ y (x) dx = extreme! (10.52a) for [ y'(r)dr —1 and y(0)=y(1)=0. (10.52b 
Jo Jo 


The corresponding variational problem without integral side condition according to 10.3.3, p. 614, is: 
1 
I|y] = [ [y^ (2) — dy? (α)] dx = extreme! (10.52c 
Jo 
As starting approach to find an approximation solution can be used 
να) = aax(x — 1) 4 Ty (10.52d 
Both approximation functions z(x — 1) and z?(z — 1) are linearly independent, and satisfy the boundary 
conditions. Reducing (10.52c) with (10.52d) gives 


Y % Il. 1 
a? + TERE + gee) : (10.52e 


T 2 i 
F(a1,a2) = καὶ + 5% + <a a2 λ{ 


10.6 Supplementary Problems 619 


and the necessary conditions 2 ot - -&- = 0 result in he homogeneous linear equation system 


20A i À A 4 9λ 
E ΧΕ žes ς a)^*($ im) = uae 


This system has a non-trivial solution only if the determinant of the coefficient matrix is equal to zero: 

A? — 52A +420 =0, ie, à= 10, àz = 42. (10.52g 

For À = A, = 10 from (10.52f) follows a» = 0, αι arbitrary, so the normalized solution belonging to 
λι = 10 is: 

= 5.48x(x — 1). (10.52h 

Το make a comparison, consider the Euler differential equation belonging to (10.528). Here the bound- 

ary value problem 


y"+Ay=0 with y(0)—y(1)-0 (10.52i 
holds with the eigenvalues A; = k?z? (k = 1,2,...) and the solution yp = cp sin πα. The normalized 
solution, e.g., for the case k = 1, i.e., Ay = 2 = 9.87 is 

y = V2sin re, (10.52 


which is really very close to the approximate solution (10.52h). 


Remark: With today's level of computers and science first of all, the finite element method (FEM 
is to be applied for numerical solutions of variational problems. 

The basic idea of this method is given in 19.5.3, p. 978, for numerical solutions of differential equations. 

'The correspondence between differential and variational equations will be used there, e.g., by Euler 
differential equations or bilinear forms according to (19.146a,b). 

Also the gradient method can be used for the numerical solution of variational problems as an efficient 
numerical method for non-linear optimization problems (see 18.2.7, p. 936). 


10.6 Supplementary Problems 
10.6.1 First and Second Variation 


The derivation of the Euler differential equation with a comparable function by the help of the Taylor 
expansion of the integrand (see 10.3.2, p. 612) was stopped after the linear terms with respect to e: 
b 
I(c) = Ec yo + en, yo + ej) dz . (10.53) 
a 
Considering also quadratic terms yields 
b 
OF OF 
ΚΘ - 1(0) πο ο (1054) 
Oy Oy 


a 


OF OF 
h(E (x, yo. uo) + 2 TUM o s Yo: yo) nm + aya” yo. on” + Ο(ε)] d 


Denoting as 
1. Variation 6/ of the functional /[y] the expression 


b 
OF 
a= [[ (£, νο, YN + Dy ——3À;Ét. yo. wf dx and as (10.55) 
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2. Variation ὁ” [ of the functional I[y] the expression 


b E c o 
, OF j OF OF , 
l= [ [5,56 m i *?5,5y E f» θθ)ηη' + aya E Yo i dx, (10.56) 


a 


then one can write: 
2 
1(ε) -- I(0) ~ cl + P. (10.57) 


By the help of these variations different optimality conditions for the functional / [y] can be formulated 
(see [10.7]). 


10.6.2 Application in Physics 


Variational calculus holds its solid place in physics. So, e.g., the fundamental equations of the Newto- 
nian mechanics can be derived from a variational principle and in this way one can come the Hamilton- 
Jacobi theory. Variational calculus is also important in both atomic theory and quantum physics. It is 
obvious that the extension and generalization of classical mathematical notions is undoubtedly neces- 
sary. Therefore the calculus of variations must be discussed today by modern mathematical disciplines, 
e.g., functional analysis and optimization. Unfortunately, in this book it is not possible to give more 
than a brief account of the classical part of the calculus of variations (see [10.4], [10.5], [10.7]). 


11 Linear Integral Equations 


11.1 Introduction and Classification 
1. Definitions 


An integral equation is an equation in which the unknown function appears under the integral sign. 
There is no universal method for solving integral equations. Solution methods and even the existence 
of a solution depend on the particular form of the integral equation. 
An integral equation is called linear if linear operations are performed on the unknown function. The 
general form of a linear integral equation is: 
d(x) 
g(x)e(x) = f(x) +A / K(x, y)y(y)dy, εσας. (11.1) 
a(x) 
The unknown function is y(x), the function K(x, y) is called the kernel of the integral equation, and 
f(x) is the so-called perturbation function. These functions can take complex values as well. The 
integral equation is homogeneous if the function f(x) is identically zero over the considered domain, 
i.e., f(x) = 0, otherwise it is inhomogeneous. A is usually a complex parameter. 
Two types of equation (11.1) are of special importance. If the limits of the integral are independent of 
x, i.e., a(x) = a and b(x) = b, it is called a Fredholm integral equation (11.2a,11.2b). 


If a(x) = a and b(z) = z, it is called a Volterra integral equation (11.26, 11.24). 
If the unknown function y(x) appears only under the integral sign, i.e., g(x) = 0 holds, one speaks of 
an integral equation of the first kind as (11.2a), (11.2c). The equation is called an integral equation of 
the second kind if g(a) = 1 as in (11.2b), (11.24). 
b b 
0= f(x) + A f Ke. vol) dy, (11.34) p(x) = f(x) + A f KG eto dy, (11.2b) 
0= f(x) +A [K@.nel) dy, (1130) p(x) = f(x) +A | Kene) dy. (11.324) 


2. Relations with Differential Equations 

The problems of physics and mechanics relative rarely lead directly to an integral equation. These 
problems can be described mostly by differential equations. The importance of integral equations is 
that many of these differential equations, together with the initial and boundary values, can be trans- 
formed into integral equations. 

W From the initial value problem y'(x) = f(x,y) with x > xp and y(xo) = yo by integration from xo 
to one gets 


ule) = mt [^ FENO). (11.3) 


The unknown function y(x) appears on the left-hand side of (11.3) and also under the integral sign. The 
integral equation (11.3) is linear if the function f (€, y(€)) has the form f(E, n(£)) = a(£) y(£) + b(£), 
i.e., the original differential equation is also linear. 

Remark: This chapter 11 deals with only integral equations of the first and second kind of Fredholm 
and Volterra types, and also with some singular integral equations. 
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11.2 Fredholm Integral Equations of the Second Kind 
11.2.1 Integral Equations with Degenerate Kernel 


If the kernel K (x, y) of an integral equation is the finite sum of products of two functions of one variable, 
i.e., one depends only on x and the other one only on y, it is called a degenerate kernelor a product kernel. 
1. Solution in the Case of a Degenerate Kernel 


The solution of a Fredholm integral equation of the second kind with a degenerate kernel leads to the 
solution of a finite-dimensional system of equations. Consider the integral equation 


b 
p(x) = f(x) + A f Klevel) dy with (11.4a 
K(x,y) = αι(α)θι(υ) + os(x)Bo(y) +... + an(x) Bn (y). (11.4b 
The functions a;(x),..., o (x) and fi(x), ..., 8n(a) are given on the interval [a, b] and are supposed to 
be continuous. Furthermore, the functions o (x), ... , o4, (z) are supposed to be linearly independen 
of one another, i.e., the equality 
n 
X carle) = 0 (11.5 
k=1 
with constant coefficients c; holds for every x in [a,b] only ife; = c = ... = c, = 0. Otherwise, 


K (x, y) can be expressed as the sum of a smaller number of products. 
From (11.4a) and (11.4b) follows: 


b b 
ία) = f(a)  Aex(2) f A G)ey) dy +.. + Aala) | ιο) ο(ν) dv. (11.62) 


The integrals are no longer functions of the variable x, they are constant values. Let's denote them by 
Ay: 
b 
Ay = πο μμ ον (11.67) 
a 
The solution function y(x), if any exists, is the sum of the perturbation function f(x) and a linear 
combination of the functions o4 (x), ... , a (x): 


p(x) = f(x) + λΑιαι(α) + AAgas(z) +... + λΑπαπ(α). (11.6c) 


2. Calculation of the Coefficients of the Solution 
The coefficients 41, .. . , An are calculated as follows. Equation (11.6c) is multiplied by 8,(x) and its 
integral is calculated with respect to x with the limits a and b: 

b b b b 

EXT dx = [ ^f) dx + AA, | oale) dx+...+AAn EXON dx. (11.7a) 


The left-hand side of this equation is equal to Aj, according to (11.6b). Using the following notation 


b b 
bk = [ 656) dr and chy = no da: (11.7b) 
there is fork = 1,...,m: 


Ag = bk + λομιΑι + λοωΑο +... + ACknAn- (11.7c) 
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It is possible that the exact values of the integrals in (11.7b) cannot be calculated. When this is the 
case, their approximate values must be calculated by one of the formulas given in 19.3, p. 963. The 


linear system of equations (11.7c) contains n equations for the unknown values A;,..., An: 
(1 — Aen) At -λειο Ag— ... --λόιι An = b1, 
λωι Ay +(1 — λω2)4Α1 --... —Acon An = b2, 
2 Αι +( 22) Λο 2 2 (11.74) 
—Ac nl Αι Àc n2 Ap t (1 enn) An b, 


3. Analyzing the Solution, Eigenvalues and Eigenfunctions 
It is known from the theory of linear systems of equations that (11.7d) has one and only one solution 
for A1,..., An if the determinant of the matrix of the coefficients is not equal to zero, i.e., 


(1— Aen) -λεια ... —XCin 
—Aco, (1— €) ... —Acon, 

D(A) = a ( 22) a] oz (11.8) 
—XCni -λόπο ... {1 -- Aena) 


Obviously D(A) is not identically zero, as D(0) = 1 holds. So there is a number R > 0 such that 
D(A) # Oif [À| < R. For further investigation two different cases are considered. 
Case D(A) Æ 0: 
The integral equation has exactly one solution in the form (11.6c), and the coefficients Aj,..., An are 
given by the solution of the system of equations (11.7d). If (11.4a) is a homogeneous integral equation, 
ie. f(x) = 0, then bj = b =... = b, = 0. Then the homogeneous system of equations (11.7d) has 
only the trivial solution ΑἹ = A» =... = A, = 0. In this case only the function (x) = 0 satisfies the 
integral equation. 
Case D(A) = 0: 
D(A) is a polynomial of no higher than n-th degree, so it can have at most n roots. For these values 
of À the homogeneous system of equations (11.7d) with b; = by = ... = b, = 0 also has non-trivial 
solutions, so besides the trivial solution y(x) = 0 the homogeneous system of equations has other 
solutions of the form 

glx) = C - (Ajay (x) + Αχαο(α) 4-...-- Anay(x)) (C is an arbitrary constant.) 
Because o4(x),..., o, (x) are linearly independent, (x) is not identically zero. The roots of D(A) 
are called the eigenvalues of the integral equation. The corresponding non-vanishing solutions of the 
homogeneous integral equation are called the eigenfunctions belonging to the eigenvalue A. Several 
linearly independent eigenfunctions can belong to the same eigenvalue. If the integral equation has a 
general kernel, then are be to considered all values of À eigenvalues, for which the homogeneous integral 
equation has non-trivial solutions. Some authors call the A with D(A) = 0 the characteristic number, 


1 b 
and u = 3 is called the eigenvalue corresponding to an equation form µῳφ(α) = / K (v, y)e(y) dy. 
Ja 


4. Transposed Integral Equation 

Now it is necessary to investigate the conditions under which the inhomogeneous integral equation will 
have solutions if D(A) = 0. For this purpose the transposed integral equation (or adjoint in the complex 
case) of (11.4a) is introduced: 


W(x) = g(x) +A | Klu, coy) dy. (11.9a) 


Let λ be an eigenvalue and (x) a solution of the inhomogeneous integral equation (11.4a). It is easy 
to show that A is also an eigenvalue of the adjoint equation. Now multiply both sides of (11.4a) by any 
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solution (x) of the homogeneous adjoint integral equation and evaluate the integral with respect to x 
between the limits a and b: 


[ome a) dx = / f (z)v(x) dx ZI [κι z, y)v(a D ply) dy. (11.9b) 


Assuming that ij(y) = f K(x, y) (x) dx, then f" f(x)w(x) dz = 0 holds. 


That is: The inhomogeneous integral equation (11.4a) has a solution for some eigenvalue λ if and only 
if the perturbation function f (x) is orthogonal to every non- vanishing solution of the homogeneous ad- 
joint integral equation belonging to the same A. This statement is valid not only for integral equations 
with degenerate kernels, but also for those with general kernels. 


4. : ; 
MA: g(t) = υ-- L (αἲν + ay? — zy)e(y) dy, alz) = a?, αὐ(α) = v, αλα) = --ᾱ, fi(y) = 


y, Baly) = y^, βι(υ) = y. The functions o; (x) are linearly dependent. Therefore one transforms the 


4. 
integral equation into the form y(x) = x 4 / [xy + z(y? — y)|e(y) dy. For this integral equation 
-1 


αι(α) = 2°, a(g ) = x, Bily) =y, Poly) = y? — y holds. If any solution y(x) exists, it has the form 
g(r) = £ + Aya? + Αγα. 


44. , BE a 2 ERE s 2 
eu f; x? dx = 0, ο. = 3? b= 2 a dx = 5, 
+1 2 , 9 - , 2 
C21 E (x! — a?) dx 5) C22 fv. (οὗ a?) dx 3 by I, (a? x?) dx 3° 
With these values the system of equations to determinate A; and As has the form: A; — 2A» = 
2 2 2 2 1 2 
3? παι | (1 | =) Ag = τη which in turn yield that A; = = Ag = EC and y(x) = z + 


10, 2 
lr — τας q 
21 T 21 7 
WB B: p(x) =a xf sin(x + y)p(y) dy, i.e.: και) = sin(x + y) = sin z cosy + cos xsin y, p(x) 
0 
a Asia f cosy p(y) dy + Acosx [^ sin y p(y) dy. 
0 0 


T " T T 
on = [ sinrcosrdr —0, Cy. = ie cos? x dx = PL b= [ xcosxdx = —2, 
Jo Jo Jo 


on = [sin ede = 7, ca = | coszsinzdz = 0, b= | csinrdr-. 
0 2 0 0 
T T 
With these values the system (11.7d) is Αι — A 7 A» 2, -λ 9 Αι + Ag = m. It has a unique solution 
2 
π π΄ 
1 -A= λ---2 i-X 
for any À with D(A) = τ 2]-1-λ᾽- 10 SoA, = —2—, A = αἱ ) , and 
“Az 1 | Lee ο 
2 4 4 
À 2 
the solution of the integral equation is y(x) = «4 z (s 2) sina + 7(1— A) cos]. The 
ie yT 2 
. , . 2 2 
eigenvalues of the integral equation are Ay =—, A2=——. 
π π 


T 
The homogeneous integral equation p(x) = Ax / sin(x + y)g(y) dy has non-trivial solutions of the 
0 
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2 
form p(x) = Ak(Aisinz + Agcosx) (k = 1,2). For 4; = — holds A; = Az, and with an arbitrary 
π 


2 

constant A it follows φι(α) = A(sin x + cos x). Similarly for 4 = —— holds φο(α) = B(sin x — cos x) 
π 

with an arbitrary constant B. 


Remark: This previous solution method is fairly simple but it only works in the case of a degenerate 
kernel. This method can be used to get a good approximate solution in the case of a general kernel too 
if it is possible to approximate the general kernel by a degenerate one closely enough. 


11.2.2 Successive Approximation Method, Neumann Series 
1. Iteration Method 
Similarly to the Picard iteration method (see 9.1.1.5, 1., p. 549) for the solution of ordinary differential 


equations, an iterative method needs to be given to solve Fredholm integral equations of the second 
kind. Starting with the equation 


P(e) = f(x) +A f Ks.) ἀν. (1140) 


one defines a sequence of functions qo(x), yi(x), Yo(x),.... Let the first be φρ(α) = f(x). The subse- 
quent o; (x) can be get by the formula 


b 
exa) = f(a) ο dy (n ο ο (11.113) 
Following the given method the first step is 
b 
vir) = f(a) +A f KG.) flu) ἀν. (11.11b 


According to the iteration formula this expression of p(y) is substituted into the right-hand side ο 
(11.10). To avoid the accidental confusion of the integral variables, y is denoted by η in (11.11b). 


b b 
pale) = fa) +A f KG) 19) + f και) d dy (1116 
^ ' bb 
= f(x)+r / K (x,y) f(y) dy 4- X EC y) K (y. m) f (y) dy dn. (11.11d 


b 
Introducing the notation of ια, y) = Κ(α, y) and Κο(α, y) = / K (x, £) K (£, y) d£, and renaming n 


as y, one can write y2(x) in the form 


b b 
pale) = f(a) +A f Kis.) FG) ἀν +X f Kale, y) f(y) dv. (11.116) 
Denoting 
b 
Ky (x, y) = [ KG. oi, y)dé (n=2,3,...) (11.11f) 


a 
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then there is the representation of the n-th iterated v; (x): 


b b 
a(t) = f(z) +A / K,(2,y) f(y) dy +... +A" / Kyla, y) f(y) dy. (11.118) 


Km, y) is called the n-th iterated kernel of K (x, y). 
2. Convergence of the Neumann Series 
To get the solution y(x), it is to be discussed the convergence of the power series of A 
x b 
f(x) + X A” | Kale, y) f(y) dy, (11.12 
n=l à 
which is called the Neumann series. If the functions K (x, y) and f(x) are bounded, i.e., the inequalities 
[K(x,y) <M (a<a<b, a<y<b) and |f(z) <N (a<a<b), (11.13a 
hold, then the series 


NX AMQ- a)|" (11.13b 


n=0 


is a majorant series for the power series (11.12). This geometric series is convergent for all 


1 
την, 
Al M(b — a) 
The Neumann series is absolutely and uniformly convergent for all values of A satisfying (11.13c). By 
a sharper estimation of the terms of the Neumann series one can give the convergence interval more 
precisely. According to this, the Neumann series is convergent for 


1 


b b 
[FIG v) 


aa 


(11.13c 


|A| < (11.13d) 


|? dx dy 


This restriction for the parameter A does not mean that there are no solutions for any |A| outside the 
bounds set by (11.13d), but only that one cannot get it by the Neumann series. The expression 


D(z,y; 3) = 35A μία, y) (11.142) 


n=1 


is called the resolvent or solving kernelof the integral equation. Using the resolvent one gets the solution 
in the form 


ple) = fla) +A f Gv f(y) ay. (11.145) 


1 
W For the inhomogeneous Fredholm integral equation of the second kind y(x) = «+A / xy p(y) dy fol- 
0 


1 Ï 1 " 
lows Kı (x,y) = xy, Ko(x,y) = / τη ην dy = 559, Γαία) = 638, ser Kalag) = A and from 
0 


3n-1 
œ yn 


λ 
this l'(z, y; A) = xy (È x) With the limit (11.13c) the series is definitely convergent for |A| < 1, 


n=0 
Try 
o3) 
do 
3 


because 


K(a,y)| < M = 1 holds. The resolvent I(x, y; A) = is a geometric series which is 
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1 
convergent even for |A| < 3. Thus from (11.14b) follows y(x) = x + A Í 


Remark: If for a given A the relation (11.134) does not hold, then any continuous kernel can be de- 
composed into the sum of two continuous kernels K(x, y) = K(x, y) + K?(z, y), where Τα, y) is a 
degenerate kernel, and K?(x, y) is so small that for this kernel (11.13d) holds. In this way one has an 
exact solution method for any À which is not an eigenvalue. 


11.2.3 Fredholm Solution Method, Fredholm Theorems 
11.2.3.1 Fredholm Solution Method 


1. Approximate Solution by Discretization 
A Fredholm integral equation of the second kind 


ple) = fla) +A f Κία.υλρίν) dy (11.15) 


can be approximately represented by a linear system of equations. It should be assumed that the func- 
tions K(x, y) and f(x) are continuous for a € x € b, a € y € b. 

The integral in (11.15) should be approximated by the so-called left-hand rectangular formula (see 
19.3.2.1, p. 964). It is also possible to use any other quadrature formula (see 19.3.1, p. 963). An equidis- 
tant partition 


b— 
yk =a+(k—-1)h (K=1,2,---,n, h= η (11.163) 
yields the approximation 
p(x) 55 f(x) + Ah μΚ(α, ψι)φ(νι) +... + K(x, yn)P(Yn)] - (11.16b) 
Replacing y(x) in this expression by a friction p(x) exactly satisfying (11.16b) yields: 
σα) = f(x) + Ah [K (z.yi)gQA) +... + Κία, ψα)φ(νι)]. (11.16c) 
To determine this approximate solution, it is necessary to know the substitution values of p(x) at the 
interpolation nodes x, = a + (k — 1)h. Substituting £ = T1, T = $9,..., Ὁ = Tn into (11.16c), yields 


a linear system of equations for the required n substitution values of ία). Using the short-hand 
notations 


Kin = K(£j Yk), ex = P(e), fr = f (£x) (11.17a) 
this system has the form 
(1 — Ah Ka): -ληΚιοφα- ... -λη ΚΗ Yn = fi, 
—AhKa φι +(1 — M3) — ... —Ah Ko, Pn = fr, (11.17) 
—Ah KE, i —AhKn2 p2 — ... (1 — ARKnn) Gn = fa. 
This system has the determinant of the coefficients 
(1 — Ahi) —ARhK;, ... —ARh Ky, 
B —ANKg, (1— λος) ... — Ah Kos, (11.176) 
—ARhK,i —AhKn2 ... (1— Ah) 


This determinant has the same structure as the determinant of the coefficients in the solution of an 
integral equation with a degenerate kernel. The system of equations (11.17b) has a unique solution 
for every \ where D, (A) Z 0. The solution gives the approximate substitution values of the unknown 
function q (x) at the interpolation nodes. The values of A with D,,(A) = 0 are approximations of the 
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eigenvalues of the integral equations. The solution of (11.17b) can be written in quotient form (see 
Cramer rule, 4.5.2.3, p. 311): 


r= R2 (xy Rod sas i 11.18 
Pk ΠΧ} eux), TIL ( ) 
Here follows D* (A) from D, (A) by replacing the elements of the k-th column by fi, fa,---, fn- 

2. Calculation of the Resolvent 

If n tends to infinity, so does the number of rows and columns of the determinants DF(A) and D,,(A), 
as well. The determinant 


D(A) = lim D,(A) (11.194) 
is used to get the solution kernel (resolvent) I(x, y; A) (see 11.2.2, p. 625) in the form 
D(x, yi λ) 
Γία,ψ;λ) = ------- 11.191 
ΦΑΞ E (11.19) 


It is true that every root of D(A) is a pole of I (x,y; A). Exactly these values of A, for which D(A) = 0, 
are the eigenvalues of the integral equation (11.15), and in this case the homogeneous integral equation 
has non-vanishing solutions, the eigenfunctions belonging to the eigenvalue A. In the case of D(A) 4 0, 
knowing the resolvent I(x, y; A), an explicit form of the solution is: 


b b 
λ 
ία) = fla) +A f Γαλ) dy = f) + σοι [ DG FO) ay. (11.19¢) 
To get the resolvent, one needs the power series of D(x, y; A) and D(A) with respect to A: 


D(zyX iC D Keys A" (11.202) 
D(A) X 1rd, A 


n=0 


D(a,y;A) = 


where dọ = 1, Ko(r,y) = K(r, y). One gets the further coefficients from the recursive formula: 


b b 
1 
dom / Reside κ db do / K(x,t)Kn1(t, y) dt. (11.20b) 
TL 


Ed 
2 


ΒΑ: g(x) 2 sinz Ελ [ sin x cos y p(y) dy. The exact solution of this integral equation is 
Jo 


2 

p(x) = J-J sinz. For n = 3 with 7; = 0, πο 5 v3 3 h s gives 

1 0 0 
D(A) xaX a (: E Αν 52 M. oos 

3 = 12 24 24 = . X 2.200 
24 192 12 νϑπ 
Υ3λπ 3λπ 1 V3AT 
12 24 24 


is an approximation of the exact eigenvalue À = 2. From the first equation of the system of equations 
(11.17b) for fı = 0 follows the solution yı = 0. Substituting this result into the second and third equa- 


V3AT AT 1 3AT V3AT 
p2 1 Qa = 


24 2479 79 Ty” 24 


-= » Φα Ξ ve .IfA=1,theny,; =0, w= 
a- “A gd d 


tion gives the system of equations: (: 


3 ; ; 
us This system has the solution p2 
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0.915, p3 = 1.585. The substitution values of the exact solution are: y(0) = 0, y (=) =], oy e = 
1.732. 


In order to achieve better accuracy, the number of interpolation nodes needs to be increased. 


BB: p(x )-s«af. (4ry — z?)g(y) dy; do — 1, Κοία,ν) 2 4xy — α”, di = [ae dx 1, 


4 4 1 
κια, y) iy x? [ (4xt — a?) (4ty — 1?) dt = + 2a?y — =a? — αχ, dy hs Μι(α,α) dx = —, 
3 3 2 Jo 18 


Ko(x,y) = πάν x’) — fx K (x, t) Κιν) dt = 0. With these the values da, Ks(x, y) and all the fol- 


lowing values of αι and (x, y) are equal to zero. T(x, y; A) = 


λ2 
From 1 — à + — = 0 the two eigenvalues λι» = 9 + 3V7 follow. If λ is not an eigenvalue, then the 


18 
--- 3x(2\ — 3Ax + 6 
solution is p(x) = x +A L T (x,y; A) f(y) dy = πο . 


11.2.3.2 Fredholm Theorems 


For the Fredholm integral equation of the second kind 


p(z) = f(x) + Af K(v,y)ey) dy (11.21a) 


the correspondent transposed integral equation is given by 


W(x) = g(x) +A | Κίυ. coy) dy. (11.21b) 


For this pair of integral equations the following statements are valid (see also 11.2.1, p. 622). 


1. A Fredholm integral equation of the second kind can only have finite or countably infinite eigenval- 
ues. The eigenvalues cannot accumulate in any finite interval, i.e., for any positive R there are only a 
finite number of A for which |A| < R. 

2. If A is not an eigenvalue of (11.21a), then both of the inhomogeneous integral equations have a 
unique solution for any perturbation function f(x) or g(a), and the corresponding homogeneous inte- 
gral equations have only trivial solutions. 

3. If A is an eigenvalue of (11.21a), then A is also an eigenvalue of the transposed equation (11.21b). 
Both homogeneous integral equations have non-vanishing solutions, and the number of linearly inde- 
pendent solutions are the same for both equations. 

4. For an eigenvalue A the homogeneous integral equation can be solved if and only if the perturbation 
function is orthogonal to every solution of the homogeneous transposed integral equation, i.e., for every 
solution of the integral equation 


b 
ψ(α) -- λ | Kw) dy, (11.22a) from (r)dr — 0 holds. (11.22b) 


'The Fredholm alternative theorem follows from these statements: Either the inhomogeneous integral 
equation can be solved for any perturbation function f(x) or the corresponding homogeneous equation 
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has non-trivial solutions. 


11.2.4 Numerical Methods for Fredholm Integral Equations ofthe 
Second Kind 


Often it is either impossible or takes too much work to get the exact solution of a Fredholm integral 
equation of the second kind 


p(x) = fx) +A f Ke) dy (11.23) 


by the solution methods given in 11.2.1, p. 622, 11.2.2, p. 625 and 11.2.3, p. 627. In such cases certain 
numerical methods can be used for approximation. Three different methods are given below to get the 
numerical solution of an integral equation of the form (11.23). 


11.2.4.1 Approximation of the Integral 


1. Semi-Discrete Problem 
Working on the integral equation (11.23) often one replaces the integral by an approximation formula. 
These approximation formulas are called quadrature formulas. They take the form 


b n 
f fe) dz Qua) = Dens len), (11.24) 
a k=1 


i.e., instead of the integral there is now a sum of the substitution values of the function at the inter- 
polation nodes x, weighted by the values wp. The numbers cj should be suitably chosen (so as to be 
independent of f). Equation (11.23) can be written in the approximate form: 


p(x) = f(x) + AQua Cr, Jolt) = f(x) + AY wK (x, yi) Qui.) (11.25a) 
k=1 
The quadrature formula Qu, (K (x, -)y(-)) also depends on the variable x. The dot in the argument of 
the function means that the quadrature formula will be used with respect to the variable y. Defining 
the relation 


P(x) = f(x) +A X ox K(x, ypy). (11.25b) 
k 


=1 
P(x) is an approximation of the exact solution y(x). One considers (11.25b) as a semi-discrete problem, 
because the variable y is turned into discrete values while the variable x can still be arbitrary. 

If the equation (11.25b) holds for a function G(x) for every x € [a,b], it must also be valid for the 
interpolation nodes x = xx: 


n 
σαν) = F(E) HAY wK (zr yP)  k—1,2,....m. (11.25c) 
j-l 

This is a linear system of equations containing n equations for the n unknown values (x4). Substituting 
these solutions into (11.25b) yields the solution of the semi-discrete problem. The accuracy and the 
amount of calculations of this method depend on the quadrature formula used. For example using the 
left-hand rectangular formula (see 19.3.2.1, p. 964) with an equidistant partition yy = xy = a + h(k — 
1, h—(b—a)/n, (k — 1,...,n) yields: 


b n 
[| KG WeWdy e 3 hK Gr, neu (11.263) 
a k=1 


With the notations 
Kj = (ανν fe =f (2x), Px = P(e) (11.26b) 
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the system (11.25c) has the form: 


(1 — Ahi) gr —AhKi2 φο-- ... —AAK in Yn = fi 
—AhKa q1 +(1 — ARKoay2) — ... —Ah Kos Yn = fo, (11.26c) 
—Ah πι pı —AhKn2 Yo- ... +(1 = ARKnn) Pn = jns 


The same system was involved in the Fredholm solution method (see 11.2.3, p. 627). As the rectangular 
formula is not accurate enough, for a better approximation of the integral one can increase the number 
of interpolation nodes, along with an increase in the dimension of the system of equations. Hence one 
gets the idea of looking for another quadrature formula. 


2. Nystróm Method 
In the so-called Nystróm method the Gauss quadrature formula is to be used for the approximation of 
the integral (see 19.3.3, p. 965). In order to derive this, one considers the integral 


f= f f(a) dz. (11.27a) 


The integrand is replaced by a polynomial p(x), namely the interpolation polynomial of f(x) at the 
interpolation nodes αρ: 


ρα) = S Ly (x) f (x) with 
k-i 


(z—21) ... (£ xk τα Όχι)... (£ -- Tn) 
(αν — 21)... (xy — zy a) (uy — za) (zx — Ln)” 
For this polynomial 

p(xx) = Εαν), k — 1,...,n. (11.27c) 
holds. The replacement of the integrand f(x) by p(x) results in the quadrature formula 


Li(x) = 


(11.27b) 


b b n b : b 
ὦ) ἆπ πο 2) dx = σε) | Γη) dz = wef (ap) with w = | Ly(x)dx. (11.27d 
[ fe) f») 3.169 f n) Dives lee) with ωι { tale) (11.274) 


For the Gauss quadrature formula the interpolation nodes cannot be chosen arbitrarily but they must 
be chosen by the formula: 
b b- 
πο ο ών να (11.28a) 
2 2 
The n values ἐκ are the n roots of the Legendre polynomial of the first kind (see 9.1.2.6, 3., p. 565) 
1 d? (e _ 1)" 
P, t TR . 
(t) 2". m! dt" 


These roots are in the interval [-1,--1]. The coefficients cw can be calculated by the substitution 


(11.28b) 


δις: 
2 -- Tk = tt — tp), 80: 


b 1 
1 t—t,)...(¢— ty 1)( = th πίη 
σσ | Sake ο 
J 25 (ἐκ — 51)... (tk — tci) (te — tha)... (tk — tn) 
= (b — a)A;. (11.29) 
In Table 11.1 are given the roots of the Legendre polynomial of the first kind and the weights Ap for 
R= Vos 6: 


632 11. Linear Integral Equations 


Table 11.1 Roots of the Legendre polynomial of the first kind 
n t A n t A 
1 t= 41-1 5 t, — —0.9062 A, = 0.1185 
2 tı = —0.5774 A; = 0.5 tə = —0.5384 Ag = 0.2393 
to = 0.5774 4A» = 0.5 tz= 0 As = 0.2844 
3 tı = —0.7746 Αι = 0.2778 t4 = 0.5384 A4 = 0.2393 
to = Ay = 0.4444 ts = 0.9062 As = 0.1185 
tz = 0.7746 As = 0.2778 6 = —0.9324 Αι = 0.0857 
4 tı = —0.8612 A, = 0.1739 = —0.6612 A» = 0.1804 
to = —0.3400 A» = 0.3261 tz = —0.2386 As = 0.2340 
t4 — 0.3400 As — 0.3261 t4 = 0.2386 A4 = 0.2340 
ty = 0.8612 A4 = 0.1739 ts = 0.6612 As = 0.1804 
tg = 0.9324 Ag = 0.0857 
W Solve the integral equation y(x) = cos 7x4 Fae 1)4 fe €"! p(y) dy by the Nyström method 
forn = 3. 
m=3: 2, = 0.1127, £5 = 0.5, 24 = 0.8873, 
A, = 0.2778, Ay = 0.4444, As = 0.2778, 
fi = 0.96214, f» = 0.13087, fs = —0.65251, 


Ky, = 1.01278, 


Koo = 1.28403, Μη» = 2.19746, 


Ki» = Ka = 1,05797, Kis = Kg = 1.10517; Kos = K2 = 1.55838. 


The system of equations (11.25c) for φι, φο, and qs is 


0.71864, 
--0.29390φι + 0.42938» 


— 0.470169» — 0.307023 = 0.96214, 


— 0.432923 = 0.13087, 
—0.30702,, — 0.692540» + 0.389553 = —0.65251. 


The solution of the system is: pı = 0.93651, p2 = —0.00144, p3 = —0.93950. The substitution values 
of the exact solution at the interpolation nodes are: y(2) = 0.93797, q(x3) =0, (x3) = —0.93797. 


11.2.4.2 Kernel Approximation 
Replace the kernel K(x, y) by a kernel K(x, y) so that K(x, y) ~ K(a,y) fora<a<ba<y<b. 
Try to choose a kernel making the solution of the integral equation 


ρα) = fle) +d | KG. ew) du (11.30) 


the easiest possible. 
1. Tensor Product Approximation 
A frequently-used approximation of the kernel is the tensor product approximation in the form 


K(a,y) ~ K(v,y) 2 3, X dir aj (x) By) (11.31a) 
10 k=0 
with given linearly independent functions ao(x), ... , o, (x) and boly), .... 8,(y) whose coefficients dj, 


must be chosen so that the double sum approximates the kernel closely enough in a certain sense. 
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Rewriting (11.314) in a degenerate kernel gives: 


K(a,y) = Σ α)(α) 5 dj By) 
j=0 k=0 


Now, the solution method 11.2.1, p. 62 


n 


ΟΙ 


150 


Functions ag(x), ... o, (x) and Bo(y),... 


can be calculated easily and also that 
2. Special Spline Approach 
One chooses 


YS a;(x)d;(y) 


ον ου ο (11.31b) 
k=0 j=0 

2 can be used for the integral equation 

| Ply) dy. (11.31c) 


, Pa (y) should be chosen so that the coefficients dj, in (11.31a) 
he solution of (11.31c) isn't too difficult. 


ox (x) = B(x) 
0 


for a special kernel approximation on 


non-zero values only in the so called carrier interval ( 


(x) 


xY 


kl 
n 


Figure 11.1 


As known, the solution of (11.919) has 
P(x) = f(x) + Aoao(x) + 


c Ánan (T). 


n (11:32) 
otherwise 


he interval of integration [a,b] = [0, 1]. The function o; (x) has 


RE κα ) , (Fig. 11.1). 


To calculate the coefficients dj, in (11.31a), consider K (x, y) 


at the points x = l/n, y = i/n (l,i =0,1,...,n). Then 
ἰ i 1 for =i k =i, 
K (5) m (7) 7 p otherwise (11.33) 


holds, and consequently K(I/n,i/n) = dn. Hence, one sub- 


— fd l 
stitutes dj; = κί “) = κί 
TL TL 


n 
the form 


; 3 Now (11.31a) has 
n 


(11.34) 


j=0 k=0 


the representation 


(11.35) 


The expression Agag(x) +... + Anan(x) is a piecewise linear function with substitution values Αχ at 


the points x = k/n. Solving (11.31c 


system of equations for the numbers Ao, ... , 
--λορι Αι — ... 
--λοιο Ao + (1 — λοιιΑι) -- ... 


(1 oF Acoo) Ao 


) by the method given for the degenerate kernel, gives a linear 
n : 
—Acon An = bo, 


-λο n Απ =b ; 
à : (11 368) 


--λόπρ Αρ Ent Αι 
where 
1 L 
cj = | δα)ακ(α) da -|5: 
i=0 
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1 1 
- πια), ε.α ΤΉΝΕ 
=K (2. 3 { osos) dx+...+K (2. 2) / αμ(α)ακ(α) da. (11.360) 
For the integrals it holds 1 
-- for j=0,k=Oandj=n,k=n, 
3n 
; : for j=k,1l<j< 
= E J =k; n, 
xc OOL =} 3n P= S e (11.36c) 
i 1 
? — do j=k+1,j=k-1, 
6n 
0 otherwise. 


The numbers ἐμ in (11.36a) are given by 


n k 4 
5 κο 3 se) dx. (11.36d) 
j=0 


Taking a matrix C with numbers cj; from (11.36a), a matrix B with the values K(j/n, k/n) and a 


bi = [κω 


matrix A with the values [μι respectively, a vector b from the numbers bo, . . . , bn, and a vector a from 
the unknown values Ao, ..., An, the system of equations (11.36a) has the form 

(I— AC)a = (I — ABAJa = b. (11.966) 
In the case when the matrix (I — ABA) is regular, this system has a unique solution a = (Ag,..., An). 


11.2.4.3 Collocation Method 


Suppose the n functions yi (x),..., n(x) are linearly independent in the interval [a,b]. They can be 
used to form an approximation function p(x) of the solution y(x): 

p(x) ~ G(x) = αιφι(α) + αοφο(α) +... + augu (x). (11.37a 
The problem is now to determine the coefficients a1, ..., an. Usually, there are no values a4,..., Gn 
such that the function p(x) given in this form represents the exact solution y(x) = P(x) of the integra 
equation (11.23). Therefore, n interpolation points £1, ... , &n are defined in the interval of integration, 
and it is required that the approximation function (11.37a) satisfies the integral equation at least at 
these points: 


(ry) = aii (zx) +... + anten (tk) (11.37b 


b 
f(x) + A f Klen) largi(y) +... +anpnly)] dy (k=1,...,n). (11.37c 


With some transformations this system of equations takes the form: 


Pn(Xr) z A | Klen y)pn(y) ay an 


a 


ου = A f K(an.yerly) ἀμ] ai... 


= f(a) (R= 1,..:, 7); (11.37d) 
Defining the matrices 
vizi): Yn(x1) Bu Bin P 
A= : : , B=] : with fj, = | Ke vet) dy (11.37e) 
pilin) diii Qo (X5) bni ὦ Brn a 


and the vectors 
a= (αν... an) , b-(f(n)..., f(x). (11.371) 
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then the system of equations to determine the numbers a1,..., απ can be written in matrix form: 


(A — AB) a — b. (11.378) 


T 1 
E p(x) = B + [ ναό p(y) dy. The approximation function is G(x) = a12? + agx + az, φι(α) = 


1’, φο(α) = x, p(x) = 1. The interpolation nodes are σι = 0,» = 0.5, 23 = 1. 
001 0 0 0 0 
- V2 v2 ES 
xd CE E B-| 7 5 3 b-| 2/2 
iod d 2 xx 2 
7 5 3 2 


The system of equations is 


a3 = 0, 
1 νὰ 1 νᾶ i V2 1 
= αι τις az + a3 = , 
1 7 1 9 5 2 3 3 2/2 
5 4 3 H 1 1 
=a = ay 2032 -, 
τα 5 2 3 03 2" 


whose solutions are αι = —0.8197, a2 = 1.8092, a3 = 0 and with these P(x) = —0.8197 x? + 1.8092 x, 
and so g(0) = 0, (0.5) = 0.6997, Ῥ(1) = 0.9895. 
The exact solution of the integral equation is y(x) = yx with the values y(0) = 0, φ(0.5) = 0.7071, 
p(1) =1. 
In order to improve the accuracy in this example, it is not a good idea to increase the degree of the 
polynomial, as polynomials of higher degree are numerically unstable. It is much better to use different 
spline approximations, e.g., a piecewise linear approximation g(r) = aii (x) + aoqo(x) ++ ++ +dnYn(x) 
with the functions introduced in 11.2.4.2 
for © 1 κας Put 

n n 


0 otherwise. 
In this case, the solution y(x) is approximated by a polygon G(x). 
Remark: There is no theoretical restriction as to the choice of the interpolation nodes for the colloca- 


tion method. In the case, however, when the solution function oscillates considerably in a subinterval 
the number of interpolation points in this interval should be increased. 


11.3 Fredholm Integral Equations ofthe First Kind 
11.3.1 Integral Equations with Degenerate Kernels 


1. Formulation of the Problem 
Consider the Fredholm integral equation of the first kind with degenerate kernel 
f(a) = fawno +... +an(x)bnly))ply)dy (e<«<d), (11.38a) 
and feasts notation similar to that used in 11.2, p. 622, 
b 
A= [Boudy G 12... (11.38b) 


a 
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Then (11.382) has the form 


f(x) = Aion (x) +... + Απαπ(α), (11.38c) 
i.e., the integral equation has a solution only if f(a) is a linear combination of the functions oi (x), ..., 
oj (x). If this assumption is fulfilled, the constants A;,...,A, are known. 


2. Initial Approach 
Looking for the solution in the form 


p(x) = efi(z) +...+ eB (x) (11.394) 

where the coefficients ci,..., c, are unknown, substituting in (11.387) 
b b 

A= a | By) dy +... ται | BGB) dy. (6 =1,2,-..,n), (11.39b) 

and introducing the notation 
b 

Ky = | 8i(y)8i(y) dy (11.390) 
gives the following system of equations for the unknown coefficients ci,.... Cn: 

Kua +...+ Kintn = Αι, 

: ] : (11.39d) 
Kaiti bb ue = Απ. 


3. Solutions 

The matrix of the coefficients is non-singular if the functions 6) (y),..., 8n(y) are linearly independent 
(see 12.1.3, p. 656). However, the solution obtained in (11.39a) is not the only one. Unlike the integral 
equations of the second kind with a degenerate kernel, the homogeneous integral equation belonging 
to (11.38a) always has a non-trivial solution. Suppose y"(x) is such a solution of the homogeneous 
equation and y(x) is a solution of (11.384). Then y(x) + qe" (x) is also a solution of (11.384). 

To determine all the solutions of the homogeneous equation, consider the equation (11.38c) with f(x) — 


0. If the functions a(x), . .. , o; (x) are linearly independent, the equation holds if and only if 
b 
Ay = | Bilyely) dy=0 (712...) (11.40) 


i.e., every function y"(y) orthogonal to every function 3;(y) is a solution of the homogeneous integral 
equation. 


11.3.2 Analytic Basis 


1. Initial Approach 
Several methods for the solution of Fredholm integral equations of the first kind 


fe) = [καιρο (cS x <d) (11.41) 


determine the solution y(y) as a function series of a given system of functions (8,(y)) = {81 (y), βα(υ), 
...}, Łe., looking for the solution in the form 


p(y) = Σ cjB;(y) (11.42) 


where there are to determine the unknown constants cj. Choosing the system of functions it is to be 
considered that the functions (5,(y)) should generate the whole space of solutions, and also that the 
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calculation of the coefficients c; should be easy. 

For an easier survey only real functions are discussed in this section. All of the statements can be 
extended to complex-valued functions, too. Because of the solution method there are to establish cer- 
tain requirements for properties of the kernel function K (x, y) (see [11.3], [11.11], [11.12]). It is to be 
assumed that these requirements are always fulfilled. Next, there are to discuss some relevant informa- 
tion. 

2. Quadratically Integrable Functions 

A function u(y) is quadratically integrable over the interval [a, b] if 


b 
[WO ay < 00 (11.43) 


holds. For example, every continuous function on [a, b] is quadratically integrable. The space of quadrat- 
ically integrable functions over [a, b] will be denoted by L?[a, b]. 

3. Orthonormal System 

Two quadratically integrable functions 8;(y), 8;(y), y € [a,b] are considered orthogonal to each other 
if the equality 


OLO =0 (11.44a) 


holds. A system of functions (3, (y)) in the space L?[a, b] is called an orthonormal system if the following 
equalities are true: 


b 
, 1. for. i=j, 
/ Bi(y)B;(u) dy = { 0 for : E ji ( 


An orthonormal system of functions is complete if there is no function (y) 4 0 in L?[a, b] orthogonal 
to every function of this system. A complete orthonormal system contains countably many functions. 
These functions form a basis of the space L?[a, b]. To transform a system of functions (8,(y)) into an 
orthonormal system (87 (y)) the Schmidt orthogonalization procedure can be used. This determines the 
coefficients 5,1, 5,5, . .. , Onn for n = 1,2,... successively so that the function 


1.44b) 


n 
Bau) = X brO) (11.440) 
j=1 
is normalized and orthogonal to every function Bt (y), .... 85 4(v). 


4. Fourier Series 
If (8, (y)) is an orthonormal system and v(y) € L?[a, b], then one calls the series 


oo 


» djBi(u) = vy) (11.455) 


j=l 
the Fourier series of p(y) with respect to (8,(y)), and the numbers d; are the corresponding Fourier 
coefficients. Based on (11.44b) 
b x b 
| Bee) dy = Yd; | 8,0089) dy = αι (1.450) 
a j=l a 


holds. If (8,(y)) is complete, the Parseval equality holds: 


b oo 
[WO ay = Y al. (11.450) 
à ja 
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11.3.3 Reduction of an Integral Equation into a Linear System of 
Equations 

A linear system of equations is needed in order to determine the Fourier coefficients of the solution func- 

tion (y) with respect to an orthonormal system. First, a complete orthonormal system (8,(y)), y € 


(a, b] is chosen. A corresponding complete orthonormal system (oj (z)) can be chosen for the interval 
x € [c,d]. With respect to the system (a,,(x)) the function f(x) has the Fourier series 


d 


f(a) = = fiai(x) with f;— J acr) f(a) da. (11.463) 


c 


If the integral equation (11.41) is multiplied by a;(x) and the integral is evaluated for x running from 
c to d alone, one gets: 


d b 
fio | | κω.υ)ρίθαιω) dy dz 
Py d 
- [Γκαν a ply)dy (i=1,2,...). (11.46b) 


The expression in braces is a function of y with the Fourier representation 


d oo 
/ K(2,y)a;(«) dx = K;(y) = Y, KgB;(y) with (11.469) 
c 4Ξ1 


Kj = f κο.υ«ιώβιώ dx dy. 


With the Fourier series approach 


ply) = Σ, ey) (11.464) 
k=1 
follows 
b oo oo 
h= /| Κυθη(υ) (Σ aw} dy 
a j=1 k=1 
oo oo b 
=> ΣΣ Kyo. f BOA) dy (i=1,2,...). (11.46e) 
j=1k=1 A 


Because of the orthonormal property (11.44b) the system of equations 


co 
h=] Kye; (6 =1,2,...) (11.46f) 
j=l 
holds. This is an infinite system of equations to determine the Fourier coefficients c4, c9, .... The matrix 


of coefficients of the system of equations 
Ku Kio Rua 
Ka Ko Kə °°: 

K= | Ky Kay Κα 


(11.46g) 
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is called a kernel matrix. The numbers f; and Kj; (i,j = 1,2,...) are known, although they depend 
on the orthonormal system chosen. 


TORS 


m Jo cosy — cosx 
principal value. As a complete orthogonal system can be used: 


1. a(x) = "1 alx) = 12 cosiz (02 12:4) 2. ον) = "Hn (35:152: sed) 


By (11.464), the coefficients of the kernel matrix are 
: 7 sinysin jy : 
K κ / ie dy =0 (j =1,2,.-.), 
s A Lg o Jo cosy — cosa usd 6 vh 
sin iy cos ix 2 [τ m sir 
Ki o [ ΠΡ. dx dy 5 [ sin y sin iy {I Lu a) dy (1—1,2,...). 
7 mro Jo iT T? Jo Jo COS ir 


COS y — COS x COS y — COS X 
For the inner integral the equation 


sin; 
E φ(υ) ἀν. 0 € x € v. The integral is considered in the sense of the Cauchy 


T =T (11.47) 


cosy — cosx ` siny 


T cosir sin iy 
d: a 
0 


0 forizj, 


2 π 
holds. Consequently Ki; = —— / sin jy sin iy dy = | , 
T Jo -1 for i=j. 


The Fourier coefficients of f(a) from (11.46a) are f; = / f(x)a;(x)dx (i =0,1,2,...). The system of 
0 


0 00-..Ν fà A 
—1 0 0: (9 1 
equations is 0—10- e | = fo |. According to the first equation, the system can have any 


J3 


solution only if the equality fo 


ri f (z)ao(x) dz = T go dr = 0 holds. Then c; = —f; (j = 
1,2,...), and y(y) --25 fisin jy = T SY μία) dx holds. 
Geos sr 


cos y — cos T 


11.3.4 Solution ofthe Homogeneous Integral Equation ofthe First 
Kind 


If p(y) and y"(y) are arbitrary solutions of the inhomogeneous and the homogeneous integral equation 
respectively, i.e., 


b b 
f(a) = / K (a, y)y(y) dy (11.48a) and 0= J K(x, y)” (y) dy, (11.48b) 


then the sum y(y) + e" (y) is a solution of the inhomogeneous integral equation. Therefore there are 
to determine all the solutions of the homogeneous integral equation. This problem is the same as de- 
termining all the non-trivial solutions of the linear system of equations 


Y Kye; =0 (i=1,2,...). (11.49) 
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As sometimes this system is not so easy to solve, the following method can be used for the calculations. 
If there is a complete orthonormal system (a,,(x)), take the functions 


d 
= ΤΟ dr (i=1,2,...). (11.50a) 
σ 
If ^ (y) is an arbitrary solution of the homogeneous equation, i.e., 
b 
[ Ke") dy — 0 (11.50b) 
a 
holds, then multiplying this equality by a; r and performing an integration with respect to x, gives 


o= fec o] eaa adu feo) ιν) αν (i—1,2,...), (11.509) 


i.e., every solution y’(y) of the homogeneous equation must be orthogonal to every function K;(y). 
Replacing the system (&,(y)) by an orthonormal system (K7(y)) and using an orthogonalization pro- 
cedure, instead of (11.50c) follows 
b 
[ enr dy — 0. (11.50) 
a 
Extending the system (/¢*(y)) into a complete orthonormal system, the conditions (11.50d) are obvi- 
ously valid for every linear combination of the new functions. If the orthonormal system (7(y)) is 
already complete, then only the trivial solution y"(y) = 0 exists. 
The solution system of the adjoint homogeneous integral equation can be calculated in exactly the same 
way: 
d 


[ Keyj) dx = 0. (11.50e) 
: sin z d= ori Anoomi costes: 9) = 20 iz (i= 
, cosy - IUS αν y y 5 StS ΤΠ. n orthonormal system 18: aj(v) = i sin ır { = 
1,2,...), Kily) = "E [ a z= 2l "a BED I ONG ee Apiye 
T T JO T TOME CHR 0 cos y — COS T 
21 f'sin(i — 1)y — sin(i + 1): 2 
(11.47) twice yields K;(y) 23 (au τς ») "m (i = 1,2,...). The 
π2 sin y π 


system (€, (y)) is already an orthonormal system. The function Ko(y) = completes this system. 


1 
Vn 
Consequently the homogeneous equation has only the solution: y"(y) = c—= = č, (cis arbitrary). 


VR 


11.3.5 Construction of Two Special Orthonormal Systems for a 
Given Kernel 


1. Preliminaries 

The solution of infinite systems of linear equations (see 11.3.3, p. 638) is not usually easier than the 
solution of the original problem. Choosing suitable orthonormal systems (oj, (1) and (8,(y)) one can 
change the structure of the kernel matrix K in such a way that the system of equations can be solved 
easily. By the following method two orthonormal systems can be constructed such that the coefficients 
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Kij of the kernel matrix are non-zero only for i = j and i = j + 1. 

Using the method given in the previous paragraph, first two orthonormal systems (5^ (y)) and (a(x) 
are to determine, i.e., the solution systems of the homogeneous and the corresponding adjoint homo- 
geneous integral equations respectively. This means that it can be given all the solutions of these two 
integral equations by a linear combination of the functions 5" (y) and a^ (x). These orthonormal sys- 
tems are not complete. By the following method these systems are to be completed step by step into 
complete orthonormal systems o;j(x), 8;(y) (j = 1,2,...). 


2. Procedure 


First a normalized function o4(r) is determined, which is orthogonal to every function (αμα). Then 


the following steps are performed for j = 1,2,...: 
1. Determination of the function £;(y) and the number v; from the formulas 


d 

νιβι(υ) = | K(x,y)oi(r)dx or (11.51a) 
ν 

θα) = f Κίω.υ)αν(α) ἀσ--μ-ι8,ιῳ) GAD), (11.510) 


so that v; is never equal to zero and §;(y) is normalized. Then 5;(y) is orthogonal to the functions 
((85()). BU) .... 85-1 (y)). 


2. Determination of the function &œj4ı (x) and the number p; from the formula 


b 
nios) = | Key) ily) du — να). (1510 


There are two possibilities: 

a) u; £0: The function a;+;(x) is orthogonal to the functions ((ak(a)), αι)... «αγ(α)). 

b) uj = 0: Then the function αγ γι (α) is not uniquely defined. Here again there are two cases: 

Οι) The system (510), oa(z),... «αγ(ο)) is already complete. Then the system (619), πο. 
8;(y)) is also complete, and the procedure is finished. 

bz) The system (1), oj (x), ...,05 (z)) is not complete. Then again one chooses an arbitrary func- 
tion αγγ (z) orthogonal to the previous functions. 


This procedure is repeated until the orthonormal systems are complete. It is possible that after a cer- 
tain step the case b) does not occur during a countable number of steps, but the system of this countable 


number of functions ((ah(z)), oa (x), .. ) is still not complete. Then again one can start the procedure 
by a function d (x), which is orthogonal to every function of the previous system. 


If the functions o; (x), 8;(y) and the numbers vj, jjj are determined by the procedure given above, the 
kernel matrix K has the form 


00 0... pH 0 0 
0 κι Ü 438 m (m NEST 
K=|0 0 K2... with K™= p m (m) . (11.52) 
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m 


The matrices K" (m = 1,2,...) are finite if during the procedure us ) = 0 holds after a finite number 


(m. 
2 
and zero columns in K corresponds to the number of functions in the systems (o^(x)) and (8^(y)). A 


very simple case happens if the matrices K™ contain one number v? = vm only, i.e., all numbers ps!” 
3 1 m y, , j 


are equal to zero. 
Using the notation of 11.3.3, p. 638, for the solution of the infinite system of equations under the as- 
sumptions f; = 0 for a,(x) € (a? (x)) it holds: 


of steps. They are infinite if for countably many values of j, μ ) # 0 holds. The number of zero rows 


fi [ 
a for (i Bry) ; 
-E- x Alu) € (Br) (11.53) 
arbitrary for j(y)€ (6h(y)) ] 
11.3.6 Iteration Method 
'To solve the integral equation 
b 
Λίο) = | Kaye) dy (cx « « 4). (1.549) 
starting with αρ) = f(x) one determines the functions 
d b 
Baly) = : K(2,y)om1()dz (11.54b) and d_(«) = / K(2,y)Br(y) dy, (1540) 
for n = 1,2,.... If there is a quadratically integrable solution y(y) of (11.54a) then the following 
equalities hold: 
b bd 
Γρίβα = f f KE yos κα) de dy 
d 
= fioi) ἄν (n=1,2,...). (11.544) 


Orthogonalization and normalization of the function systems (11.54b,c) give the orthonormal systems 
(o (x)) and (G*(y)). Using the Schmidt orthogonalization method then *(y) has the form 


By) = bnbl) | (n—12,...). (11.540) 
j-l 
Now it is assumed that the solution y(y) of (11.54a) has the representation by the series 
p(y) = 3 o8). (11.549 
j=l 


In this case for the coefficients c, regarding (11.54d) 


b n b n d 
cn = f eG) dy = Yo bn | elu) Bila) du = X bns | Goss (n) dz. (11.548) 
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holds. To have a solution in the form (11.54f) the following conditions are both necessary and sufficient: 


d co 
1. fire (£)? da = ES χα’ (x 7) def? ; (11.55a) 2. Mile < oo. (11.55b) 


n=l 5 n-l 


11.4 Volterra Integral Equations 
11.4.1 Theoretical Foundations 


A Volterra integral equation of the second kind has the form 


p(x) = f(a) + | Ki yeu) ἀν. (11.56) 


The solution function y(x) with the independent variable x from the closed interval J = [a,b] or from 
the semi-open interval J = [α, ου) is required. There is the following theorem about the solution of 
a Volterra integral equation of the second kind: If the functions f(x) for x € I and K(z, y) on the 
triangular region x € I and y € [a, x] are continuous, then there exists a unique solution y(x) of the 
integral equation such that it is continuous for x € J. For this solution 

φία) — f(a) (11.57) 
holds. In many cases, the Volterra integral equation ofthe first kind can be transformed into an equation 
ofthe second kind. Hence, theorems about existence and uniqueness of the solution are valid with some 
modifications. 
1. Transformation by Differentiation 
Assuming y(x), K(x, y), and K,(x, y) are continuous functions, the integral equation of the first kind 


f(x) = nc y)e(y) dy (11.584) 
can be transformed into the form 
f'(x) = Κ(α, π)φ(α) + [5 -K (v, υ)φ(υ) dy (11.58b) 


by differentiation with respect to x. If K (x, x) Z 0 for all x € I, then dividing the equation by K (x, x) 
gives an integral equation of the second kind. 


2. Transformation by Partial Integration 
Assuming that y(x), K(x, y) and K,(r, y) are continuous, one can evaluate the integral in (11.584) by 
partial integration. Substituting 


[ρω oly (x) (11.59a) 


gives 


f(x) 


ll 
A 
— 
[5 
τον 
LI 
S 
= 
= 
| 
Oe 
LEES 
Q 


(11.59b) 


ll 
~ 
R 
R 
L— 
- 
~~ 
oe, 

e 
5 
a 
— 
8 
< 
— 
ATA 
e 
= 
< 
= 
a 
< 
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If K (x, x) #0 for x € I, then dividing by K(x, x) gives an integral equation of the second kind: 


ψία) Aa E / ( 5K) wy) dy. (11.59c) 


Differentiating the solution v(x) yields the solution y(x) of (11.584). 


11.4.2 Solution by Differentiation 

In some Volterra integral equations the integral vanishes after differentiation with respect to x, or it 
can be suitably substituted. Assuming that the functions (Ar y), Kr, y), and y(x) are continuous 
or, in the case of an integral equation of the second kind, y(x) is differentiable, and differentiating 


f(z)- / K(z,y)e(y)dy (11.609 ^ or φία) -- f(x) + / K(z,y)up(y) dy (11.60b) 
with respect to x yields 

f'(x) = K(x, υ)φ(α JE: -K(v,y)v(uy)dy or (11.60c) 

gy (x) = f'(x) + K(2,2) ote) + [σε (x,y) p(y) dy. (11.60d) 


T pi 
W Find the solution y(x) for x € lo. 5) of the equation [ cos(x — 2y)p(y) dy = at sing (I). Differ- 
Jo 
T 1 
entiating it twice with respect to x gives y(x) cos x — [ sin(x — 2y)p(y) dy = 5 (sin x + x cos x) (IIa), 
20 
E 1 
and (1) cosx — i cos(x — 2y)p(y) dy = cos a — P sin x (IIb). The integral in the second equation 
0 
is the same as that in the original problem, so one can substitute it. This yields y’(x) cos x = cos x and 


0, 5), q(r)—-1, so φ(α) --αἠ-Ο. 


To determine the constant C substitute x = 0 in (IIa) to obtain y(0) = 0. Consequently C = 0, and 
the solution of (I) is y(x) = 


because cos x Æ 0 for x € 


Remark: If the kernel of a Volterra integral equation is a polynomial, then one can transform the in- 
tegral equation by differentiation into a linear differential equation. Suppose the highest power of x in 
the kernel is n. After differentiating the equation (n + 1) times with respect to x follows a differential 
equation of n-th order in the case of an integral equation of the first kind, and of the order n + 1 in the 
case of an integral equation of the second kind. Of course it is to be assumed that (a) and f(x) are 
differentiable as many times as necessary. 


T D 
ΗΠ πώ» [2(x — y) + Πρίν) αν = a? (1. After differentiating three times with respect to a holds 


ΠΝ z—y)e(y) dy = 3x? (II*a) eG) [ol y)dy = 6x (138), φ”(α)ἠ-4φ(α) = 6 (II*c). 


The general solution of this differential equation is y(x) = Asin 2x + B cos2r 4 - Substituting x 0 
in (I*a) and (II*b) results in y(0) = 0, φ’(0) = 0, so the solution is A = 0, B = —1.5. The solution 
of the integral equation (I*) is φ(α) = (1 — cos 21). 
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11.4.3 Solution of the Volterra Integral Equation of the Second 
Kind by Neumann Series 


The solution of the Volterra integral equations of the second kind can be represented by using a Neu- 
mann series (see 11.2.3, p. 627). If the equation has the form 


ία) = fla) +A f Ks.) dy, (11.61) 


so one formally substitutes 


K (x,y) = ο Hd E A (11.62a) 

With this transformation (11.61) is identically to a Fredholm integral equation 
b 

p(x) = fx) +A | Κία,υγρίω) dy, (11.62b) 

allowing b = co as adi. 'The solution has the representation 
- b 

ole) = f(a) +N" J Foe av. (11.620) 
The iterated kernels Ki, Ko : ... are defined by the following equalities: 

Ki(v,y) = Κία.ν). Κοία,υ) = [Rank Ὁ) dy = [κα κιν) dr, ... (11.624) 
and in general: Í | 

K,(r,y) = [κα η). (η, y) dn. (11.620) 
The equalities Κα, y) = Ofory > x (j = 1,2,...) are also valid for iterated kernels. Contrary to 


Fredholm integral equations if (11.61) has any solution, the Neumann series converges to it regardless 
of the value of A. 


E v(x) =1+ λ/ e" "p(y)dy. Ki(x,y) = K(x,y) =e, Κο(α, ) = f 6” Πε dy = e" (y — 
Jo 


y 
es Kalna) = cu yy 
Uh: nV, y (n 1)! y s 
oo Ar 
Consequently the resolvent is: P(x, y; à) = e V7 7 (a — y)” = επ ΔΕ, Tt is well-known that 
n=o 7 


this series is convergent for any value of the parameter λ. 


E 
One gets y(x) = 1+ af e MOD dy = 14 ΘΝ ο Ον dy, in particular if A = —1: y(x) = 
Jo 0 


1 j 
1-α,λ 1: φ(α) = Xx (1 | e+). 


11.4.4 Convolution Type Volterra Integral Equations 
If the kernel of a Volterra integral equation has the special form 


k(vr—y) for 0<y<a, 
J=10 


K(a,y) = οσο ey (11.63a) 
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can be used the Laplace transformation to solve the equations 


[Ble —y)elu)dy = F(x) (163) or ία) = fa) + fke- yey) dy. (11.635) 
0 0 
If the Laplace transforms L{y(x)} = S(p), L{f(x)} = F(p), and L{k(x)} = K(p) exist, then the 
transformed equations have the form (see 15.2.1.2, 11., p. 773) 
K(p)®(p) = F(p) (11.64a) or Φ(ρ) = F(p) + K(p)d(p) resp. (11.64b) 
From these follows 
F(p) F(p) 
(p) = —— 11.64c) or (p) = ————. resp. 11.64d 
= XO (11.64e) or So) = πω (11.644) 


The inverse transformation gives the solution y(x) of the original problem. Rewriting the formula for 
the Laplace transform of the solution of the integral equation of the second kind yields 


F(p) K(p) 


Φ F(p)4 F(p). 11.64e 
οτε Ὁ στ [5 (p) (11.64e 
The formula 
K(p) ; 
ike H(p) (11.64f 
depends only on the kernel, and denoting its inverse by A(x), the solution is 
ία) = f(a) + | he-y fly) dv. (11.648 


0 


The function h(a — y) is the resolvent kernel of the integral equation. 


WB p(x) = f(x) + i οἳ οφ) dy: (p) = F(p) + — 90). i.e., (p) = PIFO). The inverse 
0 p- p— 


transformation gives y(x). From H(p) = 


ία) = fle) + [ ef) dy. 


1 , 
5 it follows that A(x) = e?*. By (11.646) the solution is 


11.4.5 Numerical Methods for Volterra Integral Equation of 
the Second Kind 


The problem is to find the solution for the integral equation 
ία) = f(x) + f Kley) dy (11.65) 


for x from the interval J = [a,b]. The purpose of numerical methods is somehow to approximate the 
integral by a quadrature formula: 


/κα, yoy) dy = Qual K (æ, Je). (11.66a) 


Both the interval of integration and the quadrature formula depend on x. This fact is emphasized by 
the index [a, x] of Qr, ;;(. ..). One gets the following equation as an approximation of (11.65): 


Ble) = f(x) + Qua (Kv, JPO). (11.605) 
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The function G(x) is an approximation of the solution of (11.65). The number and the arrangement of 
the interpolation nodes of the quadrature formula depend on 2, so as to allow little choice. If € is an 
interpolation node of Qu ;4( (v, .)P(.)) , then (K(x, £)ig(£)) and especially P(E) must be known. For 
this purpose, the right-hand side of (11.66b) should be evaluated first for x = £, which is equivalent to a 
quadrature over the interval [α, €]. As a consequence, the use of the popular Gauss quadrature formula 
is not possible. 

The problem is to be solved by choosing the interpolation nodes as a = rg < πι < ... < zy «€ ... 
and using a quadrature formula Qt, ,,; with the interpolation nodes zo, z;,...,x,. The values of the 
function at the interpolation nodes are denoted by the brief notation ox = ία) (k = 0,1,2,...). 
For yo follows (see 11.3.1, p. 635) 


po = f(xo) = f(a),  (11.66c) and with this: φι = f(z1) + Qu (K (mi,.)g(.)). (11.664 
Θα οἱ) has the interpolation points zo and πι and consequently it has the form 
Qu, (K (21, .)ig(.) = wo (v1, xo)to + wi K (21, 21)1 (11.66e 


with suitable coefficients wo and w1. Continuing this procedure, the values pp are successively deter- 
mined from the general relation: 


Pr = f (Tk) + Ωιαεμ(Κίαι,.)φ(-)), k-12,3,.... (11.66f 
The quadrature formulas (δα...) have the following form: 


k 
Quas (K (£x, JPC) = 5 wj KR (Lp, Lj) Y;- (11.66g 


j=0 
Hence, (11.66f) takes the form: 


k 
Pr πο + 3 wK (xi, αη)φῃ. (11.66h 


j-0 
The simplest quadrature formula is the left-hand rectangular formula (see 19.3.2.1, p. 964). For this 
the coefficients are 


Wyk =Uj41—-%; for j«k and uwy,-0. (11.661) 
With this follows the system 

po = f(a), 

pı = f (a1) + αι — xo) K (21, ro) o, (11.67a) 


φο = f (x2) + (αι — xo) K (£2, τοφο + (x2 — αι) K (x2, £1)p1 
and generally 


ο 
Pr = f) + X (ziti — vj) K (ae, 25) 7. (11.67b) 


j-0 
More accurate approximations of the integral can be obtained by using the trapezoidal formula (see 
19.3.2.2, p. 964). To make it simple, one chooses equidistant interpolation nodes rj = a + kh, k = 
07172,5535 
h h Mi. 

EO dr το B g(xo) «325 (x) + glær) | . (11.67c) 

a j=l 
Using this approximation for (11.66f) one gets: 

po = f(a), (11.67d) 


Pk = ο 


ki 
K (aj, τοφο + K (xy, vi) t 2 X K (xe, sje) f (11.67e) 


j=1 
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Although the unknown values also appear on the right-hand side of the equation, they are easy to 
express. 

Remark: With the previous method one can approximate the solution of non-linear integral equations 
as well. Using the trapezoidal formula to determine the values y; one has to solve a non-linear equation. 
To avoid this one can use the trapezoidal formula for the interval [a, rj. 1], and use the rectangular 
formula for the interval [αι 1, £]. If h is small enough, this quadrature error does not have a significant 
effect on the solution. 


W The problem is to solve the integral equation y(x) = 2 + [i (x — υ)φ(υ) dy by the formula (11.66f) 
0 


using the rectangular formula. The interpolation nodes are the equidistant points x, = k: 0.1, and 
hence h = 0.1. 


T exact | rectangular | trapezoidal 
Qo =2, formula formula 
pe fla) ο. 30) 02, 0.2 | 2.0401 2.0602 2.0401 
da = fla) +h Ga to) po + ο 04 | 21621 | 223090 2.1620 
= 2 + 0.1(0.2 - 2 4- 0.1- 2.02) = 2.0602 0.6 | 2.3709 2.4342 2.3706 
etc. 0.8 | 2.6749 2.7629 2.6743 
1.0 | 3.0862 3.2025 3.0852 


In the table the values of the exact solution are given, as well as the approximate values calculated by 
the rectangular and the trapezoidal formulas, respectively, so the accuracies of these methods can be 
compared. The step size used is h = 0.1. 


11.5 Singular Integral Equations 

An integral equation is called a singular integral equation if the range of the integral in the equation 
is not finite, or if the kernel has singularities inside of the range of integration. It is to be supposed 
that the integrals exist as improper integrals, or as Cauchy principal values (see 8.2.3, p. 506ff.). The 
properties and the conditions for the solutions of singular integral equations are very different from 
those in the case of “ordinary” integral equations. In the following sections only some special problems 
are discussed. For further discussions see [11.2], [11.3], [11.7], [11.8]. 


11.5.1 Abel Integral Equation 

One of the first applications of integral equations for a physical prob- 
lem was considered by Abel. A particle is moving in a vertical plane 
along a curve under the influence only of gravity from the point 
Po(xo, yo) to the point P; (0,0) (Fig. 11.2). 

The velocity of the particle at a point of the curve is 


Ygy Po (Xo Yo) 


» 15 
Ρ/(0/) Xo x υ-- τν = V29(yo — y). (11.68) 
Figure 11.2 The time of fall as a function of yo is calculated by integration: 
[ 
T (yo) -/——“— (11.69a) 
o γ2φο-) 
If s is considered as a function of y, i.e., s = f(y), then 
Yo 
1 1 
T(yo) = | == f - ΓΘ) z% (11.690) 


ὁ ντο. v Uo — 
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The next problem is to determine the shape of the curve as a function of yo if the time of the fall is 
given. By substitution 


¥29-T(yo) =F(yo) and f'(y)- ew) (1.690) 


and changing the notation of the variable yo into x, a Volterra integral equation of the first kind is 
obtained: 


dy. (11.69d) 
It is to be considered now the slightly more general equation 
f(a) = [στὰ with 0<a<1. (11.70) 
J (z= y)° 


The kernel of this equation is not bounded for y = x. In (11.70), the variable y is formally replaced 
by € and the variable x by y. By these substitutions the solution is obtained in the form y = q(x). If 


both sides of (11.70) are multiplied by the term and integrated with respect to y between 


i 
(z-y)-* 


the limits a and z, it yields the equation 


|: ἃ f »ί8) "o. 
στης, (/ ae æ) dy luy dy. (11.712) 


Changing the order of integration on the left-hand side gives 


A [ f) 
e(E IT dp = | ay, (11.71b 
jæ (x — y) af -έ)α J (x — y)! α ) 
The inner "o can be evaluated by the substitution y = € + (a — £)u: 
T 1 
dy du π 
= = ; 11.71 
/ (s—y)-*( - 8 i ΠΝ ΤΟ; ος 


This result is to be substituted into (11.71b). After differentiation with respect to x one gets the func- 
tion φ(α): 


φία) = sin(am) d Ir f(y) - dy. (11.71d) 


π drj (r—y)- 


H= * v) dy, (x) T1 
Jo yrz—y T T 


11.5.2 Singular Integral Equation with Cauchy Kernel 
11.5.2.1 Formulation of the Problem 


Consider the following integral equation: 


alajo) + + [ EG oy) dy = fea), ser (11.72) 


Tid Your 
py 8 


_ 2 
π 


T is a system consisting of a finite number of smooth, simple closed curves in the complex plane such 
that they form a connected interior domain S* with 0 € S* and an exterior domain S^. Driving along 
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the curve there is S+ always on the left-hand side of I. A function u(x) is Hólder continuous (or satisfies 
the Hélder condition) over I if for any pair x1, 20 € I the relations 

lu(zi) —u(za3) < K|xi ο, O<B<1, K>0 (11.73) 
are valid. It is supposed that the functions a(x), f(x), and y(x) are Hólder continuous with exponent 


£1, and K(x, y) is Hólder continuous with respect to both variables with the exponents £5 > βι. The 
kernel K(r,y)(y — x)! has a strong singularity for x = y. The integral exists as a Cauchy principal 


Κα) -- Και α) 


y—c 


value. With K(x, x) = b(x) and k(v, y) = (11.72) holds in the form 


(£q)(x) := a(x) p(x) 4 (x) ely) dy à E / k(x,y)e(y)dy— f(a), wel. (11.74a) 


wis υ-α 
pd 


The expression (£y)(a) denotes the left-hand side of the integral equation in abbreviated form. £ is 
a singular operator. The function k(x, y) is a weakly singular kernel. It is assumed that the normality 
condition a(x)? — b(x)? 4 0, x € ΙΓ holds. The equation 


(Εοφ)(α) = α(χ)φ(α) + we) / = ) dy =f(x), ver, (11.74b) 


is the characteristic equation pertaining to (11.74a). The operator Lo is the characteristic part of the 
operator £. The adjoint integral ον of (11. soi yields the equality 


Eyo) = aowa- [ 95 , T. f (su ΞΟ 


πι =z g= 
Ῥ y y 


—-49(y. ver. (11.74c) 


11.5.2.2 Existence of a Solution 


The equation (Ly)(x) = f(x) has a solution φ(α) if and only if for every solution v(y) of the homoge- 
neous transposed equation (£L'y)(y) = 0 the condition of orthogonality 


ΠΟΤΟ dy =0 (11.75a) 


is satisfied. Similarly, the transposed equation (Z' v)(y) = g(y) has a solution if for every solution 
(a) of the homogeneous equation (Ly) (a) = 0 the following is valid: 


OUO dz = 0. (11.75b) 

T 
11.5.2.3 Properties of Cauchy Type Integrals 
The following function is called a Cauchy type integral over T: 

1 als 
eccl TU a. νεα (11.76a) 
arid y—z 
D 

For z ¢ Γ the integral exists in the usual sense and represents a holomorphic function (see 14.1.2, 
p. 732). Also Φ(οο) = 0 holds. For z = x € I in (11.76a) is to be considered the Cauchy principal 
value 


(Hee) - zi. [ £9) 


y-a 


dy, ΕΤ. (11.76b) 
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The Cauchy type integral ®(z) can be extended continuously over I" from S* and from S^. The limits 
when approaching z the point x € I’ are denoted by +(x) and Φ- (x), respectively. The formulas of 
Plemelj and Sochozki are valid: 
1 τ 1 

D(x) = 30l) + Helo), a)  - ze) + (Hy) la). (11.169) 
11.5.2.4 The Hilbert Boundary Value Problem 
1. Relations 
The solution of the characteristic integral equation and the Hilbert boundary value problem are strongly 


correlate. If (x) is a solution of (11.74b), then (11.76a) is a holomorphic function on S+ and S^ with 
d (oo) = 0. Because of the formulas of Plemelj and Sochozki (11.76c) 


p(x) = ΦΤ(α) - D(x), 2({φ)(α) = Ot (a) +O (x), rer (11.77a) 
holds. With the notation 
a(x) — b(x) f(x) 
2) = —————— and g(x) = ———_., 11.77k 
tz) a(x) + ὑ(α) and. 48) a(x) + θα) ’ . 
the characteristic integral equation has the form: 
D(x) = G(r)P (x) + g(x), zer. (11.77c) 
2. Hilbert Boundary Value Problem 
Looking for a function Φ(2) which is holomorphic on S* and S~, and vanishes at infinity, and satisfies 
the boundary conditions (11.77c) over I. A solution Φ(α) of the Hilbert problem can be given in the 
form (11.76a). So, as a consequence of the first equation of (11.772), a solution y(x) of the characteristic 
integral equation is determined. 
11.5.2.5 Solution of the Hilbert Boundary Value Problem 
(in short: Hilbert Problem) 


1. Homogeneous Boundary Conditions 
Ot (x) = G(x) (x), rel. (11.78) 


During a single circulation of the point x along the curve /7 the value of log G(x) changes by 27 iA;, 
where A; is an integer. The change of the value of the function log G(x) during a single traverse of the 
complete curve system I" is 


» οπ]λις πίη. (11.79a) 


1-0 


n 
The number & = Σ) Aj is called the index of the Hilbert problem. Now it is to compose a function 


[50 
Go(r) = (x — ao) "Π(α)σ(α) (11.79b) 
I(x) = (z — a1)" (x — ag)? +++ (£ — as, (11.79c) 
where ag € S* and αι (l= 1,...,n) are arbitrarily fixed points inside ΓΙ. If P = I is a simple closed 
curve (n = 0), then one defines I(x) = 1. With 
1 [l [ 
iy [ 5580, (11.794) 
20i £ yz 


the following particular solution of the homogeneous Hilbert problems is obtained, which is called the 
fundamental solution: 
II'(z)expl(z) for z € S*, 


X(2)= { (z— ag) “exp I (z) for σε. (11.79e) 
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This function doesn't vanish for any finite z. The most general solution of the homogeneous Hilbert 
problem, which vanishes at infinity, for & > 0 is 

Φῃ(α) = X(z)Pe-1(z), 2 EC (11.80) 
with an arbitrary polynomial P,,1(z) of degree at most (x — 1). For κ. € 0 there exists only the trivial 
solution ®;,(z) = 0 which satisfies the condition Φῃ(οο) = 0, so in this case P, 1(z) = 0. For & > 0 the 
homogeneous Hilbert problem has & linearly independent solutions vanishing at infinity. 


2. Inhomogeneous Boundary Conditions 
The solution of the inhomogeneous Hilbert problem is the following: 


Φ(α) = X(z) R(z) + Φι(2) (11.81) with R(z) xil NG y)dy . (11.82) 


If & < 0 holds, for the existence of a solution vanishing at infinity the following necessary and sufficient 
conditions must be fulfilled: 
y g(y)dy 
X*(y) 


=0 (k-0,L...,—& — 1). (11.83) 


11.5.2.6 Solution of the Characteristic Integral Equation 


1. Homogeneous Characteristic Integral Equation 
If a(z) is the solution of the corresponding homogeneous Hilbert problem, from (11.772) follows the 
solution of the homogeneous integral equation 


φι(α) = D$ (x) — (x), ve€I. (11.84a) 
For & € 0 only the trivial solution y(x) = 0 exists. For x > 0 the general solution is 
φη(α) = [X*(z) — X^ (x)| P4 ι(α) (11.84b) 


with a polynomial P,,_; of degree at most κ — 1. 


2. Inhomogeneous Characteristic Integral Equation 
If (z) is a general solution of the inhomogeneous Hilbert problem, the solution of the inhomogeneous 
integral equation can be given by (11.772): 


p(x) = &* (x) — D(x) (11.85a) 
= X*(r)R*(x) - X (x) R (x) + Of (a) — G(x), ver. (11.85b) 
Using the formulas of Plemelj and Sochozki (11.76c) for R(z) holds 
1 g(x = 1 g(x g 
R+ (æ) = XS g (x2) (x), Β-(α) = ane (3) (a). (11.85c) 


Substituting (11.85c) into (11.85a) and considering (11.76b) and g(x) = f(x)/(a(v) + b(x)) finally 


results in the the solution: 


T. X* (x) + X- (x) r 
p(x) 2(a(a) + b(x)) X* (x) . 


1 f) cab. ud 
nij Tale UMN Tay ee sero Qna 


F(X* (2) - X (2)) 


According to (11.83) in the case & « 0 the following relations must hold simultaneously in order to 
ensure the pios i a solution: 


dy=0 (k—0,1,...,—& — 1). (11.87) 


[aci o Far 
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W The characteristic integral equation is given with constant coefficients a and b: 


bro 
ag(x) + — ΤΙ 2w dy = f(x). Here I’ is a simple closed curve, i.e., l = Ip (n = 0). From (11.77b) 


—b 5 
and g(a“) = m . Gis a constant, consequently & = 0. Therefore, Π(α) = 1 and 
a 


follows G = = 
a 


+b 

—b —b 1 1 E 
epe I(z) = log? / πο ESSE 

a+b a+b 2riJ y—z 0 zes- 

a—b 

z T -ὂ 

MS) arb κα ο το απ 

1. zes, a+b 


Since & = 0 holds, the homogeneous Hilbert boundary value problem has only the function $,(2) = 0 
as the solution vanishing at τς m (11.86) a 
a b κο 


Xt 4 X~ fly ; 
per 2a 5x5 € yan us πὶ ο. -r a? — pl) 1? — b2 mid € 
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1. Functional Analysis 

Functional analysis arose after the recognition of a common structure in different disciplines such as 
the sciences, engineering and economics. General principles were discovered that resulted in a common 
and unified approach in calculus, linear algebra, geometry, and other mathematical fields, showing their 
interrelations. 

2. Infinite Dimensional Spaces 

There are many problems, the mathematical modeling of which requires the introduction of infinite 
systems of equations or inequalities. Differential or integral equations, approximation, variational or 
optimization problems could not be treated by using only finite dimensional spaces. 

3. Linear and Non-Linear Operators 

In the first phase of applying functional analysis — mainly in the first half of the twentieth century 
— linear or linearized problems were thoroughly examined, which resulted in the development of the 
theory of linear operators. More recently the application of functional analysis in practical problems 
required the development of the theory of non-linear operators, since more and more problems had 
to be solved that could be described only by non-linear methods. Functional analysis is increasingly 
used in solving differential equations, in numerical analysis and in optimization, and its principles and 
methods became a necessary tool in engineering and other applied sciences. 

4. Basic Structures 

In this chapter only the basic structures will be introduced, and only the most important types of 
abstract spaces and some special classes of operators in these spaces will be discussed. The abstract 
notion will be demonstrated by some examples, which are discussed in detail in other chapters of this 
book, and the existence and uniqueness theorems of the solutions of such problems are stated and 
proved there. Because of its abstract and general nature it is clear that functional analysis offers a large 
range of general relations in the form of mathematical theorems that can be directly used in solving a 
wide variety of practical problems. 


12.1 Vector Spaces 
12.1.1 Notion ofa Vector Space 


A non-empty set V is called a vector space or linear space over the field F of scalars if there exist two 
operations on V — addition of the elements and multiplication by scalars from F — such that they have 
the following properties: 

1. for any two elements x, y € V, there exists an element z = x + y € V, which is called their sum. 

2. For every x € V and every scalar (number) a € F there exists an element ax € V, the product of x 


and the scalar α so that the following properties, the axioms of vector spaces (see also 5.3.8.1, p. 365), 
are satisfied for arbitrary elements x, y, ΕΥ and scalars q, 8 € F: 

(V1) 2+(y+z)=(t+y) +2. (12.1 

(V2) There exists an element 0 € V, the zero element, such that x +0 = x. (12.2 

(V3) To every vector x thereisa vector — x such that x+ (—z)- 0. (12.3 

(V4) r+y=y+r. (12.4 

(V5) l-vw=a2, 0-α-0. (12.5 

(V6) α(θα) = (αβ)α. (12.6 

(VT) (a+ B)zr — ox + Bx. (12.7 

(V8) a(xr-y)- ox + oy. (12.8 
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V is called a real or complex vector space, depending on whether F is the field R of real numbers or 
the field C of complex numbers. The elements of V are also called either points or, according to linear 
algebra, vectors. The vector notation 7 or x is not used in functional analysis. 

The difference x — y of two arbitrary vectors x,y € V also can be defined in V as x — y = x + (—y). 
From the previous definition, it follows that the equation x +y = z can be solved uniquely for arbitrary 
elements y and z. The solution is x = z — y. Further properties follow from axioms (V1)-(V8): 

e the zero element is uniquely defined, 

e ax = prx and v £0, imply a = £, 

e ox — oy and a z 0, imply «= y, 

ο -(αν)-α:(-α). 


12.1.2 Linear and Affine Linear Subsets 


1. Linear Subsets 
A non-empty subset Vo of a vector space V is called a linear subspace or a linear manifold of V if together 
with two arbitrary elements x,y € Vo and two arbitrary scalars o, € F, their linear combination 
ax + By is also in Vo. Vo is a vector space in its own right, and therefore satisfies the axioms (V1)— 
(V8). The subspace Vo can be V itself or only the zero point. In these cases the subspace is called 
trivial. 
2. Affine Subspaces 
A subset of a vector space V is called an affine linear subspace or an affine manifold if it has the form 
{zo +y: y € Vo}, (12.9) 
where zo € V is a given element and Vo is a linear subspace. It can be considered (in the case αρ Æ 0) 
as the generalization of the lines or planes not passing through the origin in R. 
3. The Linear Hull 
'The intersection of an arbitrary number of subspaces in V is also a subspace. Consequently, for every 
non-empty subset E C V, there exists a smallest linear subset lin(E) or [E] in V containing E, namely 
the intersection of all the linear subspaces, which contain E. The set lin(E) is called the linear hull of 
the set E, or the linear subspace generated by the set E. It coincides with the set of all (finite) linear 
combinations 
0424 + O33 + ... + On; (12.10) 
comprised of elements x1, z5,..., 24, € E and scalars a1, @2,..., Qn E F. 
4. Examples for Vector Spaces of Sequences 
W A Vector Space IF": Let n be a given natural number and V the set of all n-tuples, i.e., all finite 
sequences consisting of n scalar terms ((£1,...,£,) : & € F, i = 1,...,n). The operations will be 


defined componentwise or termwise, i.e., if x = (£1,...,£,) and y = (ηι,....ημ) are two arbitrary ele- 
ments from V and a is an arbitrary scalar, a € F, then 
ct+y=(&+m,..-,6 +n), (12.118) a. a =(a&,...,a&,). (12.11b) 


In this way, the vector space F” is defined. The linear spaces R or C are special cases for n = 1. 

This example can be generalized in two different ways (see examples B and C). 

E B Vector Spaces of all Sequences: Considering the infinite sequences as elements x = (6,12. 
En € F and defining the operations componentwise, similar to (12.114) and (12.11b), the vector space 
S of all sequences are obtained. 


E C Vector Space (also coo) of all Finite Sequences: Let V be the subset of all elements of 
s containing only a finite number of non-zero components, where the number of non-zero components 
depends on the element. This vector space — the operations are again introduced termwise — is denoted 
by « or also by coo, and it is called the space of all finite sequences of numbers. 
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Β D Vector Space m (also 155) of all Bounded Sequences: A sequence x = {€,}°2, belongs 
to m if and only if there exists C, > 0 with |£,] < Cr, Vn = 1,2,.... This vector space is also denoted 
by 1°. 
H E Vector Space c of all Convergent Sequences: A sequence x = {€,,}°2, belongs to c if and 
only if there exists a number £j € F such that for Ve > 0 there exists an index no = no(£) such that 
for all n > ng one has |En — &| < € (see 7.1.2, p. 458). 
E F Vector Space ορ of all Null Sequences: The vector space co of all null sequences, i.e., the 
subspace of c consisting of all sequences converging to zero (£o — 0). 
E G Vector Space |’: The vector space of all sequences x = {€,}°2, such that 177 |En? is conver- 
gent, is denoted by I? (1 € p < oo). 
It can be shown by the Minkowski inequality that the sum of two sequences from I” also belongs to 10, 
(see 1.4.2.13, p. 32). 
Remark: For the vector spaces introduced in examples A-G, the following inclusions hold: 
eCcoCccmcs and gclcl'ccg where 1<p<q<oo. (12.12) 
5. Examples of Vector Spaces of Functions 
W A Vector Space F(T): Let V be the set of all real or complex valued functions defined on a given 
set T, where the operations are defined point-wise, i.e., if v = x(t) and y = y(t) are two arbitrary 
elements of V and a € F is an arbitrary scalar, then we define the elements (functions) x + y and o: x 
by the rules 
(x y)t) =2(t)+ y(t) VteT, (12.13a) 
(ax)(t)=a-a(t) Vte T. (12.13b) 
This vector space is denoted by F(T). 
Some of the subspaces are introduced in the following examples. 
E B Vector Space B(T) or M(T): The space B(T) is the space of all functions bounded on T. This 
vector space is often denoted by M(T). In the case of T = N, one gets the space M(N) = m from 
example D of the previous paragraph. 
E C Vector Space C([a, b]): The set C([a, ὑ]) of all functions continuous on the interval [a,b] (see 
2.1.5.1, p. 58). 
E D Vector Space C“)([a,b]): Letk € IN, k > 1. The set C? ([a, δ) of all functions k-times 
continuously differentiable on [a,b] (see 6.1, p. 432-437) is a vector space. At the endpoints a and 
b of the interval fa, b], the derivatives have to be considered as right-hand and left-hand derivatives, 
respectively. 
Remark: For the vector spaces of examples A-D of this paragraph, and T = [a,b] the following 
subspace relations hold: 
C? (la, δ) C 6([α, δ) C Βί[α, b]) C F([a, b]). (12.14) 
E E Vector Subspace of C([a, b]): For any given point to € [a,b], the set {x € Ο([α. ὑ]}: α({ο) = 0} 
forms a linear subspace of C([a, b]). 


12.1.3 Linearly Independent Elements 


1. Linear Independence 
A finite subset (z1,..., &n} of a vector space V is called linearly independent if 

aiti +++++An%, — 0 implies ay -:::— a, — 0. (12.15) 
Otherwise, it is called linearly dependent. If αι = +--+ = a, = 0, then for arbitrary vectors x1,...,%p 
from V, the vector αι] + +++ + oz, is trivially the zero element of V. Linear independence of the 
vectors #1,..., &n means that the only way to produce the zero element 0 = α121 +++ 4-02, is when 
all coefficients are equal to zero @ = +++ = an 0. This important notion is well known from linear 
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algebra (see 5.3.8.2, p. 366) and was used e.g. for the definition of a fundamental system of solutions of 
linear homogeneous differential equations (see 9.1.2.3, 2., p. 553). An infinite subset E C V is called 
linearly independent if every finite subset of E is linearly independent. Otherwise, E is called linearly 
dependent. 
W If the sequence whose k-th term is equal to 1 and all the others are 0 is denoted by e; , then belongs ep 
to the space y and consequently to any space of sequences. The set (ei, €2,...} is linearly independent 
in every one of these spaces. In the space C([0, 7), e.g., the system of functions 

1, sinnt, cosnt (n — 1,2,3,...) 
is linearly independent, but the functions 1, cos 2t, cos? t are linearly dependent (see (2.97), p. 81). 
2. Basis and Dimension of a Vector Space 
A linearly independent subset B from V, which generates the whole space V, i.e., lin(B) = V holds, is 
called an algebraic basis or a Hamel basis of the vector space V (see 5.3.8.2, p. 366). B = {re : € € E] 
is a basis of V if and only if every vector x € V can be written in the form x = Σ) αε”ε, where the 

£e 


coefficients ας are uniquely determined by x and only a finite number of them (depending on Ὁ) can be 
different from zero. Every non-trivial vector space V, i.e., V σε {0}, has at least one algebraic basis, 
and for every linearly independent subset E of V, there exists at least one algebraic basis of V, which 
contains this subset of E. 


A vector space V is m-dimensional if it possesses a basis consisting of m vectors. That is, there exist 
m linearly independent vectors in V, and every system of m + 1 vectors is linearly dependent. 

A vector space is infinite dimensional if it has no finite basis, i.e., if for every natural number m there 
are m linearly independent vectors in V. 

The space F” is n-dimensional, and all the other spaces in examples B-E are infinite dimensional. The 
subspace lin({1, t, t?}) C Ο([α, b]) is three-dimensional. 

In the finite dimensional case, every two bases of the same vector space have the same number of ele- 
ments. Also in an infinite dimensional vector space any two bases have the same cardinality, which is 
denoted by dim(V). The dimension is an invariant quantity of the vector space, it does not depend on 
the particular choice of an algebraic basis. 


12.1.4 Convex Subsets and the Convex Hull 
12.1.4.1 Convex Sets 


A subset C of a real vector space V is called convex if for every pair of vectors x,y € C all vectors of 
the form Ax + (1 — A)y, 0 € A € 1, also belong to C. In other words, the set C is convex, if for any 
two elements 2; and y, the whole line segment 


{Av + (1—A)y: 0€xAx1) (12.16) 


(which is also called an interval), belongs to C. (For examples of convex sets in R? see the sets denoted 
by A and B in Fig. 12.5, p. 684.) 

The intersection of an arbitrary number of convex sets is also a convex set, where the empty set is agreed 
to be convex. Consequently, for every subset E C V there exists a smallest convex set which contains 
E, namely, the intersection of all convex subsets of V containing E. It is called the convex hull of the 
set E and it is denoted by co (E). co(£) is identical to the set of all finite convez linear combinations 
of elements from E, i.e., co (E) consists of all elements of the form A12 +++: + λα, where 2,..., En 
are arbitrary elements from E and A; € (0, 1] satisfy the equality Ay + +--+ A, = 1. Linear and affine 
subspaces are always convex. 


12.1.4.2 Cones 
A non-empty subset C of a (real) vector space V is called a convex cone if it satisfies the following 
properties: 
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1. Cis a convex set. 

2. From a € C and A 7 0, it follows that Ax € C. 
3. From zv € C and —z € C, it follows that x = 0. 
A cone can be characterized also by 3. together with 


r,y€ C and λ.μ»0 imply Ar-cgye C. (12.17) 
ΒΑ: The set R} of all vectors x = (&,..., , £4) with non-negative components is a cone in R”. 


E B: The set C, of all real continuous functions on [a, b] with only non-negative values is a cone in the 
space C([a, b]). 
E C: The set of all sequences of real numbers (£, }55 , with only non-negative terms, i.e., £, > 0, Vn, 
is a cone in s. Analogously, cones are obtained in the spaces of examples C-G in 12.1.2, p. 655, if the 
sets of non-negative sequences are considered in these spaces. 


WB D: The set C C I? (1 < p < oc), consisting of all sequences (£, 5? ,, such that for some a > 0 


x En? X a (12.18) 


n=1 


is a convex set in I”, but obviously, not a cone. 


E E: Examples from R° see Fig. 12.1: a) convex set, not a cone, b) not convex, c) convex hull. 
A 
y y 
Z (0,1) 
A a 
a) X b) | 


Figure 12.1 


12.1.5 Linear Operators and Functionals 
12.1.5.1 Mappings 
A mapping T: D — Y from the set D C X into the set Y is called 


e injective, if 


T(r) 2 T(y) — x — y, (12.19) 
e surjective, if for 
Vy€Y thereexistsanelement r€ D suchthat T(r) =y, (12.20) 


e bijective, if T is both injective and surjective. 
D is called the domain of the mapping T and is denoted by Dr or D(T), while the subset (y € Y: 
Jx € Dr with T(x) = y) of Y is called the range of the mapping T and is denoted by R(T) or Im(T). 


12.1.5.2 Homomorphism and Endomorphism 
Let X and Y be two vector spaces over the same field F and D a linear subset of X. A mapping T': 
D — Y iscalled linear (or a linear transformation, linear operator or homomorphism), if for arbitrary 
x,y E Dando,f EF, 

T(ax + By) = ατα + BTy. (12.21) 
For a linear operator T the notation Τα is preferred, which is similarly used for linear functions, while 
the notation T(x) is used for general operators. 
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The range R(T) is the set of all y € Y such that the equation Tr = y has at least one solution. 
N(T) = (x € X: Tx = 0} is the null space or kernel of the operator T and is also denoted by ker(T). 

A mapping of the vector space X into itself is called an endomorphism. If T is an injective linear 
mapping, then the mapping defined on R(T) by 


y — v, such that Tx = y, y € R(T) (12.22) 
is linear. It is denoted by T^!: R(T) — X and is called the inverse of T. If Y is the vector space F, 
then a linear mapping f: X — F is called a linear functional or a linear form. 
12.1.5.3 Isomorphic Vector Spaces 


A bijective linear mapping T: X —9 Y is called an isomorphism of the vector spaces X and Y. Two 
vector spaces are called isomorphic provided an isomorphism exists. 


12.1.6 Complexification of Real Vector Spaces 


Every real vector space V can be extended to a complex vector space V. The set V consists of all pairs 
(x,y) with x,y € V. The operations (addition and multiplication by a complex number a + ib € €) 
are defined as follows: 


(21,91) + (va Y2) = (πι + xo, yi + yo), (13.234) (a 4-ib)(z, y) = (ax — by, bx + ay).(12.23b) 
Since the special relations 
(x,y) = (1.0) + (0, y) and i(y,0) = (0 +il)(y,0) = (O-y—1-0,ly+0-0) = (0,y) (13.24) 


hold, the pair (x, y) can also be written as x + iy. The set V is a complex vector space, where the set 


V is identified with the linear subspace Vo = {(x,0): x € V), i.e., x € V is considered as (x,0) or as 
2; 4 i0. 

This procedure is called the complexification of the vector space V. A linearly independent subset in V 
is also linearly independent in V. The same statement is valid for a basis in V, so dim(V) = dim(V). 


12.1.7 Ordered Vector Spaces 
12.1.7.1 Cone and Partial Ordering 


If a cone C is fixed in a vector space V, then an order can be introduced for certain pairs of vectors in 
V. Namely, ifr — y € C for some x,y € V then one writes r > y or y € x and say x is greater than 
or equal to y or y is smaller than or equal to x. The pair (V, C) is called an ordered vector space or a 
vector space partially ordered by the cone C. An element x is called positive, if x > 0 or, which means 
the same, if x € C holds. Moreover 

C={xeEV:x> 0}. (12.25) 


If the vector space IR? ordered by its first quadrant as the cone C(— Ri) is under consideration, then a 
typical phenomenon of ordered vector spaces will be seen. This is referred to as “partially ordered” or 
sometimes as “semi-ordered”. Namely, only certain pairs of two vectors are comparable. Considering 
the vectors x = (1, — 1) and y = (0,2), neither the vector x — y = (1, —3) nor y — x = (—1, 3) is in C, 
so neither x > y nor x < y holds. An ordering in a vector space, generated by a cone, is always only a 
partial ordering. 


It can be shown that the binary relation > has the following properties: 

(O1) r2rVrcV (reflexivity). (12.26) 
(O2) x> y and y> z imply «> z (transitivity). (12.27) 
(O3) r2y ando 20, o € R, imply az > ay. (12.28) 


(O4) αι yi and πο > ys imply αι 1-10 > Yı + yo. (12.29) 
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Conversely, if in a vector space V there exists an ordering relation, i.e., a binary relation > is defined 
for certain pairs of elements and satisfies axioms (O1)-(O4), and if one puts 
Vi = {x EV: «> 0}, (12.30) 
then it can be shown that V is a cone. The order >y, in V induced by V+ is identical to the original 
order >; consequently, the two possibilities of introducing an order in a vector space are equivalent. 
A cone C C V is called generating or reproducing if every element x € V can be represented as x = 
u — v with u,v € C. It can be written in the form V = C-C. 
W A: Anobvious order in the space s (see example B, p. 655) is induced by means of the cone 
CH={x={& $e: 20 Vn) (12.31) 
(see example C, p. 658). 
In the spaces of sequences (see (12.12), p 656) usually the natural coordinate-wise order is considered. 
This is defined by the cone obtained as the intersection of the considered space with C (see (12.31), 
p. 660). The positive elements in these ordered vector spaces are then the sequences with non-negative 
terms. It is clear that other orders can be defined by other cones, as well. Then orderings different from 
the natural ordering can be obtained (see [12.17], [12.19]). 


E B: In the real spaces of functions F(T), B(T), C({a,b]) and C™ (Ja, b]) (see 12.1.2, 5., p. 656), the 
natural order x > y for two functions x and y is defined by x(t) > y(t), Vt € T, or Vt € [a,b]. 
Then zx > 0 if and only if x is a non-negative function in T. The corresponding cones are denoted by 
F(T), B.(T), etc. Also C4} = C,(T) = F} (T) NC(T) can be obtained if T = [a, b]. 


12.1.7.2 Order Bounded Sets 
Let E be an arbitrary non-empty subset of an ordered vector space V. An element z € V is called 
an upper bound of the set E if for every x € E, {x € z. An element u € V is a lower bound of E if 
u € z, Vx € E. For any two elements x,y € V with x € y, the set 

[z y] = (ve V:x v € y (12.32) 
is called an order interval or (0)-interval. 
Obviously, the elements x and y are a lower bound and an upper bound of the set [x, y], respectively, 
where they even belong to the set. Α set E C V is called order bounded or simply (ο) bounded, if E is a 
subset of an order interval, i.e., if there exist two elements u, z € V such that u € x € z, Vr € E or, 
equivalently, E C fu, ο]. A set is called bounded above or bounded below if it has an upper bound, or a 
lower bound, respectively. 


12.1.7.3 Positive Operators 
A linear operator (see [12.2], [12.17]) T: X —+ Y from an ordered vector space X = (X, X,) into an 
ordered vector space Y = (Y, Y +) is called positive, if 

T(X,)CY,, ie, Tr 20 forall c0. (12.33) 


12.1.7.4 Vector Lattices 


1. Vector Lattices 

In the vector space R' of the real numbers the notions of (o)-boundedness and boundedness (in the 
usual sense) are identical. It is known that every set of real numbers which is bounded from above 
has a supremum: the smallest of its upper bounds (or the least upper bound, sometimes denoted by 
lub). Analogously, if a set of reals is bounded from below, then it has an infimum, the greatest lower 
bound, sometimes denoted by glb. In a general ordered vector space, the existence of the supremum 
and infimum cannot be guaranteed even for finite sets. They must be given by axioms. An ordered 
vector space V is called a vector lattice or a linear lattice or a Riesz space, if for two arbitrary elements 
x,y € V there exists an element z € V with the following properties: 

1. rXz and y<z, 

2. ifwe V with x € u and y € u, then z € u. 
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Such an element z is uniquely determined, it is denoted by x V y, and it is called the supremum of x 
and y (more precisely: supremum of the set consisting of the elements x and y). In a vector lattice, 
there also exists the infimum for any x and y, which is denoted by x ^ y. For applications of positive 
operators in vector lattices see, e.g., [12.2], [12.3] [12.15]. 
A vector lattice is called Dedekind complete or a K-space (Kantorovich space) if every non-empty subset 
E that is order bounded from above has a supremum lub(£) (equivalently, if every non-empty subset 
that is order bounded from below has an infimum glb(£)). 


E A: In the vector lattice F([a, b]) (see 12.1.2, 5., p. 656), the 
supremum of two functions 2, y is calculated pointwise by the for- 


mula 
(x V y)(t) = max(z(t),u(t)) Vt € [a,b]. (12.34) 
In the case of [a,b] = [0, 1], a(t) = 1 — $t and y(t) = t’ (Fig. 12.2), 
—3 af O< txt, 
(x V y)(t) = { A if 1 Zhe 1 (12.35) 


is obtained. 
E B: The spaces C([a,b]) and B([a,b]) (see 12.1.2, 5., p. 656) are 


» 

t also vector lattices, while the ordered vector space CV ([a, b]) is not 
a vector lattice, since the minimum or maximum of two differentiable 
functions may not be differentiable on [α, b], in general. 
A linear operator T: X — Y from a vector lattice X into a vector 
lattice Y is called a vector lattice homomorphism or homomorphism 

Figure 12.2 of the vector lattice, if for all rv, y € X 
T(rVy)— Tr VTy and T(z^y)= TrA Ty. (12.36 


2. Positive and Negative Parts, Modulus of an Element 
For an arbitrary element x of a vector lattice V, the elements 

z,—zV0, zr —(-z)VO0 and |2|=24+2_ (12.37 
are called the positive part, negative part, and modulus of the element x, respectively. For every elemen 
x € V, the three elements z.,, x_, |x| are positive, where for x,y € V the following relations are valid: 


zrErz,X|r, zr—z,—z., z,^z.—0, |x|—-zV(-z) (12.38a 
(ry) Sti νι (+y) Se -ty, |α4-9| s lel + yl, (12.38b 
ry implies r, Xy, and x δη. (12.38c 
and for arbitrary a > 0 
(ar), =ar}, (oar). =ar, |ac|-a]r|. (12.38d 
a = = | Ly 
0 0| ν΄ x,(t) b Ofa; x(t) b Of al” Ixl(t b 


Figure 12.3 


In the vector spaces F (fa, b]) and C(|a, b]), the positive part, the negative part, and the modulus of a 
function x(t) can be got by means of the following formulas (Fig. 12.3): 
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x(t) = D * a E j (13.394) 
ο ος r 7 zd (12.39b) let) =|2(t)| Vte [a,b]. (12.39c) 


12.2 Metric Spaces 
12.2.1 Notion of a Metric Space 


Let X be a set, and suppose a real, non-negative function p(z, y) (x,y € X) is defined on X x X. If 
this function p : X x X + R} satisfies the following properties (M1)-(M3) for arbitrary elements 
x,y,z € X, then it is called a metric or distance in the set X, and the pair X = (X, p) is called a metric 
space. The axioms of metric spaces are: 


(M1) p(a,y)>0 and p(x,y) — 0 if and only if x = y (non-negativity), (12.40) 
(M3) p(z.y) = ply.) (symmetry), (12.41) 
(M3) p(x,y) € p(x, 2) + p(z,y) (triangle inequality). (12.42) 


A metric can be defined on every subset Y of a metric space X = (X, p) in a natural way if the metric 
p of the space X is restricted to the set Y, i.e., if p is considered only on the subset Y x Y. The space 
(Y, p) of X x X is called a subspace of the metric space X. 

W A: The sets R” and C" are metric spaces with the Euclidean metric defined for points 

v= (έι,... δι) and y = (ηι,....ημ) as 


p(z,y) =| Σὺ |ξε — ml. (12.43) 
k=1 


W B: The function 


x,y) = max |f, — nk 12.44 

p(z,y) = max |&k — nel (12.44) 

for vectors x = (&,...,&) and y = (m,.-., Mn) also defines a metric in R” and C", the so-called 
mazimum metric. If & = (& Jos En) is an approximation of the vector x, then it is of interest to know 


how much is the maximal deviation between the coordinates: max [£x — €]. 
1<k<n 


The function 
play) = J |ξε — m (12.45) 
k=1 


for vectors x,y € R” (or C") defines a metric in R” and C", the so-called absolute value metric. The 
metrics (12.43), (12.44) and (12.45) are reduced in the case of n = 1 to the absolute value |x — y| in the 
spaces R and C (the sets of real and complex numbers). 

Wi C: Finite 0-1 sequences, e.g., 1110 and 010110, are called words in coding theory. If the num- 
ber of positions is counted where two words of the same length n have different digits, i.e., for x = 
(&,.... 62), Y = (m,---,M™), ξε, m € {0,1}, o(x, y) is defined as the number of the k € {1,...,n} 
values such that & A ng, then the set of words with a given length n is a metric space, and the metric 
is the so-called Hamming distance, e.g., o((1110), (0100)) = 2. 

W D: In the set m and in its subsets c and ορ (see (12.12), p. 656) a metric is defined by 


ρίωιν) = sup |g ml (@ (έν. ντ meme) (12.46) 
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[5.5] 
W E: Inthe set l? (1 € p < oc) of sequences x = (£1, ,...) with absolutely convergent series X |é, |” 
n=1 


a metric is defined by 


= Y in- ml’, (ye). (12.47) 
n=1 


ΒΕ: In the set C([a, b]) a metric is defined by 
BP) = max x(t) — v(t)]. (12.48) 
Ela, 


E G: In the set C™ ((a, b]) a metric is defined by 


-y max la? (t) — y (t)], (12.49) 


izo telo 
where (see (12.14) CO ([a, b]) is understood as C([a, 0])). 
E H: Consider the set L?(Q) (1 € p < oc) ofthe equivalence classes of Lebesgue measurable functions 


which are defined almost everywhere on a bounded domain Q C IR" and / |x(t)|? du. < oo (see also 


à 
12.9, p. 693). A metric in this set is defined by 


po.) = B| f lett) - vlr dp. (12.50) 


Q 


12.2.1.1 Balls, Neighborhoods and Open Sets 
In a metric space X = (X, p), whose elements are also called points, the following sets 
B(zo;r) = {x € X : p(x, vo) < r}, (12.51) B(ro;r) = {x € X : p(x, uo) < r} (12.52) 


defined by means of a real number r > 0 and a fixed point 29, are called an open and closed ball with 
radius r and center at 29, respectively. 

The balls (circles) defined by the metrics (12.43) and (12.44) and (12.45) in the vector space R? are 
represented in Fig. 12.4a,b with αρ = 0 andr = 1. 


“ion "ου ο 
{(x, ys ty” <1} 
(1,0) x (L0) x (L0) x 
b) c) 


Figure 12.4 


A subset U of a metric space X = (X, p) is called a neighborhood of the point xo if U contains xo together 
with an open ball centered at ro, in other words, if there exists an r > 0 such that B(xo;r) C U. A 
neighborhood U of the point x is also denoted by U(x). Obviously, every ball is a neighborhood of its 
center; an open ball is a neighborhood of all of its points. A point zo is called an interior point of a set 
AC X if xo belongs to A together with some of its neighborhood, i.e., there is a neighborhood U of £o 
such that r9 € U C A. A subset of a metric space is called open if all of its points are interior points. 
Obviously, X is an open set. 

The open balls in every metric space, especially the open intervals in R, are the prototypes of open sets. 
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The set of all open sets satisfies the following axioms of open sets: 
e If Ga is open for Va € J, then the set U Ga is also open. 
αεί 


n 
e If G4, G»,..., Gn are finitely many arbitrary open sets, then the set [] Gg is also open. 


ο The empty set Ý is open by definition. " 
A subset A of a metric space is bounded if for a certain element xo (which does not necessarily belong 
to A) and a real number R > 0 the set A is in the ball Β(αρ; R), i.e., p(x, £o) < R for all x € A. 


12.2.1.2 Convergence of Sequences in Metric Spaces 


Let X = (X, p) be a metric space, αρ € X and (r, 4, v, € X a sequence of elements of X. 
The sequence {2,,}°2, is called convergent to the point zo if for every neighborhood U (zo) there is an 
index no = no(U (xo)) such that for all n > no, £n € U (xo). The usual notation 


In — To (n oo) or lim. Tn = 20 (12.53) 


is used and the point zy is called the limit of the sequence (x, }°2,. The limit of a sequence is uniquely 
determined. Instead of an arbitrary neighborhood of the point x, it is sufficient to consider only open 
balls with arbitrary radii, so (12.53) is equivalent to the following: V € > 0 (now thinking about the 
open ball B(zo; £)), there is an index ng = no(£), such that if n > no, then p(r,,xo) < £. Notice that 
(12.53) means p(x,, xo) — 0. 

With these notions introduced in special metric spaces the distance between points can be calculated 
and the convergence of point sequences can be investigated. This has a great importance in numerical 
methods and in approximating functions by certain classes of functions (see, e.g., 19.6, p. 982). 

In the space R”, equipped with one of the metrics given above, convergence always means coordinate- 
wise convergence. 

In the spaces B([a, b]) and C([a, b]), the convergence introduced by (12.48) means uniform convergence 
of the function sequence on the set (a, b] (see 7.3.2, p. 468). 

In the space L?(Q) convergence with respect to the metric (12.50) means convergence in the (quadratic) 
mean, 1.6., x, — 20 if 


n -to du — 0 for n— oo. (12.54) 
Q 


12.2.1.3 Closed Sets and Closure 
1. Closed Sets 


A subset F of a metric space X is called closed if X V F is an open set. Every closed ball in a metric 
space, especially every interval of the form fa, b], (a, oo), (--οο, a] in R, is a closed set. 
Corresponding to the axioms of open sets, the collection of all closed sets of a metric space has the 
following properties: 
e If F, are closed for Va € I, then the set N Fa is closed. 
acr "m 

e [f Fi,..., F, are finitely many closed sets, then the set U Fp is closed. 
e The empty set () is a closed set by definition. n 
The sets Ø and X are open and closed at the same time. 
A point xo of a metric space X is called a limit point of the subset A C X if for every neighborhood 
U (xo), 

U(t) Γ A z 0. (12.55) 
If this intersection always contains at least one point different from xo, then 2ρ is called an accumulation 
point of the set A. A limit point, which is not an accumulation point, is called an isolated point. 
An accumulation point of A does not need to belong to the set A, e.g., the point a with respect to the 
set A = (a, b], while an isolated point of A must belong to the set A. 
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A point xo is a limit point of the set A if there exists a sequence {£n}? | with elements x, from A, which 
converges to xo. If xo is an isolated point, then x, = £o, V n > no for some index no. 

2. The Closure of a Set 

Every subset A of a metric space X obviously lies in the closed set X. Therefore, there always exists a 
smallest closed set containing A, namely the intersection of all closed sets of X, which contain A. This 
set is called the closure of the set A and it is usually denoted by A. A is identical to the set of all limit 
points of A; A is obtained from the set A by adding all of its accumulation points to it. A is a closed 
set if and only if A = A. Consequently, closed sets can be characterized by sequences in the following 
way: A is closed if and only if for every sequence {x,,}°2, of elements of A, which converges in X to an 
element xo(€ X), the limit αρ also belongs to A. 
Boundary points of A are defined as follows: αρ is a boundary point of A if for every neighborhood 
U (x9), U(vo) ΓΑ Æ Ý and also U (xo) O (X \ A) - 0. αρ itself does not need to belong to A. Another 
characterization of a closed set is the following: A is closed if it contains all of its boundary points. (The 
set of boundary points of the metric space X is the empty set.) 


12.2.1.4 Dense Subsets and Separable Metric Spaces 
A subset A of a metric space X is called everywhere dense if A = X, i.e., each point x € X is a limit 
point of the set A. That is, for each x € X, there is a sequence {£n} 2, € A such that x, — x. 


W A: According to the Weierstrass approximation theorem, every continuous function on a bounded 
closed interval [a,b] can be approximated arbitrarily well by polynomials in the metric space of the 
space C({a, b]), i.e., uniformly. This theorem can now be formulated as follows: The set of polynomials 
on the interval [a, b] is everywhere dense in C([a, ὑ]). 


W B: Further examples for everywhere dense subsets are the set of rational numbers Q and the set of 
irrational numbers in the space of the real numbers R.. 

A metric space X is called separable if there exists a countable everywhere dense subset in X. A count- 
able everywhere dense subset in IR" is, e.g., the set of all vectors with rational components. The space 
] =I is also separable, since a countable everywhere dense subset is formed, for example, by the set of 
its elements of the form 2 = (r1, r2, ..., rN,0,0,...) , where r; are rational numbers and N = N(x) is 
an arbitrary natural number. The space m is not separable. 


12.2.2 Complete Metric Spaces 
12.2.2.1 Cauchy Sequences 


Let X = (X, p) be a metric space. A sequence {£n}; with x, € X is called a Cauchy sequence if for 
Ve > 0 there is an index no = no(£) such that for V n, m > no there holds the inequality 

(Zn, Lm) < €. (12.56) 
Every Cauchy sequence is a bounded set. Furthermore, every convergent sequence is a Cauchy se- 
quence. In general, the converse statement is not true, as is shown in the following example. 


E Consider the space 1! with the metric (12.46) of the space m. Obviously, the elements ας") = 


11 1 
g gaa 0,0... belong to l! for every n = 1,2,... and the sequence {2'"}°°, is a Cauchy 
n 
sequence in this space. If the sequence (of sequences) [zoe 1 converges, then it has to be convergent 
11 1 1 
also coordinate-wise to the element x = g ο... Y 
2 ὃ n nl 
oo 
1 
n 


; ) . However, «°) does not belong 


to I^, since $7 


n=1 


= +00 (see 7.2.1.1, 2., p. 459, harmonic series). 
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12.2.2.2 Complete Metric Spaces 


A metric space X is called complete if every Cauchy sequence converges in X. Hence, complete metric 
spaces are the spaces for which the Cauchy principle, known from real calculus, is valid: A sequence 
is convergent if and only if it is a Cauchy sequence. Every closed subspace of a complete metric space 
(considered as a metric space on its own) is complete. The converse statement is valid in a certain way: 
If a subspace Y of a (not necessary complete) metric space X is complete, then the set Y is closed in X. 


E Complete metric spaces are, e.g., the spaces: m, I? (1 € p « oo), c, B(T), C([a,b]), C([a, b]) 
L?(a,b) (1 < p < oo). 


12.2.2.3 Some Fundamental Theorems in Complete Metric Spaces 
'The importance of complete metric spaces can be illustrated by a series of theorems and principles, 
which are known and used in real calculus, and which are to be applied even in the case of infinite 
dimensional spaces. 
1. Theorem on Nested Balls 
Let X be a complete metric space. If 

B(ziri) D Β(υ»;τα) Ὁ: 2 B(anjrn) D+: (12.57) 
is a sequence of nested closed balls with r, —— 0, then the intersection of all of those balls is non- 
empty and consists of only a single point. If this property is valid in some metric space for any sequence 
satisfying the assumptions, then the metric space is complete. 


2. Baire Category Theorem 


œo 
Let X be a complete metric space and {F,}?2, a sequence of closed sets in X with U Fẹ = X. Then 


there exists at least one index ko such that the set 7, has an interior point. 

3. Banach Fixed-Point Theorem 

Let F be a non-empty closed subset of a complete metric space (X, p). Let T: X — X be a contracting 
operator on F, i.e., there exists a constant q € [0, 1) such that 


p(Tx,Ty) € qp(r,y) forall x,y €F. (12.58 
Suppose, if x € F, then Tx € F. Then the following statements are valid: 
a) For an arbitrary initial point xo € F the iteration 

appe. 012...) (12.59 
is well defined, i.e., x, € F for every n. 

b) The iteration sequence {£n}? ϱ converges to an element z* € F. 
c) Ta* =2", i.e., z^ is a fixed point of the operator T. (12.60 
d) The only fixed point of T in F is x*. 

e) The following error estimation is valid: 


n 


p(x* τμ) < icm). (12.61 


The Banach fixed-point theorem is sometimes called the contraction mapping principle. 
12.2.2.4 Some Applications of the Contraction Mapping Principle 
1. Iteration Method for Solving a System of Linear Equations 


The given linear (n, n) system of equations 


Q1111 01212 T... Aint, = by, 
2101 02212 T... ann = bə, 


(12.62a) 


AniTi TAn2%2 71 
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can be transformed according to 19.2.1, p. 955, into the equivalent system 
zı —(1— an )zı +aigt dec +41ntn = by, 
22 Εασισι —(1 — d23)v9 ++ antn = bo, (12.62b) 
Tn +anı Tı ΓΩη902 +++ α απῃ) η = br 
If the operator T: F” — F” is defined by 
n n T 
Τα = (s - Sayre + buses En — angen + 2 (12.63) 
k=l k=1 
then the last system is transformed into the fixed-point problem 
r-Ty (12.64) 


in the metric space F”, where an appropriate metric is considered: The Euclidean (12.43), the maxi- 


mum (12.44) or the absolute value metric p(x, y) = X |£ — yx| (compare with (12.45)). If one of the 
k=1 


numbers 
n n n 
2 , ax : 
2. lax, max $. laje], max αν] (12.65) 
jk=l k=1 1-1 


is smaller than one, then T turns out to be a contracting operator. It has exactly one fixed point 
according to the Banach fixed-point theorem, which is the componentwise limit of the iteration sequence 
started from an arbitrary point of F”. 


2. Fredholm Integral Equations 
The Fredholm integral equation of second kind (see also 11.2, p. 622) 


e) - f Κία. yet) dy = fe), x € [aU (12.66) 


with a continuous kernel K(x, y) and continuous right-hand side f(x) can be solved by iteration. By 
means of the operator T: C({a, δ]) ---» Ο([α. ϐ]) defined as 


Τρία) = | KG, yet) dy + f(x). Ve € C(la, b), (12.67) 


it is transformed into a fixed-point problem Ty = ¢ in the metric space C([a, b]) (see example A in 
b 

12.1.2, 4., p. 655). If max | | K (x, y)| dy < 1, then T is a contracting operator and the fixed-point 
a-r-6Ja 


theorem can be applied. The unique solution is now obtained as the uniform limit of the iteration 
sequence {pn } p1, where o, = T'9,4, starting with an arbitrary function yo(x) € C(|a, b]). It is clear 
that yn = T" and the iteration sequence is {T” po}. 


3. Volterra Integral Equations 
The Volterra integral equation of second kind (see 11.4, p. 643) 


ote) — [Κανλί dy = fle), ee fnt (12.68) 


with a continuous kernel and a continuous right-hand side can be solved by means of the Volterra 
integral operator 


(να) = f Kley) dy Y E Cla, t) (12.69) 
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and Ty = f + Vo as the fixed-point problem Ty = y in the space Ο([α, b]). 
4. Picard-Lindelof Theorem 
Consider the differential equation 

à = f(x) (12.70) 
with a continuous mapping f: J x G — R”, where J is an open interval of R and G is an open domain 
of R". Suppose the function f satisfies a Lipschitz condition with respect to x (see 9.1.1.1, 2. p. 541), 
i.e., there is a positive constant L such that 

olf (t3), f (t v2)) € Lo(ai,x2) V (t, 21), (πο) € I XG, (12.71) 
where ϱ is the Euclidean metric in IR". (Using the norm (see 12.3.1, p. 669) and the formula (12.81) 
o(x, y) = lx — y|| (12.71) can be written as || f(t, 1) — f (t, v2)|| < L- ||x1 — πο.) Let (to, zo) € 1 x G. 
Then there are numbers 3 > 0 and r > Osuch that the set Q = (t, £) € RxR”: |t—to| € B, o(z, zo) € 
r} lies in I x G. Let M = maxo o(/(t, ο), 0) anda = min{Z, up Then there is a number b > 0 such 


that for each 2 € B = {x € R”: o(x, zo) € b}, the initial value problem 

t= f(t,x), αρ) =% (12.72) 
has exactly one solution y(t, 2), i.e., e(t, 2) = f(t, y(t, Z)) for Vt satisfying |t—to| < a and y(to, 1) = 3. 
The solution of this initial value problem is equivalent to the solution of the integral equation 


t 
y(t,#) — i4 J 16.6.3) as, t € [tp — o, to +a]. (12.73) 
to 
If X denotes the closed ball {y(t, x): d(p(t, x), το) € r} in the complete metric space C([to — a, to + 


a] x B; R”) with metric 
dy, V) 


then X is a complete metric space with the induced metric. If the operator T: X — X is defined by 


ο(φ(ί, x), v (t, α)), (12.74) 


= max 0 
(t) e (]t-to]| Sa} B. 


Τφί(!, α) -Ξ 2 1- EC p(s, 2)) ds (12.75) 


then T is a contracting operator and the solution of the integral equation (12.73) is the unique fixed 
point of T' which can be calculated by iteration. 


12.2.2.5 Completion of a Metric Space 


Every (non-complete) metric space X can be completed; more precisely, there exists a metric space X 
with the following properties: 


a) X contains a subspace Y isometric to X (see 12.2.3, 2., p. 669). 

b) Y is everywhere dense in X. 

c) Xisa complete metric space. 

d) If Z is any metric space with the properties a)-c), then Z and X are isometric. 

'The complete metric space, defined uniquely in this way up to isometry, is called the completion of the 
space X. 

12.2.3 Continuous Operators 


1. Continuous Operators 
Let T: X — Y be a mapping of the metric space X = (X, p) into the metric space Y = (Y, o). T 
is called continuous at the point τρ € X if for every neighborhood V = V (yo) of the point yo = T (zo) 
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there is a neighborhood U = U (xo) such that: 

T(r)€V forall «eu. (12.76) 
T is called continuous on the set A C X if T is continuous at every point of A. Equivalent properties 
for T to be continuous on X are: 
a) For any point x € X and any arbitrary sequence (5, 8 4, £n € X with x, — x there always holds 
T(r,) — T(x). Hence p(,, x) — 0 implies o(T (zn), T(x)) — 0. 
b) For any open subset G C Y the inverse image T^! (G) is an open subset in X. 
c) For any closed subset F C Y the inverse image 7T" ! (F) is a closed subset in X. 


d) For any subset A C X one has T(A) C T(A). 

2. Isometric Spaces 

If there is a bijective mapping T: X — Y for two metric spaces X = (X, p) and Y = (Y, o) such that 
p(z,y) = o(T(x), T(y)) ναι € X, (12.77) 


then the spaces X and Y are called isometric, and T is called an isometry. 


12.3 Normed Spaces 


12.3.1 Notion ofa Normed Space 
12.3.1.1 Axioms of a Normed Space 


Let X be a vector space over the field F. A function || - ||: X — RÌ is called a norm on the vector 
space X and the pair X = (X, || - ||) is called a normed space over the field F, if for arbitrary elements 
x,y € X and for any scalar a € F the following properties, the so-called axioms of a normed space, are 


fulfilled: 


(N1) ||lz|20, and ]|r|-0 ifandonlyifz = 0, (12.78 
(N2) |jaz|| = Ja] - ||z]| (homogenity), (12.79 
(N3) ||x —- yll πι + [0] (triangle inequality). (12.80 


A metric can be introduced by means of 

p(z,y) = [αυ]; x,y €X, (12.81 
in any normed space. The metric (12.81) has the following additional properties which are compatible 
with the structure of the vector space: 

p(x 4- z,y +z) = p(x, y), zexXx (12.82a 

p(ax,ay) = |o|p(x,y), αετ. (12.82b 

So, in a normed space there are available both the properties of a vector space and the properties of a 
metric space. These properties are compatible in the sense of (12.82a) and (12.82b). The advantage is 
that most of the local investigations can be restricted to the unit ball 

B(0;1)= {x EX: lal] <1} or B(0;1)2(zeX: |x|] <1} (12.83 


since 
B(x;r)= {y E€ X: |y- αἱ <r} 2x-rB(0;1, VaeX and Vr>0. (12.84 


Moreover, the algebraic operations in a vector space are continuous, i.e., 


In >T, YY, αι-λα imply 
Tn ca — THY, An%n Om, [55| — UE ( 2.85 
In normed spaces instead of (12.53) one may write for convergent sequences 
llen — zo| — 0 (n — oo). (12.86 
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12.3.1.2 Some Properties of Normed Spaces 


Among the linear metric spaces, those spaces are normable (i.e., a norm can be introduced by means 
of the metric, if one defines ||x|| = p(x, 0)) whose metric satisfies the conditions (12.82a) and (12.82b). 


Two normed spaces X and Y are called norm isomorphic if there is a bijective linear mapping T: X — 
Y with [Τα] = |α|| for all x € X. Let || - ||; and ||- ||2 be two norms on the vector space X, and denote 
the corresponding normed spaces by X; and X», i.e., X; = (X, || τι) and Xo = (X, || - |»). 


The norm || - ||: is stronger than the norm || - ||», if there is a number 7 > 0 such that |||» € η|α]|. 
for all x € X. In this case, the convergence of a sequence {x,}°, to x with respect to the stronger 
norm || - |11» i-e., ||z, — αμ — 0, implies the convergence to x with respect to the norm || : |12, i.e. 
|En — x||a — 0. 


Two norms ||-|| and ||- ||ı are called equivalentifthere are two numbers γι > 0, y2 > Osuch that Vr € X 
there holds γι]! € |||; < yl|a||. In a finite dimensional vector space all norms are equivalent to 
each other. 


A subspace of a normed space is a closed linear subspace of the space. 


12.3.2 Banach Spaces 
A complete normed space is called a Banach space. Every normed space X can be completed into a 
Banach space X by the completion procedure given in 12.2.2.5, p. 668, and by the natural extension o 
its algebraic operations and the norm to X. 


12.3.2.1 Series in Normed Spaces 


In a normed space X infinite series can be considered. That means for a given sequence (x, 1 0 
elements x, € X a new sequence {sp}? is constructed by 


$1 = 14,89 = T1 + T2,- ., Sk = T1 b Ek = Ska ER, sue (12.87 
If the sequence {s+}? is convergent, i.e., [δι -- s|| — 0 (k — oo) for some s € X, then a convergen 
series is defined. The elements s1, $5, . .., Sk, ... are called the partial sums of the series. The limit 
k 
s — lim 1t 12.88 
pe " ( 


oo oo 
is the sum of the series, and it is denoted by s = Ð xn. A series Σ) zr, is called absolutely convergent 
n=1 n=1 


œ 
if the number series 5; ||z,|| is convergent. In a Banach space every absolutely convergent series is 
n=1 


convergent, and ||s|| < )» \|v,|| holds for its sum s. 


n=1 


12.3.2.2 Examples of Banach Spaces 


ΒΛ: F” with [1 = (x ir) ,ifl<p<o; [α]- max [óc f if p = oo. (12.89a 


k=1 

These normed spaces over the same vector space F” are often denoted by P(n) (1 € p € oo). For 

1 < p < œ, they are called Euclidean spaces in the case of F = R, and unitary spaces in the case o 

F=C. 

HB: m with ||| =sup|&|. (12.89b 
k 


MC: c and co with the norm from m. (12.89c 


1 
ED: P with ||| = |||; = p ea) (1 € p < oo). (12.89d 


n=1 
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B E: C(|a.b]) with 1 = mar |x(t)|. (12.89e) 
Ela, g 
b Ῥ 

ΒΕ: L((a,b)) (1€ p «oo) with ||| = ||x||; = (/ sopa) : (12.598) 


k 
BG: C({a,b]) with ||z|| = 2 max je (t)|, where v(t) stands for x(t). (12.89g) 
150 Ε[α. 


12.3.2.3 Sobolev Spaces 


Let 2 C R" be a bounded domain, i.e., an open connected set, with a sufficiently smooth boundary 
OQ. For n = 1 or n = 2,3 one can imagine 2 being something similar to an interval (a, b) or a bounded 
convex set. 

A function f: Q — R is k-times continuously differentiable on the closed domain 2 if f is k-times 
continuously differentiable on Q and each of its partial derivatives has a finite limit on the boundary, 
i.e., if x approaches an arbitrary point of OQ. In other words, all partial derivatives can be continuously 
extended on the boundary of Ω, i.e., each partial derivative is a continuous function on Ω. In this vector 
space (for p € [1,00)) and with the Lebesgue measure λ in R” (see example C in 12.9.1, 2., p. 695) the 
following norm is defined: 

1 


lfl = If =| foa 2. (12.90 


8 1lelkz 


The resulting normed space is denoted by W*? (Q) or also by WE(Q) (in contrast to the space C? ({a, b] 


which has a quite different norm). Here a means a multi-indez, i.e., an ordered n-tuple (04,...,04) ο 
non-negative integers, where the sum of the components of o is denoted by |a| = αι + az +++: + an. 
For a function f(x) = f(&,...,) with x = (&,...,&,) € © the brief notation is used as in (12.90): 
alal 
Df = f (12.91 


DUREE 


The normed space W*?(Q) is not complete. Its completion is denoted by W*?(Q) or in the case o 


p = 2 by H*(Q) and it is called a Sobolev space. 


12.3.3 Ordered Normed Spaces 


1. Conesin a Normed Space 

Let X be a real normed space with the norm || - ||. A cone X, C X (see 12.1.4.2, p. 657) is called solid, 
if X, contains a ball (with positive radius), or equivalently, X contains at least one interior point. 

E The usual cones are solid in the spaces R, C([a, ὑ]), c, but in the spaces L?((a, b)) and P (1 € p < oo) 
they are not solid. 

A cone X, is called normal if the norm in X is semi-monotonic, i.e., there exists a constant M > 0 such 
that 
Ozzy = lll < Ml. (12.92) 
If X is a Banach space ordered by a cone X,, then every (o)-interval is bounded with respect to the 
norm if and only if the cone X, is normal. 

W The cones of the vectors with non-negative components and of the non-negative functions in the 
spaces IR", m, c, co, C, IP and L”, respectively, are normal. 


A cone is called regular if every monotonically increasing sequence which is bounded above, 
χι < ag S00 Say LLa (12.93) 


672 12. Functional Analysis 


is a Cauchy sequence in X. In a Banach space every closed regular cone is normal. 
W The cones in R”, IP and L” for 1 € p < co are regular, but in C and m they are not. 
2. Normed Vector Lattices and Banach Lattices 


Let X be a vector lattice, which is a normed space at the same time. X is called a normed lattice or 
normed vector lattice (see [12.15], [12.19], [12.22], [12.23]), if the norm satisfies the condition 


|z| < ly] implies ||| € μι Vz.y € X (monotonicity of the norm). (12.94) 
Α complete (with respect to the norm) normed lattice is called a Banach lattice. 
E The spaces C([a,b]), LP, P, B([a,b]) are Banach lattices. 


12.3.4 Normed Algebras 


A vector space X over F is called an algebra, if in addition to the operations defined in the vector space 
X and satisfying the axioms (V1)-(V8) (see 12.1.1, p. 654), a product x-y € X is defined for every two 
elements x,y € X (or with a simplified notation by a product zy), such that for arbitrary x,y,z € X 
and a € F the following conditions are satisfied: 


(A1) 2(yz) = (αι). (12.95 
(A2) α(υ-α) = ty + 22, (12.96 
(A3) (xz+y)z yz, (12.97 
(A4) a(ay) = (ax)y = x(ay). (12.98 


An algebra is commutative if xy = yx holds for two arbitrary elements x,y. A linear operator (see 
(12.21), p. 658) T: X — Y of the algebra X into the algebra Y is called an algebra homomorphism i 
for any 11, 20 € X: 

T(zi: x2) = Τσι «πο. (12.99 
An algebra X is called a normed algebra or a Banach algebra if it is a normed vector space or a Banach 
space and the norm has the additional property 

læ -yll < [ο] llull- (12.100 
In a normed algebra all the operations are continuous, i.e., additionally to (12.85), if sn —> x and 
Un — y, then also z,y, — xy (see [12.20]). 
Every normed algebra can be completed to a Banach algebra, where the product is extended to the 
norm completion with respect to (12.100). 


Β A: C([a,b]) with the norm (12.89e) and the usual (pointwise) product of continuous functions. 


E B: The vector space W (10. 27]) of all complex-valued functions x(t) continuous on (0, 27] and having 
an absolutely convergent Fourier series expansion, i.e., 


ὅς 
s= >> ee", (12.101) 
n-—-—oo 
with the norm |[z|| = » |c,| and the usual multiplication. 
n=—oo 


WB C: The space L(X) of all bounded linear operators on the normed space X with the operator norm 
and the usual algebraic operations (see 12.5.1.2, p. 677), where the product T S of two operators is 


defined as the sequential application, i.e., TS(x) = T(S(x)), x € X. 
Bl D: The space L! (—oo, oc) of all measurable and absolutely integrable functions on the real axis (see 
12.9, p. 693) with the norm 

lll =| |x(t)| dt (12.102) 


oo 
is a Banach algebra if the multiplication is defined as the convolution (x * y)(t) = / x(t — s)y(s) ds. 
«ος 
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12.4 Hilbert Spaces 


12.4.1 Notion ofa Hilbert Space 
12.4.1.1 Scalar Product 


A vector space V over a field F (mostly F = C) is called a space with scalar product or an inner product 
space or pre-Hilbert space if to every pair of elements x,y € V there is assigned a number (x, y) € F (the 
scalar product of x and y), such that the axioms of the scalar product are satisfied, i.e., for arbitrary 
x,y,z E VandacF: 


(H1) (2,2) 2 0, (1.6., (x, Ὁ) is real), and (x, x) = 0 if and only if x = 0, (12.103) 
(H2) (az, y) = atr, y), (12.104) 
(H3) (x +y,z) = (v, z) + (y, 2). (12.105) 
(H4) (x,y) = (y, x). (12.106) 


(Here w denotes the conjugate of the complex number w, which is denoted by w* in (1.133c). Sometimes 
the notation of a scalar product is (x, y).) 

In the case of F = R, i.e., in a real vector space, (H4) means the commutativity of the scalar product. 
Some further properties follow from the axioms: 


(x,ay) =a(a,y) and (z,y+ 5) -- (x,y) + (v, z). (12.107) 


12.4.1.2 Unitary Spaces and Some of their Properties 


In a pre-Hilbert space H a norm can be introduced by means of the scalar product as follows: 


tll = (@2) (e). (12.108) 


A normed space H = (H, ||- ||) is called unitary if there is a scalar product satisfying (12.108). Based on 
the previous properties of scalar products and (12.108) in unitary spaces the following facts are valid: 
a) Triangle Inequality: 

|z + yll < [|] + llyll- (12.109 
b) Cauchy-Schwarz Inequality or Schwarz-Buniakowski Inequality (see also 1.4.2.9, p. 31): 


(yl € y (v x) y (v. v) - (12.110 


c) Parallelogram Identity: This characterizes the unitary spaces among the normed spaces: 


le vl + lle — yl? = 2 (hl + Ill). (211 
d) Continuity of the Scalar Product: 
In >T, Y >y imply (Tn, Yn) — (9, y). (12.112 


12.4.1.3 Hilbert Space 
A complete unitary space is called a Hilbert space. Since Hilbert spaces are also Banach spaces, they 
possess their properties (see 12.3.1, p. 669; 12.3.1.2, p. 670; 12.3.2, p. 670). In addition they have the 
properties of unitary spaces 12.4.1.2, p. 673. A subspace of a Hilbert space is a closed linear subspace. 
ΒΔ: (n), P and L?((a,b)) with the scalar products 

b p 


(y) =Y ET (ey) =X Gm and (x,y) = / x(t)y(t) dt. (12.113) 
ki k=l d 
E B: The space H?(Q) with the scalar product 


(f.g)— ποσο dr M D? f (z) De g(x) dz. (12.114) 


EN 
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E C: Let y(t) be a measurable positive function on [a,b]. The complex space L?((a, b), p) of all mea- 
surable functions, which are quadratically integrable with the weight function y on (a, b), is a Hilbert 
space if the scalar product is defined as 


(x,y) = Í " e(t) Dye (t) dt. (12.115) 


12.4.2 Orthogonality 
Two elements x, y of a Hilbert space H are called orthogonal (denoted by x L y) if (x, y) = 0 (the notions 
of this paragraph also make sense in pre-Hilbert spaces and in unitary spaces). For an arbitrary subset 
A C H, the set 

A+ ={r EH: (x,y) =0 Vye A} (12.116) 
of all vectors which are orthogonal to each vector in A is a (closed linear) subspace of H and it is called 
the orthogonal space to A or the orthogonal complement of A. The notation A L B means that (x,y) = 0 
for alla € A and y € B. If A consists of a single element x, then the notation x L B is used. 
12.4.2.1 Properties of Orthogonality 
The zero vector is orthogonal to every vector of H. The following statements hold: 
a)x Lyanda L zimply z L (ay + 8z) for any a, 8 € C. 
b) From x L yn and y, > y it follows that x L y. 


c) « L Aif and only if x L lin(A), where lin(A) denotes the closed linear hull of the set A. 
d) Ifa L A and A is a fundamental set, i.e., lin(A) is everywhere dense in H, then x = 0. 


e) Pythagoras Theorem: If the elements z4,..., 2, are pairwise orthogonal, that is x; L 2; for all 
k Al, then 


[ΣΟ ell? Σαν. (12.117) 
k=1 k=1 


f) Projection Theorem: If Ho is a subspace of H, then each vector x € H can be written uniquely 
as 


z—z +r", z'c Heo, z” 1 Ho. (12.118) 
g) Approximation Problem: Furthermore, the equation ||z'|| = p(x, Ho) = infycu, (||: — y||) holds, 
and so the problem 

lc — y|| + inf, y € Ho (12.119) 


has the unique solution x’ in Hy. In this statement Ho can be replaced by a convex closed non-empty 
subset of H. 
The element 2’ is called the projection of the element x on Ho. It has the smallest distance from x (to 


Ho), and the space H can be decomposed: H = Ho & Hj. 
12.4.2.2 Orthogonal Systems 


A set {σε: € € E} of vectors from H is called an orthogonal system if it does not contain the zero vector 
and ze L £y, E η, hence (σε, £y) = den holds, where 


1 for =n 
= x 9 
ben { nw (12.120) 
denotes the Kronecker symbol (see 4.1.2, 10., p. 271). An orthogonal system is called orthonormal if 
in addition ||x¢|] = 1 V €. 

In a separable Hilbert space an orthogonal system may contain at most countably many elements. 
Therefore = = IN is assumed from now on. 
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ΒΑ: The system 
1 1 1 1 1 
j cos {, sin t, cos 2t, —— sin 2t, ... 12.121 
Jin yT Cm T yn ( ) 
in the real space L?((—7,7)) and the system 
1 

ν2π 
in the complex space L?((—7, π)) are orthonormal systems. Both of these systems are called trigono- 
metric. 
W B: The Legendre polynomials of the first kind (see 9.1.2.6, 2., p. 566) 
απ 
dt” 
form an orthogonal system of elements in the space L?((—1, 1)). The corresponding orthonormal system 


P,(t) = \/n+ ; i P,(t). (12.124) 


B C: The Hermite polynomials (see 9.1.2.6, 6., p. 568 and 9.2.4, 3., 602) according to the second 
definition of the Hermite differential equation (9.66b) 


i uu (n = 0,1,...) (12.125) 
dt^ 


form an orthogonal system in the space L?((—oo, οο)). 

B D: The Laguerre polynomials form an orthogonal system (see 9.1.2.6, 5., p. 568) in the space 
L^((0, οο)). 

Every orthogonal system is linearly independent, since the zero vector was excluded. Conversely, if 
T4,12,... T4,...1s8 a system of linearly independent elements in a Hilbert space H, then there exist vec- 
tors €1, 62, ..., En, - - ., obtained by the Gram-Schmidt orthogonalization method (see 4.6.2.2, 1., p. 316) 
which form an orthonormal system. They span the same subspace, and by the method they are deter- 
mined up to a scalar factor with modulus 1. 


ο" (n =0,+1,+2,...) (12.122) 


Ρ.{) [0.1] (n=0,1,...) (12.123) 


H,,(t) = οἱ 


12.4.3 Fourier Series in Hilbert Spaces 
12.4.3.1 Best Approximation 


Let H be a separable Hilbert space and 
(e: n=1,2,...} (12.126) 


a fixed orthonormal system in H. For an element x € H the numbers cp = (x, ἐπ) are called the Fourier 
coefficients of x with respect to the system (12.126). The (formal) series 


Σ Gren (12.127) 


n=1 
is called the Fourier series of the element x with respect to the system (12.126) (see 7.4.1.1, 1., p. 474). 
The n-th partial sum of the Fourier series of an element x has the property of the best approximation, 
i.e., for fixed n, the n-th partial sum of the Fourier series 


n 


gem Σα, €¢ ek (12.128) 


k=1 
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n 
gives the smallest value of |x — X a,xe,|| among all vectors of H, = lin({e1,...,en}). Furthermore, 
Ea 


1 — σῃ is orthogonal to H,, and there holds the Bessel inequality: 
oo 
Me < |], cn =(z,en) (π--1,3,...). (12.129) 
n=l 

12.4.3.2 Parseval Equation, Riesz-Fischer Theorem 

The Fourier series of an arbitrary element x € H is always convergent. Its sum is the projection of 


the element x onto the subspace Hp = lin({e,}%,). If an element x € H has the representation 
co 

g= Qnen, then an are the Fourier coefficients of x (n = 1,2,...). If (o €, is an arbitrary 
n=1 


co 
sequence of numbers with the property »; |a,|? < oo, then there is a unique element z in H, whose 
n=1 


Fourier coefficients are equal to a, and for which the Parseval equation holds: 


Y. ls. eJI* - Y 


n=1 n=1 


? = |ie]? (Riesz—Fischer theorem). (12.130) 


απ 


An orthonormal system (e, ) in H is called complete if there is no non-zero vector y orthogonal to every 


en; it is called a basis if every vector x € H has the representation 2 = D Anen, 1.6., An = (2, ἐπ) and x 
n=1 

is equal to the sum of its Fourier series. In this case, one also says that x has a Fourier expansion. The 

following statements are equivalent: 

a) {en} is a fundamental set in H. 

b) {en} is complete in H. 

c) {en} is a basis in H. 

d) For V x,y € H with the corresponding Fourier coefficients c, and d, (n = 1,2,...) there holds 


(x,y) = YO eds. (12.131) 
n=1 
e) For every vector x € H, the Parseval equation (12.130) holds. 
Bl A: The trigonometric system (12.121) is a basis in the space L?((—7, π)). 


E B: The system of the normalized Legendre polynomials (12.124) P,(t) (n = 0,1,...) is complete 
and consequently a basis in the space L?((—1, 1)). 


12.4.4 Existence of a Basis, Isomorphic Hilbert Spaces 


In every separable Hilbert space there exits a basis. From this fact it follows that every orthonormal 
system can be completed to a basis. 

Two Hilbert spaces H; and Hy are called isometric or isomorphic as Hilbert spaces if there is a linear 
bijective mapping T: Hı — H» with the property (Τα, T'y)g, = (x,y)n, (that is, it preserves the 
scalar product and because of (12.108) also the norm). Any two arbitrary infinite dimensional separable 
Hilbert spaces are isometric, in particular every such space is isometric to the separable space 1”. 
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12.5 Continuous Linear Operators and Functionals 


12.5.1 Boundedness, Norm and Continuity of Linear Operators 


12.5.1.1 Boundedness and the Norm of Linear Operators 
Let X = (X, || - ||) and Y = (Y, ||- ||) be normed spaces. In the following discussion the index X in 
the notation || - ||% , which emphasizes being in the space X , is omitted, because from the text it will 
be always clear, which norms and spaces are considered. An arbitrary operator T: X —9 Y is called 
bounded if there is a real number A > 0 such that 

|T(x)|| < Alla] vx eX. (12.132) 
A bounded operator with a constant A “stretches” every vector at most A times and it transforms every 
bounded set of X into a bounded set of Y, in particular the image of the unit ball of X is bounded in Y. 
This last property is characteristic of bounded linear operators. A linear operator is continuous (see 
12.2.3, p. 668) if and only if it is bounded. 
The smallest constant A, for which (12.132) still holds, is called the norm of the operator T and it is 
denoted by ||T ||, i.e., 

ITI := inf(A > 0 : [[Tz||] € Al|z]|, z € X). (12.133 


For a continuous linear operator the following equalities hold: 


ΙΤ] = sup Τα] = sup |[Tx|| = sup Τα] (12.134 
σι llt τι 
and, furthermore, the following estimation holds 
[Τι| < [τί η Va eX. (12.135 
W Let T be the operator in the space C({a, b]) with the norm (12.89e), defined by the integral 
(Tx)(s) = y(s) = Γκο.θυ( dt (s€ [a,b]), (12.136 


where K (s, t) is a (complex-valued) continuous function on the rectangle {a < s,t € b}. Then T isa 
bounded linear operator, which maps C(|a, b]) into C([a, b]). Its norm is 


b 
ITI = max [ |K(s,t)| dt. (12.137) 


12.5.1.2 The Space of Linear Continuous Operators 
The sum U = S + T and the multiple aT of two linear (continuous) operators S, T: X —> Y are 
defined point-wise: 

σα) = S(a)+T(a), (aT)(r)—o-T(r), Vr € X and VacF. (12.138) 
The set L(X, Y), often denoted by B(X, Y), of all linear continuous operators T' from X into Y equipped 
with the operations (12.138) is a vector space, where ||T'|| (12.133) turns out to be a norm on it. So, 
L(X. Y) is a normed space and even a Banach space if Y is a Banach space. So the axioms (V1)-(V8) 
and (IN1)-(N3) are satisfied (see 12.1.1, p. 654, 12.3.1, p. 669). 
If Y = X, then a product can be defined for two arbitrary elements S, T € L(X, X) = L(X) = B(X) as 

(ST)(x)- S(Tx) (Vr €X), (12.139) 
which satisfies the axioms (A1)-(A4) from 12.3.4, p. 672, and also the compatibility condition (12.100) 
with the norm. L(X) isin general a non-commutative normed algebra, and if X is a Banach space, then 
it is a Banach algebra. Then for every operator T € L(X) its powers are defined by 

T? =I, T"2T"T (n=1,2,...), (12.140) 
where J is the identity operator Ix = x, Vr € X. Then 

yr sr (α-ο,1....), (12.141) 
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and furthermore there always exists the (finite) limit 


r(T)-— lim ý 


noo 


Τη, (12.142) 


which is called the spectral radius of the operator T and satisfies the relations 
r(T) «ΤΙ. r(T*-—[r(D)", r(aT) = Jalr(T), r(T) = r(T*), (12.143) 
where T* is the adjoint operator to T (see 12.6, p. 684, and (12.159)). If L(X) is complete, then for 
|A| > r(T), the operator (AJ — Τ) 1 has the representation in the form of a Neumann series 
AEST) SA TEN Ter tA T 4. su (12.144) 
which is convergent for |A| > τ(Τ) in the operator norm on L(X). 


12.5.1.3 Convergence of Operator Sequences 


1. Point-wise Convergence 
of a sequence of linear continuous operators Th: X — Y to an operator T: X —+ Y means that: 
Tax — Tx inYforeachz € X. (12.145) 


2. Uniform Convergence 
The usual norm-convergence of a sequence of operators {1η} 1 in a space L(X, Y) to T, i.e., 
||T, — T|| = sup |T;z — Tel — 0 (n oo) (12.146) 
lc] et 
is the uniform convergence on the unit ball of X. It implies point-wise convergence, while the converse 
statement is not true in general. 
3. Applications 


'The convergence of quadrature formulas when the number n of interpolation nodes tends to oo, the 
performance principle of summation, limiting methods, etc. 


12.5.2 Linear Continuous Operators in Banach Spaces 
Now X and Y are supposed to be Banach spaces. 
1. Banach-Steinhaus Theorem (Uniform Boundedness Principle) 
The theorem characterizes the point-wise convergence of a sequence {Τη} of linear continuous operators 
Tn to some linear continuous operator by the conditions: 
a) For every element from an everywhere dense subset D C X, the sequence {T x} has a limit in Y, 
b) there is a constant C such that [Τμ] € C, V n. 
2. Open Mappings Theorem 
The theorem tells us that a linear continuous operator mapping from X onto Y is open, i.e., the image 
T(G) of every open set G from X is an open set in Y. 
3. Closed Graph Theorem 
An operator T: Dr — Y with Dr C X is called closed if x, € Dr, £n > roin X and Tr, — yo in Y 
imply zo € Dr and yo = Txo. A necessary and sufficient condition is that the graph of the operator T 
in the space X x Y, i.e., the set 

Pp = ία. Tx): x € Dr} (12.147) 
is closed, where here (x, y) denotes an element of the set X x Y. 
If T is a closed operator with a closed domain Dr, then T is continuous. 
4. Hellinger-Toeplitz Theorem 
Let T be a linear operator in a Hilbert space H. If (x, T'y) = (Τα, y) for every x,y € H, then T is 
continuous (here (x, Ty) denotes the scalar product in H). 
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5. Krein-Losanovskij Theorem on the Continuity of Positive Linear 

Operators 
If X = (X, X4, ||- |) and Y = (¥,Y4,|| - ||) are ordered normed spaces, where X. is a generating 
cone, then the set L,(X, Y) of all positive linear and continuous operators T, i.e., T(X,) C Y}, isa 
cone in L(X, Y). The theorem of Krein and Losanovskij asserts (see [12.17]): If X and Y are ordered 
Banach spaces with closed cones X and Y, and X is a generating cone, then the positivity of a linear 
operator implies its continuity. 
6. Inverse Operator 
Let X and Y be arbitrary normed spaces and let T: X — Y be a linear, not necessarily continuous 
operator. T has a continuous inverse T^! : Y — X, if T(X) = Y and there exists a constant m > 0 


general than that in (12.22) (see 12.1.5.2, p. 659), since there may be D  Χ and R(T) # Y. 

In the case of Banach spaces X, Y the following theorem is valid: 

7. Banach Theorem on the Continuity of the Inverse Operator 

If T is a linear continuous bijective operator from X onto Y, then the inverse operator Τ 1 is also 
continuous. 


1 
such that [Τα] > m||x|| for each x € X. Then |T!|| € —. The situation considered here is less 
m 


An important application is, e.g., the continuity of (AJ — T)~! given the injectivity and surjectivity of 
AI — T. This fact has importance in investigating the spectrum of an operator (see 12.5.3.2, p. 680). 
It also applies to the 

8. Continuous Dependence of the Solution 

on the right-hand side and also on the initial data of initial value problems for linear differential equa- 
tions. This fact is demonstrated by the following example. 


W The initial value problem 


(t) +p (Hilt) + po(t)a(t) = q(t), t€ [a,b], zlto) -- €, tlto) =€, to € τα, t] (12.1482) 
with coefficients p(t), p2(t) € C({a, b]) has exactly one solution x from C?((a, ὑ]) for every right-hand 


side q(t) € C([a, b]) and for every pair of numbers €, £. The solution x depends continuously on q(t), € 
and £ in the following sense. If gn(t) € C([a,0]),£,, ἐν € IR! are given and z, € C({a, b]) denotes the 
solution of 
Ën(t) + pi(t)t, (t) + po(t)an(t) = qu(t), Enla) = £n, tnla) = En, (12.148b) 
for n = 1,2,..., then: 
dn(t) — q(t) in C([a, δ), 
En > €, implies z,, — x in the space C"([a, b]). (12.148c) 
fn 6 


9. Method of Successive Approximation 
to solve an equation of the form 
z—Tr-y (12.149) 
with a continuous linear operator T in a Banach space X for a given y. This method starts with an 
arbitrary initial element xo, and constructs a sequence (c, } of approximating solutions by the formula 
Zn ytTr, (n=0,1,...). (12.150) 
This sequence converges to the solution z* in X of (12.149). The convergence of the method, i.e., 
Zn — 2, is based on the convergence of the series (12.144) with A = 1. 
Let ||T|| € q 1. Then the following statements are valid: 


1 
a) The operator J — T has a continuous inverse with ||(Z — T) !|| € τ and (12.149) has exactly 


one solution for each y. 
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b) The series (12.144) converges and its sum is the operator (J — T) !. 
c) The method (12.150) converges to the unique solution α of (12.149) for any initial element zo, if 
the series (12.144) converges. Then the following estimation holds: 


q" 
len — z*|| € m στο xol (ως 1,2,...). (12.151) 
Equations of the type 
a-pTc=y, Avr—Tr—y, wrAECF (12.152) 


can be handled in an analogous way (see 11.2.2, p. 625, and [12.8]). 


12.5.3 Elements of the Spectral Theory of Linear Operators 
12.5.3.1 Resolvent Set and the Resolvent of an Operator 


For an investigation of the solvability of equations one tries to rewrite the problem in the form 

(I—-T)v—y (12.153) 
with some operator 7' having a possible small norm. This is especially convenient for using a functional 
analytic method because of (12.143) and (12.144). In order to handle large values of ||T'|| as well, it is 
necessary to investigate the whole family of equations 

(AI—-T)vr—-y «eX, with Ac C (12.154) 
in a complex Banach space X. Let T be a linear, but in general not a bounded operator in a Banach 
space X. The set o(T) of all complex numbers A such that (AJ — T)! € B(X) = L(X) is called the 
resolvent set and the operator Ry = R(T) = (AI — T) ! is called the resolvent. Let T now be a 
bounded linear operator in a complex Banach space X. Then the following statements are valid: 
a) The set o(T) is rie More precisely, if Ao € o(T) and λ € C satisfy the inequality 


à dd < (12.155) 
i WT 

then Ry exists and 

Ry = Ry + (A — Ao) R2, + (A — A0)? R3, uim Y D - ΑΕΕ. (12.156) 

k=1 
b) (4€ C: |A| ΣΙΤΙ} C o(T). More precisely, Y À € € with |A| > ||T||, the operator Ry exists and 
iT T 

a rin (12.157) 

c) | & — Ra l| — 0, if A — ào (A, ào € o(T)), and | f] 30, if A co (A € οἱτ)) 
a |5Ξ — Εξ || — 0, fA > λο- 


e) Foran MN functional f € X* (see 12.5.4.1, p. 681) and arbitrary x € X the function F(A) = 
f(Ry(x)) is holomorphic on o(T). 

f) For arbitrary A, µε o(T), and À Z p one has: 
Πλ - R; 


RaR, = RyRy = A 


(12.158) 


12.5.3.2 Spectrum of an Operator 


1. Definition of the Spectrum 

The set σ(Τ) = ΟΝ o(T) is called the spectrum of the operator T. Since J — T has a continuous inverse 
(and consequently (12.153) has a solution, which continuously depends on the right-hand side) if and 
only if 1 € o(T), the spectrum σ(Τ) must be known as well as possible. From the properties of the 
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resolvent set it follows immediately that the spectrum o(T) is a closed set of C which lies in the disk 
(A € €: JA] € ||T||}, however, in many cases o(T) is much smaller than this disk. The spectrum of any 
linear continuous operator on a complex Banach space is never empty and 

r(T) = sup |. (12.159) 

λεσ(Τ) 

It is possible to say more about the spectrum in the cases of different special classes of operators. 
If T is an operator in a finite dimensional space X and if the equation (AJ — T)x = 0 has only the trivial 
solution (i.e., AJ — T is injective), then \ € o(T) (i.e., AZ — T is surjective). If this equation has a 
non-trivial solution in some Banach space, then the operator \J — T is not injective and (AJ — T)! is 
in general not defined. 
The number A € C is called an eigenvalue of the linear operator T, if the equation Ax = Tx has a non- 
trivial solution. All those solutions are called eigenvectors, or in the case when X is a function space 
(which occurs very often in applications), they are called eigenfunctions of the operator T associated 
to A. The subspace spanned by them is called the eigenspace (or characteristic space) associated to A. 
The set σρ(Τ) of all eigenvalues of T is called the point spectrum of the operator T. 
2. Comparison to Linear Algebra, Residual Spectrum 
An essential difference between the finite dimensional case which is considered in linear algebra and the 
infinite dimensional case discussed in functional analysis is that in the first case σ(Τ) = e; (T) always 
holds, while in the second case the spectrum usually also contains points which are not eigenvalues of 
T. If AI — T is injective and surjective as well, then λ € o(T) due to the theorem on the continuity 
of the inverse (see 12.5.2, Τ., p. 679). In contrast to the finite dimensional case where the surjectivity 
follows automatically from the injectivity, the infinite dimensional case has to be dealt with in a very 
different way. 
The set σ.(Τ) of all A € σ(Τ). for which AJ — T is injective and Γπι(λΙ — T) is dense in X, is called 
the continuous spectrum and the set c, (T) of all A with an injective AJ — T and a non-dense image, is 
called the residual spectrum of operator T. 
For a bounded linear operator T' in a complex Banach space X 

σ(Τ) = σν(1)) Ue .(T) U o (T), (12.160) 


where the terms of the right-hand side are mutually disjoint. 


12.5.4 Continuous Linear Functionals 


12.5.4.1 Definition 


For Y — FF a linear mapping is called a linear functional or a linear form. In the following discussions, 
for a Hilbert space the complex case is considered; in other situations almost every times the real case 
is considered. The Banach space L(X, F) of all continuous linear functionals is called the adjoint space 
or the dual space of X and it is denoted by X* (sometimes also by X’). The value (in F) of a linear 
continuous functional f € X* on an element x € X is denoted by f (x), often also by (x, f) - emphasizing 
the bilinear relation of X and X* — (compare also with the Riesz theorem (see 12.5.4.2, p. 682). 


Wl A: Let t),t2,...,tn be fixed points of the interval [a, b] and c,,c2,...,¢, real numbers. By the 
formula 


f(a) = Y ash) (12.161) 


n 
a linear continuous functional is defined on the space C([a, ὑ]}; the norm of f is || f|] = X οι]. A special 
k=1 


case of (12.161) for a fixed t € [a, b] is the 6 functional 
lx) = x(t) (x € C([a,b])). (12.162) 
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E B: With an integrable function y(t) (see 12.9.3.1, p. 696) on [α, b] 


f(a) = [ 9s) dt (12.163) 


is a linear continuous functional on C([a, b]) and also on B([a, b]) in each case with the norm ||f|| = 
b 
f Old 


12.5.4.2 Continuous Linear Functionals in Hilbert Spaces. 
Riesz Representation Theorem 

In a Hilbert space H equipped with the scalar product (-,-) every element y € H defines a linear 
continuous functional by the formula f(x) = (x, y), where its norm is || f|| = ||y||. Conversely, if f is a 
linear continuous functional on H, then there exists a unique element y € H such that 

f(x) =(a,y) VaeH, (12.164) 
where | 
tified. 


fll = [νυ]. According to this theorem the spaces H and H* are isomorphic and might be iden- 


The Riesz representation theorem contains a hint on how to introduce the notion of orthogonality in 
an arbitrary normed space. Let Ας X and A* C X*. The sets 


At-[feX:f(r)-0 Vee A} and A*+={x EX: f(z) =0 νεα} (12.165) 


are called the orthogonal complement or the annulator of A and A*, respectively. 
12.5.4.3 Continuous Linear Functionals in L? TE 
Let p > 1. The number q is called the conjugate exponent to pif — + — = 1, where it is assumed that 
P q 
q = œ in the case of p = 1. 
W Based on the Holder integral inequality (see 1.4.2.12, p. 32) the functional (12.163) can be considered 
d d 
also in the spaces L?({a, b]) (1 € p € oo) (see 12.9.4, p. 697) if y € L4([a, 0]) and — + — = 1. Its norm 
P q 
is then . 
b q 
(/ ΠΠ} , if l«p&oo 
fll = lell = 9 We 


ess. sup |y(t)|, if p=1 
L€ [a,b] 


(12.166) 


(with respect to the definition of ess. sup |p| see (12.221), p. 698). To every linear continuous functional 
f in the space L?([a, b]) there is a uniquely (up to its equivalence class) defined element y € L^(a, b]) 
such that 


Π 


b b 
Λο) = (2,9) = [αθνδὰ, ze" and fl =lylle= (/ vora) l (12.167) 
For the case of p = oo see [12.15]. 


12.5.5 Extension ofa Linear Functional 


1. Semi-Norm 

A mapping p: X — R of a vector space X is called a semi-norm or pseudonorm, if it has the following 
properties: 

(HN1) p(x) 20, (12.168) 
(HN2) plaz) = |a|p(x), (12.169) 
(HN3) p(x +y) € p(x) + ply). (12.170) 
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Comparison with 12.3.1, p. 669, shows that a semi-norm is a norm if and only if p(x) = 0 holds only 
for x — 0. 

Both for theoretical mathematical questions and for practical reasons in applications of mathematics, 
the problem of the extension of a linear functional given on a linear subspace Χο C X to the entire 
space (and, in order to avoid trivial and uninteresting cases) with preserving certain * good" properties 
became a fundamental question. The solution of this problem is guaranteed by 

2. Analytic Form of the Hahn-Banach Extension Theorem 

Let X be a vector space over F and p a pseudonorm on X. Let Χρ be a linear (complex in the case of 
F = C and real in the case of F = R) subspace of X, and let fo be a (complex-valued in the case of 
F = C and real-valued in the case of F = R) linear functional on Χρ satisfying the relation 


[fo(z)| € p(z) Va € Xo. (12.171) 
Then there exists a linear functional f on X with the following properties: 
f(x) = (ο) Vae Xo, |f(x)|< plz) Vrex. (12.172) 


So, f is an extension of the functional [ο onto the whole space X preserving the relation (12.171). 

If Xo is a linear subspace of a normed space X and fo is a continuous linear functional on Xo, then 
p(x) = || foll- |||] is a pseudonorm on X satisfying (12.171), so the Hahn-Banach extension theorem for 
continuous linear functionals is obtained. 


Two important consequences are: 

1. For every element x 4 0 there is a functional f € X* with f(x) = |[z|| and || f|| = 1. 

2. For every linear subspace Χρ C X and πο € Xo with the positive distance d = inf,cx, ||x — xo|| > 0 
there is an f € X* such that 


f(x) =0 Vr€ Xo, f(t) =1 and | = j (12.173) 
12.5.6 Separation of Convex Sets 


1. Hyperplanes 

A linear subset L of the real vector space X, L Z X, is called a hypersubspace or hyperplane through 0 
if there exists an xo € X such that X = παρ, L). Sets of the form x + L (L a linear subset) are affine- 
linear manifolds (see 12.1.2, p. 655). If L is a hypersubspace, these manifolds are called hyperplanes. 
There exist the following close relations between hypersubspaces, hyperplanes and linear functionals: 
a) The kernel f^! (0) = (x € X: f(x) = 0} of a linear functional f on X is a hypersubspace in X, and 
for each number λ € R there exists an element αλ € X with f(x.) = A and f^! (A) = x + f^! (0). 

b) For any given hypersubspace L C X and each x ¢ L and \ Z 0 (A € R) there always exists a 
uniquely determined linear functional f on X with f^! (0) = L and f(a) = A. 

The closedness of f^! (0) in the case of a normed space X is equivalent to the continuity of the functional 
Js 

2. Geometric Form of the Hahn-Banach Extension Theorem 

Let X be a normed space, 20 € X and L a linear subspace of X. Then for every non-empty convex open 
set K which does not intersect the affine-linear manifold zo + L, there exists a closed hypersubspace H 
such that zo + L C H and HA K =f. 

3. Separation of Convex Sets 

Two subsets A, B of a real normed space X are called separated by a hyperplane if there is a functional 
f € X* such that: 


sup f(x) < ing /(y). (12.174) 


The separating hyperplane is then given by f~'(a) with a = suppea f(x), which means that the two 
sets are contained in the different half-spaces 
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Ac {«eX: f(z) <a} and B C (r€ X: f(z) > a}. (12.175) 


In Fig. 12.5b,c two cases of the separation by a hyperplane are shown. 

Their disjointness is less decisive for the separation of two sets. In fact, Fig. 12.5a shows two sets E 
and B, which are not separated although E and B are disjoint and B is convex. The convexity of both 
sets is the intrinsic property for separating them. In this case it is possible that the sets have common 
points which are contained in the hyperplane. 


a) b) 


Figure 12.5 


If A is a convex set of a normed space X with a non-empty interior Jnt(A) and B C X is a non-empty 
convex set with Int(A)M B = 0, then A and B can be separated. The hypothesis Int(A) # Ø in that 
statement cannot be dropped (see [12.3], example 4.47). A (reallinear) functional f € X* is called 
a supporting functional of the set A at the point rg € A, if there is a real number A € R such that 
f(xo) =A, and AC (x € X : f(x) € A}. f^! (A) is called the supporting hyperplane at the point zo. 

For a convex set K with a non-empty interior, there exists a supporting functional at each of its bound- 
ary points. 
Remark: The famous Kuhn-Tucker theorem (see 18.2, p. 925) which yields practical methods to de- 
termine the minimum of convex optimization problems (see [12.5]), is also based on the separation of 
convex sets. 


12.5.7 Second Adjoint Space and Reflexive Spaces 
The adjoint space X* of a normed space X is also a normed space if it is equipped with the norm || ||} = 


sup |f(x)|, so (X*)* = X** — the second adjoint space to X can also be considered. The canonical 
μα 


embedding 

J: X — X* with Ja=F,, where F,(f)= f(x) Vfex (12.176) 
is a norm isomorphism (see 12.3.1, p. 669), hence X is identified with the subset J(X) C X**. A Banach 
space X is called reflexive if J(X) = X**. Hence the canonical embedding is then a surjective norm 
isomorphism. 


W Every finite dimensional Banach space and every Hilbert space is reflexive, as well as the spaces 
LP (1 <p « oo), however C (fa, δ). L'((0, 1]). co are examples of non-reflexive spaces. 


12.6 Adjoint Operators in Normed Spaces 
12.6.1 Adjoint of a Bounded Operator 


For a given linear continuous operator T: X — Y (X,Y are normed spaces) to every g € Y* there 

is assigned a functional f € X* by f(x) = 1). V x € X. In this way, a linear continuous operator 
Τη: Y' — X*, (I*"g)z)—-g(1zy), Vg c Y* and Vx eX (12.177) 

is obtained which is called the adjoint operator of T and has the following properties: 

(TS) —T*--S*, (ST)* = S*T*, |T*|| = ΙΤ], where for the linear continuous operators T: X —^ Y 

and ο: Y + Z (X,Y,Z normed spaces), the operator ST: X — Z is defined in the natural way 
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as ST(x) = S(T(x)) (see 12.3.4, M C, p. 672). With the notation introduced in 12.1.5, p. 658, and 
12.5.4.2, p. 682, the following identities are valid for an operator T € B(X, Y): 

Im(T)- ker(T*)', Im(T*) = ker(T)*, (12.178) 
where the closedness of Im(T) implies the closedness of Im(T*). 
The operator T**: X** — Y**, obtained as (7*)* from T*, is called the second adjoint of T. Due to 
(T*(F,))g = F,(1*g) = (1*g)(x) = g(Tx) = Fra(g) the operator 1” has the following property: If 
F, € X™*, then T* F, = Fr, € Y". Hence, the operator T**: X** — Y** is an extension of T. 
In a Hilbert space H the adjoint operator can also be introduced by means of the scalar product 
(Tx,y) = (α.Τ"υ), x,y € H. This is based on the Riesz representation theorem, where the iden- 
tification of H and H** implies (AT)* = AT*, I* = I and even T** = T. If T is bijective, then the same 
holds for T*, and also (T*)~! = (T^ !)*. For the resolvents of T and T* there holds 

[f (D = R(T"), (12.179) 
from which o(T*) = (X: λε o(T)} follows for the spectrum of the adjoint operator. 
W A: Let T be an integral operator in the space L?([a,b]) (1 < p < oc) 


(Tx)(s) = [κα t)a(t) dt (12.180) 


with a continuous kernel K (s, t). The adjoint operator of T is also an integral operator, namely 


b 

(gy) = f K (t, Su) ds (12.181) 
with the kernel K*(s,t) = K(t,s), where y, is the element from L^ associated to g € (L”)* according 
to (12.167). 


W B: Ina finite dimensional complex vector space the adjoint of an operator represented by the matrix 
A = (αι) is defined by the matrix A* with aj; = aj;. 


12.6.2 Adjoint Operator of an Unbounded Operator 


Let X and Y be real normed spaces and T' a (not necessarily bounded) linear operator with a (linear) 
domain D(T) C X and values in Y. For a given g € Y*, the expression g(T'x), depending obviously 
linearly on 2, is meaningful. Now the question is: Does there exist a well-defined functional f € X* 
such that 

f(x)=g(Tx) Vx € D(T). (12.182) 
Let D* C Y* be the set of all those g € Y* for which the representation (12.182) holds for a certain 


f € X*. If D(T) = X, then for given g the functional f is uniquely defined. So a linear operator T* is 
defined by f = T*g with D(T*) = D*. Then for arbitrary x € D(T) and g € D(T*) 

g(Tx) = (T*g)(x). (12.183) 
The operator T* turns out to be closed and is called the adjoint of T. The naturalness of this general 


procedure stems from the fact that D(T*) = Y* holds if and only if T is bounded on D(T). In this case 
T* € B(Y*, X*) and ||T*|| = ΙΤ] hold. 


12.6.3 Self-Adjoint Operators 
An operator T € B(H) (H is a Hilbert space) is called self-adjoint if T* = T. In this case 


(Τα, ψ) = (x, Ty), x, y € H (12.1842) 
is valid and the number (Τα, x) is real for each x € H. Then the equality 
ΙΤ = sup |(Τ., x)| (12.184b) 


σι 
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holds and with m = m(T) = ign Un α) and M = M(T) = mate x) also the relations 
m(T)|z|? < (Tx,z) € M(T)|x|? and [ΤΙ -- r(T) = max{|m|, M} (12.185) 

are valid. The equality (12.184a) characterizes the self-adjoint operators. The spectrum ofa self-adjoint 

(bounded) operator lies in the interval [m, M] and m, M € o(T) holds. 

12.6.3.1 Positive Definite Operators 

A partial ordering can be introduced in the set of all self-adjoint operators of B(H) by defining 

T>0 ifandonlyif (Tr,v) 20 VaeH. (12.186) 

An operator T' with T' 7 0 is called positive (or, more exactly positive definite). For any self-adjoint 

operator T (with (H1) from 12.4.1.1, p. 673), (1?x,x) = (Τα, Τα) > 0, so T? is positive definite. 

Every positive definite operator T possesses a square root, i.e., there exists a unique positive definite 

operator W such that W? — T. Moreover, the vector space of all self-adjoint operators is a K-space 

(Kantorovich space, see 12.1.7.4, p. 660), where the operators 


T| -- VT?, T* = απ] +T), T- - (T| - T) (12.187) 


z| 


m 


are the corresponding elements with respect to (12.37). They are of particular importance for the 
spectral decomposition and spectral and integral representations of self-adjoint operators by means of 
some Stieltjes integral (see 8.2.3.1, 2., p. 506, and [12.1], [12.11], [12.12], [12.15], [12.18]). 


12.6.3.2 Projectors in a Hilbert Space 

Let Ho be a subspace of a Hilbert space H. Then every element x € H has its projection x’ onto Ho 
according to the projection theorem (see 12.4.2, p. 674), and therefore, an operator P with Px = 2’ is 
defined on H with values in Ho. P is called a projector onto Πρ. Obviously, P is linear, continuous, and 
||P|| = 1. A continuous linear operator P in H is a projector (onto a certain subspace) if and only if: 
a) P= P*, i.e., P is self-adjoint, and 

b) P? = P,ie., P is idempotent. 


12.7 Compact Sets and Compact Operators 
12.7.1 Compact Subsets of a Normed Space 


A subset A of a normed space! X is called 

e compact, if every sequence of elements from A contains a convergent subsequence whose limit lies in 
A, 

e relatively compact or precompact if its closure (see 12.2.1.3, p. 664) is compact, i.e., every sequence of 
elements from A contains a convergent subsequence (whose limit does not necessarily belong to A). 
'This is the Bolzano-Weierstrass theorem in real calculus for bounded sequences in IR", and one says 
that such a set has the Bolzano-Weierstrass property. 

Every compact set is closed and bounded. Conversely, if the space X is finite dimensional, then every 
such set is compact. The closed unit ball in a normed space X is compact if and only if X is finite 
dimensional. 
For some characterizations of relatively compact subsets in metric spaces (the Hausdorff theorem on 
the existence of a finite e-net) and in the spaces s, C (Arzela-Ascoli theorem) and in the spaces L^(1 < 
p « oo) see. [12.15]. 


12.7.2 Compact Operators 
12.7.2.1 Definition of Compact Operator 


An arbitrary operator T: X —> Y of a normed space X into a normed space Y is called compact if the 


Τε is enough that X is a metric (or an even more general) space. This generality is not used in what follows. 
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image Τ(Α) of every bounded set A C X is a relatively compact set in Y. If, in addition the operator 
T is also continuous, then it is called completely continuous. Every compact linear operator is bounded 
and consequently completely continuous. For a linear operator to be compact it is sufficient to require 
that it transforms the unit ball of X into a relatively compact set in Y. 


12.7.2.2 Properties of Linear Compact Operators 

A characterization by sequences of the compactness of an operator from B(X, Y) is the following: For 
every bounded sequence {,,}°2, from X the sequence {Τα} }55 contains a convergent subsequence. 
A linear combination of compact operators is also compact. If one of the operators U € B(W, X), T € 
B(X,Y), S € B(Y, Z) in each of the following products is compact, then the operators TU and ST are 
also compact. If Y is a Banach space, then the following important statements are valid. 


a) Convergence: If a sequence of compact operators {Τη }55 is convergent in the space B(X, Y), then 
its limit is a compact operator, too. 
b) Schauder Theorem: If T is a linear continuous operator, then either both T and T* are compact 


or both are not. 


c) Spectral Properties of a Compact Operator T in an (Infinite Dimensional) 

Banach Space X: The zero belongs to the spectrum. Every non-zero point of the spectrum o(T) 
is an eigenvalue with a finite dimensional eigenspace Xy = {x € X: (AI — T)x = 0}, and Ye > 0 
there is always only a finite number of eigenvalues of T outside the circle {|A| € £}, where only the zero 
can be an accumulation point of the set of eigenvalues. If A = 0 is not an eigenvalue of T, then T~! is 
unbounded if it exists. 


12.Τ.2.3 Weak Convergence of Elements 

A sequence (x, € , of elements of a normed space X is called weakly convergent to an element xo if for 
each f € X* the relation f(z,) — f(xo) holds (written as: £n — £o or as ον ^ αρ). 

Obviously: x, — 20 implies x, — πρ. If Y is another normed space and T: X — Y is a continuous 
linear operator, then: 


a) £n — 20 implies Tx, — Τα, 
b) if T is compact, then £n — ro implies Tz, — Txo. 


W A: Every finite dimensional operator is compact. From this fact it follows that the identity operator 
in an infinite dimensional space cannot be compact (see 12.7.1, p. 686). 


E B: Suppose X = I, and let T be the operator in 1 given by the infinite matrix 


tir tie tig ++: 

{91 123 t23 ντ" 5ο ES 

tgp tee with Τα = (x tiis Mtn... . (12.188) 
απ, xy cu ial πας] 


oo F : 2. D P 
If $5 |t,4|? =M < oo, then T is a compact operator from P? into P with ||T|| € M. 
kyn=1 


E C: The integral operator (12.136) is a compact operator in the spaces C([a, b]) and L?((a,b)) (1 < 
p < oo). 


12.7.3 Fredholm Alternative 
Let T be a compact linear operator in a Banach space X. The following equations (of the second kind) 
are considered with a parameter À Æ 0: 
Ar — Τα y, λα-Τα =0, 
Af-T*f= 9, Af-T*f=0. 


(12.189) 
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The following statements are valid: 

a) dim(ker(AJ—T)) = dim(ker(AJ — T*)) < +00, i.e., both homogeneous equations always have the 
same number of linearly independent solutions. 

b) Im(AI — T) = ker(AI — T*)} and? Im(Al — T*) = ker(AI — T)+. 

c) Im(AJ — T) 2 X if and only if ker(AJ — T) = 0. 

d) The Fredholm alternative (also called the Riesz-Schauder theorem): 

a) Either the homogeneous equation has only the trivial solution. In this case \ € o(T), the operator 
(AI — T) ! is bounded, and the inhomogeneous equation has exactly one solution x = (AJ — T) !y for 
arbitrary y € X. 

B) Or the homogeneous equation has at least one non-trivial solution. In this case λ is an eigenvalue of 
T, i.e., A € o(T), and the inhomogeneous equation has a (non-unique) solution if and only if the right- 
hand side y satisfies the condition f(y) = 0 for every solution f of the adjoint equation T* f = Af. 

In this last case every solution x of the inhomogeneous equation has the form x = 20 + h, where xo is 
a fixed solution of the inhomogeneous equation and A € ker(AJ — T). 


Linear equations of the form Tx = y with a compact operator T are called equations of the first kind. 
Their mathematical investigation is in general more difficult (see [12.11],[12.18]). 


12.7.4 Compact Operators in Hilbert Space 


Let T : H — H be a compact operator. Then T is the limit (in B(H)) of a sequence of finite dimen- 
sional operators. The similarity to the finite dimensional case can be seen from the statements: 


If C is a finite dimensional operator and T = I — C, then the injectivity of T implies the existence of 
T~' and T^! € B(H). 

If C is a compact operator, then the following statements are equivalent: 

a) 3T-! and it is continuous, 

b) x #0 = Tz z 0,ie., T is injective, 

c) T(H) — H, i.e., T is surjective. 


12.7.5 Compact Self- Adjoint Operators 


1. Eigenvalues 

A compact self-adjoint operator T Z 0 in a Hilbert space H possesses at least one non-zero eigenvalue. 

More precisely, T always has an eigenvalue A with |A| = ||T]|. The set of eigenvalues of T is at most 

countable. 

Any compact self-adjoint operator T has the representation T = X A; Py, (in B(H)), where A; are 
k 


1e different eigenvalues of T and P, denotes the projector onto the eigenspace Ηλ. In this case the 
perator T is diagonalizable. From this fact it follows that Tx = © λεία, ej.)e; for every x € H, where 
k 


t 
ο 
(e) is the orthonormal system of the eigenvectors of T. If A ¢ σ(Τ) and y € H, then the solution of 
t 


1e equation (AI — T) = y can be represented as x = RA(T)y = Y; 
k 


(y, ex )ek- 


X— Xy 


2. Hilbert-Schmidt Theorem 

If T is a compact self-adjoint operator in a separable Hilbert space H, then there is a basis in H con- 
sisting of the eigenvectors of T. 

The so-called spectral (mapping) theorems (see [12.8], [12.10], [12.12], [12.13], [12.18]) can be con- 
sidered as the generalization of the Hilbert-Schmidt theorem for the non-compact case of self-adjoint 
(bounded or unbounded) operators. 


*Here the orthogonality is considered in Banach spaces (see 12.5.4.2, p. 682). 
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12.8 Non-Linear Operators 

In the theory of non-linear operator equations the most important methods are based on the following 
principles: 

1. Principle of the Contracting Mapping, Banach Fixed-Point Theorem (see 12.2.2.3, p. 666, 
and 12.2.2.4, p. 666). For further modifications of this principle see [{12.8],[12.11], [12.12], [12.18]. 

2. Generalization of the Newton Method (see 18.2.5.2, p. 931 and 19.1.1.2, p. 950) for the infinite 
dimensional case. 

3. Schauder Fixed-Point Principle (see 12.8.4, p. 691) 

4. Leray-Schauder Theory (sce 12.8.5, p. 692) 

Methods based on principles 1 and 2 yield information on the existence, uniqueness, constructivity 
etc. of the solution, while methods based on principles 3 and 4, in general, allow “only” the qualitative 
statement of the existence of a solution. If further properties of operators are known then see also 


12.8.6, p. 692, and 12.8.7, p. 693. 


12.8.1 Examples of Non-Linear Operators 


For non-linear operators the relation between continuity and boundedness discussed for linear operators 
in 12.5.1, p. 677 is no longer valid in general. In studying non-linear operator equations, e.g., non- 
linear boundary value problems or integral equations, the following non-linear operators occur most 
often. Iteration methods described in 12.2.2.4, p. 666, can be successfully applied for solving non-linear 
integral equations. 
1. Nemytskij Operator 
Let Q be an open measurable subset from R” (12.9.1, p. 693) and f: Q x R — R a function of two 
variables f (zx, s), which is continuous with respect to x for almost every s and measurable with respect 
to s for every x (Caratheodory conditions). The non-linear operator M to F(N) defined as 

(N'u)(x) = f[r,u(z)) (v εΩ) (12.190) 


is called the Nemytskij operator. It is continuous and bounded if it maps L?(Q) into L*(Q), where 


— + - = 1. This is the case, e.g., if 
D q 
p 

|f(z,s)| € a(z) --b|s| with a(x) ε LQ) (b 0) (12.191) 
or if f: Q x R — R is continuous. The operator V is compact only in special cases. 
2. Hammerstein Operator 
Let €? be a relatively compact subset of R^, f a function satisfying the Caratheodory conditions and 
K (x, y) a continuous function on Q x Q. The non-linear operator H on F(Q) 


(Hu)(x) = [Ke y)fly,uy)| dy (« €Q) (12.192 


is called the Hammerstein operator. H can be written in the form H =K -NM with the Nemytskij 
operator M and the integral operator K determined by the kernel K 


(Ku)(z) = / K(z,y)u(y)dy (x €Q). (12.193 
ἃ 
If the kernel K (x, y) satisfies the additional condition 
/ κα, y)! dz dy < oo (12.194 
Ὠχω 


and the function f satisfies the condition (12.191), then H is a continuous and compact operator on 
P(Q). 
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3. Urysohn Operator 
Let Q C R” be an open measurable subset and K(x,y,s): Q x Q x R — R a function of three 
variables. Then the non-linear operator U on F(Q) 


(Uu)(x) = / K|x,y.u(y)dy (α ερ) (19.195) 


is called the Urysohn operator. If the kernel K satisfies the appropriate conditions, then 4 is a contin- 
uous and compact operator in C(Q) or in L?(Q), respectively. 


12.8.2 Differentiability of Non-Linear Operators 
Let X, Y be Banach spaces, D C X be an open set and T: D — Y. The operator T is called Fréchet 
differentiable (or, briefly, differentiable) at the point x € D ifthere exists a linear operator L € B(X, Y) 
(in general depending on the point Ὁ) such that 
T(r-4h)-— Τα) -- Lh+w(h) with ||w(Rh)|| = o(l|h]]) (12.196) 
or in an equivalent form 
is [Τ(α + h) — T(x) — Lhl| 
||] 0 liA] 


i.e., Ve > 0, 3 ô > 0, such that ||A|| < 9 implies ||T (x + h) — T(x) — Lh|| € e||h||. The operator L, 
which is usually denoted by T” (x), T'(x, -) or T'(x)(-), is called the Fréchet derivative of the operator T 
at the point x. The value dT (x; h) = T'(x)h is called the Fréchet differential of the operator T at the 
point x (for the increment h). 

The differentiability of an operator at a point implies its continuity at that point. If T € B(X, Y), i,e., 
T itself is linear and continuous, then T is differentiable at every point, and its derivative is equal to 


T 


12.8.3 Newton’s Method 
Let X, D be as in the previous paragraph and T: D —5 Y. Under the assumption of the differentiability 
of T at every point of the set D an operator T’: D — B(X, Y) is defined by assigning the element 
T'(x) € B(X, Y) to every point x € D. Suppose the operator 7" is continuous on D (in the operator 
norm); in this case T is called continuously differentiable on D. 
Suppose Y = X and also that the set D contains a solution x* of the equation 

T(x) — 0. (12.198) 
Furthermore, it is assumed that the operator T’(x) is continuously invertible for each x € D, hence 
(1'(z)| * is in B(X). Because of (12.196) for an arbitrary ry € D one conjectures that the elements 
T (xo) = T (vo) — T(x*) and T'(xo)(xo — ο") are “not far” from each other and therefore the element x1 
defined as 

21 = zo — [T’(xo)| T (20) (12.199) 
is an approximation of z* (under the given assumptions). Starting with an arbitrary xo the so-called 
Newton approximation sequence 

Tn — mà -- [D (αν) Ταν) (20.1...) (12.200) 
can be constructed. There are many theorems known from the literature discussing the behavior and 
the convergence properties of this method. Here only the following most important result is mentioned 
which demonstrates the main properties and advantages of Newton's method: 
Ve € (0,1) there exists a ball B = Βίαο;δ), ὃ = δε) in X, such that all points x, lie in B and the 
Newton sequence converges to the solution x* of (12.198). Moreover, ||, — 2o|| € &"||o — x*|| which 
yields a practical error estimation. 
The modified Newton's method is obtained if the operator [T"(g)] ! is used instead of [T (x) ! Vn = 
0, 1,... in formula (12.200). For further estimations of the speed of convergence and for the (in general 
sensitive) dependence of the method on the choice of the starting point αρ see [12.7], [12.12], [12.18]. 


0, (12.197) 
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W Jacobian or Functional Matrix Given a non-linear operator T = F: D — R” on an open set 


D C R” with m non-linear coordinate functions F}, F5,..., Fm and n independent variables 71, 15, . . . , En- 
Then 
Fi (x) 
F(x) 
F(x) = À ER” Va = (z1, %2,..., £n) E D (12.201) 
F(x) 
holds. If the partial derivatives δ (k = 1,2,...,n) of the coordinate functions F; (i = 1,2,...,m) 
Tk 
on D exist and are continuous, then the mapping (the operator) F in every point of D is differentiable 
and its derivative at the point x = (£1, 29,...,%,) € D is the linear operator F’(x) : R” — R” with 
the matrix representation 
OF\(x) OF \(x) OF, (x) 
Ox, Ox iii Om, 
OF»(x) OF x(x) OF»(x) 
Ε΄ (α) = Ox, Ox 33b δαν 2 (12.202) 
ὃβι(α) Εμ) OF.) 
ὅσι Ox» no ὅτι 


The derivative F'(x) is a matrix of the type (m,n). It is called the Jacobian or functional matrix of 
F . Special cases of occurrence are, e.g., the iterative solution of systems of non-linear equations by the 
Newton method (see 19.2.2.2, p. 962) or describing the independence of functions (see 2.18.2.6, 3., p. 
123). 

For m = n the so-called functional determinant or Jacobian determinant can be formed, which is de- 
noted shortly by 


D(S Fay sss Fr) (12.203) με 


This determinant is used for the solution of (mostly inner-mathematical) problems (see also, e.g., 


8.5.3.2, p. 539). 


12.8.4 Schauder’s Fixed-Point Theorem 


Let T : D — X bea non-linear operator defined on a subset D of a Banach space X. The non-trivial 
question of whether the equation x = T(x) has at least one solution, can be answered as follows: If 
X = Rand D = [-1,1], then every continuous function, mapping D into D, has a fixed point in D. 
If X is an arbitrary finite dimensional normed space (dimX > 2), then Browwer’s fixed-point theorem 
holds. 


1. Brouwer's Fixed-Point Theorem Let D be a non-empty closed bounded and convex subset of 
a finite dimensional normed space. If T is a continuous operator, which maps D into itself, then T has 
at least one fixed point in D. 

The answer in the case of an arbitrary infinite dimensional Banach space X is given by Schauder’s 
fixed-point theorem. 

2. Schauder's Fixed-Point Theorem Let D be a non-empty closed bounded and convex subset 
of a Banach space X. If the operator T: D —+ X is continuous and compact (hence completely 
continuous) and it maps D into itself, then T' has at least one fixed point in D. 

By using this theorem, it is proved, e.g., that the initial value problem (12.70), p. 668, always has a 
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local solution for t 7 0, if the right-hand side is assumed only to be continuous. 


12.8.5 Leray-Schauder Theory 
For the existence of solutions of the equations x = T(x) and (I--T)(x) = y with a completely continuous 
operator T, a further principle is found which is based on deep properties of the mapping degree. It can 
be successfully applied to prove the existence of a solution of non-linear boundary value problems. Here 
only those results of this theory are mentioned which are the most useful ones in practical problems, 
and for simplicity a formulation is chosen which avoids the notion of the mapping degree. 
Leray-Schauder Theorem: Let D be an open bounded set in a real Banach space X and let T': 
D:—+ X bea completely continuous operator. Let y € D be a point such that «+ AT (x) Z y for each 
x € OD and € [θ. 1], where OD denotes the boundary of the set D. Then the equation (I +T)(x) = y 
has at least one solution. 
The following version of this theorem is very useful in applications: 
Let T be a completely continuous operator in the Banach space X. If all solutions of the family of 
equations 

x= T(x) (λε (0,1) (12.204) 
are uniformly bounded, i.e., 3c > 0 such that VA and V x satisfying (12.204) the a priori estimation 
\|x|| € c holds, then the equation x = T(x) has a solution. 


12.8.6 Positive Non-Linear Operators 

The successful application of Schauder's fixed-point theorem requires the choice of a set with appropri- 
ate properties, which is mapped into itself by the considered operator. In applications, especially in the 
theory of non-linear boundary value problems, ordered normed function spaces and positive operators 
are often considered, i.e., which leave the corresponding cone invariant, or isotone increasing operators, 
Le,ifzr < y > T(x) € T(y). If confusions (see, e.g., 12.8.7, p. 693) are excluded, these operators are 
also called monotone. 
Let X = (X, ΧΑ, ||- ||) be an ordered Banach space, X} a closed cone and [a,b] an order interval of 
X. If X, is normal and T is a completely continuous (not necessarily isotone) operator that satisfies 
T([a, ϐ]) C [a,b], then T has at least one fixed point in [a, 0] (Fig. 12.6b). 


pit 


b) 


Figure 12.6 


Notice that the condition T([a,0]) C [a,b] automatically holds for any isotone increasing operator T, 
which is defined on an (o)-interval (order interval) [a, b] of the space X if it maps only the endpoints a, b 
into [α, b], i.e., when the two conditions T(a) > a and T(b) € b are satisfied. Then both sequences 
πρ a and ρει — T(x,) (n 2 0) and yo =6 and νι — T(y,) (n > 0) (12.205) 
are well defined, i.e., £n, y, € [a,b], n > 0. They are monotone increasing and decreasing, respectively, 
i.e., a = zo € zi €... € v, € ... and b = yg È y1 È ... ys 2. .... A fixed point £+, z* of the operator 
T is called minimal, maximal, respectively, if for every fixed point z of T the inequalities x, € z, z € z* 
hold, respectively. 
Now,the following statement is valid (Fig. 12.6a)): Let X be an ordered Banach space with a closed 
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cone X, and T: D — X, D C Xa continuous isotone increasing operator. Let [a,b] C D be such 
that T(a) > a and T(b) € b. Then Τ([α, b]) C [a,b], and the operator T has a fixed point in [a, b] if one 
of the following conditions is fulfilled: 

a) X, is normal and T is compact; 

b) X is regular. 

Then the sequences (x, E? and (y, o, defined in (12.205), converge to the minimal and to the 
maximal fixed points of T in |a, b], respectively. 

The notion of the super- and sub-solutions is based on these results (see [12.14]). 


12.8.7 Monotone Operators in Banach Spaces 


1. Special Properties 

An arbitrary operator T: D C X — Y (X, Y normed spaces) is called demi-continuous at the point 
xo € D if for each sequence {x,,}°, C D converging to xp (in the norm of X) the sequence (T(x,) V8 , 
converges weakly to Τ(αρ) in Y. T is called demi-continuous on the set D if T is demi-continuous at 
every point of D. 

In this paragraph another generalization of the notion of monotonicity known from real analysis are 
introduced. Let X now be a real Banach space, X* its dual, D C X and T: D —5 X* a non-linear 
operator. T is called monotone if V x, y € D the inequality (T(x) — T(y),x — y) > 0 holds. If X = H 
is a Hilbert space, then (:,-) means the scalar product, while in the case of an arbitrary Banach space 
one refers to the notation introduced in 12.5.4.1, p. 681. The operator T is called strongly monotone 
if there is a constant c > 0 such that (T(x) — T(y),x — y) 2 ella — y|? for V x,y € D. An operator 
T: X — X* is called coercive if lim EE ES 


[ο[-λοο |æ 


2. Existence Theorems 

for solutions of operator equations with monotone operators are given here only exemplarily: If the op- 
erator T, mapping the real separable Banach space X into X*, (Dr = X), is monotone demi-continuous 
and coercive, then the equation T(x) = f has a solution for arbitrary f € X". 

If in addition the operator T' is strongly monotone, then the solution is unique. In this case the inverse 
operator Τ-! also exists. 

For a monotone, demi-continuous operator T: H — H in a Hilbert space H with Dr = H, there holds 
Im(I +T) = H, where (1 4- T)! is continuous. If T is supposed to be strongly monotone, then Τ 1 is 
bijective with a continuous Τ 1. 

Constructive approximation methods for the solution of the equation T(x) = 0 with a monotone oper- 
ator T in a Hilbert space are based on the idea of Galerkin's method (see 19.4.2.2, p. 974, or [12.10], 
[12.18]). By means of this theory set-valued operators T: X — 2*' can also be handled. The notion 
of monotonicity is then generalized by (f — g,r — y) 2 0, Va,y € Dr and f € T(x),g € T(y). 


12.9 Measure and Lebesgue Integral 
12.9.1 Set Algebras and Measures 


The initial point for introducing measures is a generalization of the notion of the length of an interval in 
R, of the area, and of the volume of subsets of IR? and R®, respectively. This generalization is necessary 
in order to “measure” as many sets as possible and to “make integrable” as many functions as possible. 
For instance, the volume of an n-dimensional rectangular parallelepiped , 

Q={r ER": a, <a, <b, (k= 1,2,...,n)} hasthe value | [[(bi — ax). (12.206) 


k=1 
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1. cAlgebra or Set Algebra 
Let X be an arbitrary set. A non-empty system A of subsets from X is called a σ algebra if: 


a) Ac A implies X\ AEA and | (12.2072) 
b) Ai, A5,..., An... € A implies |) A, € A. (12.207b) 
n=1 


Every c algebra contains the sets () and X, the intersection of countably many of its sets and also the 
difference sets of any two of its sets. 

In the following R denotes the set IR of real numbers extended by the elements —oo and +00 (extended 
real line), where the algebraic operations and the order properties from R are extended to R in the 


co 

natural way. The expressions (+00) + (F00) and — are meaningless, while 0 - (+00) and 0: (--οο) are 
οο 

assigned the value 0. 


2. Measure - 
A function u: A — Ry = R, U +00, defined on ao algebra A, is called a measure if 


a) (A) 20 VAEA, (12.208a) 

b) μ(θ) — 0, (12.208b) 

C) Αι, Ao... Αμ... € A, Ag NAL =O (k zT) implies u (U ^.) = NW BWA). (12.208c) 
n=1 n=1 


The property c) is called ø additivity of the measure. If µ is a measure on A, and for the sets A, B € 
A, A C B holds, then (A) € p(B) (monotonicity). If An € A (n = 1,2,...) and Ay C A» C ©5, 


E 
then μ ( U An) = lim u(An) (continuity from below). 
n=1 noo 


Let A bea c algebra of subsets of X and jj a measure on A. The triplet X = (X, A, ju) is called a measure 
space, and the sets belonging to A are called measurable or A-measurable. 

ΒΛ: Counting Measure: Let X be a finite set (z1,29,..., £y}, A the o algebra of all subsets of X, 
and let assign a non-negative number pp to each zy (k = 1,..., N). Then the function jz defined on A 
for every set A € A, A= (xj, m4... Enp } by (A) = pa, + pus +++ pa, 18 à measure which takes 
on only finite values since μ(Χ) = pi +--+ 4- py. < oo. This measure is called the counting measure. 
E B: Dirac Measure: Let A bea c algebra of subsets of a set X and a an arbitrary given point from 
X. Then a measure (called Dirac Measure) is defined on A by 


1, if a€ A, . 
it ts fone (12.2094) 


It is called the 6 function (concentrated on a). The characteristic function or indicator function of a 
subset A C X denotes the function χα: X —> {0,1} of X on {0,1}, which has the value 1 for x € A 
and for all other x the value 0: 
f 1, ifx € A, 
χαία) = s otherwise. 


Obviously 64(A) = δι(χα) = χα(α) (see 12.5.4, p. 681), where χα denotes the characteristic function 
of the set A. 

B C: Lebesgue Measure: Let X be a metric space and B(X) the smallest σ algebra of subsets of X 
which contains all the open sets from X. B(X) exists as the intersection of all the σ algebras containing 
all the open sets, and is called the Borel σ algebra of X. Every element from Β(Χ) is called a Borel set 
(see [12.6]). 

Suppose now, X = R” (n > 1). Using an extension procedure a σ algebra and a measure on it can be 
constructed, which coincides with the volume on the set of all rectangular parallelepipeds in R". More 
precisely: There exists a uniquely defined o algebra A of subsets of R” and a uniquely defined measure 
λ on A with the following properties: 


(12.209b) 
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a) Each open set from R” belongs to A, in other words: B(R") C A. 
b) If A € A, \(A) = 0 and B C Athen B € A and \(B) = 0. 


c) If Q is a rectangular parallelepiped, then Q € A, and \(Q) = nc — a). 


d) À is translation invariant, i.e., for every vector x € IR" and every set A € Aone has x + A = {x£ +y: 
y € A} € A and A(z + A) = A(A). 

The elements of A are called Lebesgue measurable subsets of R”. A is the (n-dimensional) Lebesgue 
measure in R”. 

Remark: In measure theory and integration theory one says that a certain statement (property, or 
condition) with respect to the measure p is valid almost everywhere or p- almost everywhere on a set 
X, if the set, where the statement is not valid, has measure zero. It is denoted by a.e. or µ-α.ο.δ For 
instance, if A is the Lebesgue measure on R and A, B are two disjoint sets with R = AU B and f isa 
function on R with f(x) = 1, Vr € A and f(x) = 0, V x € B, then f = 1, A-a.e. on R if and only if 
A(B) — 0. 


12.9.2 Measurable Functions 
12.9.2.1 Measurable Function 


Let A bea c algebra of subsets of aset X. A function f: X — R is called measurable if for an arbitrary 
a E€ R the set f (ία, -οο]) = (x: x € X, f(x) > α} isin.A. 
A complex-valued function g + ih is called measurable if both functions g and h are measurable. The 
characteristic function χα of every set A € A is measurable, because 

= A, if a € (—oo, 1), 

Xa ((a, +00]) = { eee ET (12.210) 
is valid (see Dirac measure, p. 694). If.A is the σ algebra of the Lebesgue measurable sets of IR" and 
f: R” — Risa continuous function, then the set f~!((a, ἠ-οοἱ) = f~!((a, +00)), according to 12.2.3, 
p. 668, is open for every a € R, hence f is measurable. 
12.9.2.2 Properties of the Class of Measurable Functions 


The notion of measurable functions requires no measure but a c algebra. Let A be a ø algebra of sub- 
sets of the set X and let f, g, fa : X — R be measurable functions. Then the following functions (see 
12.1.7.4, p. 660) are also measurable: 


a) af for every a € R; f +g; 
b) fi, J- Ifl, f Vg and f ^ g; 
c) f + g, if there is no point from X where the expression (+00) + ( 
d) sup fn, inf fn, limsup fn (= lim sup fx), liminf fn; 
n> k>n 


oo) occurs; 


e) the point-wise limit lim fn, in case it exists; 
f) if f 2 0and p € R,p > 0, the f? is measurable. 
A function f: X — Ris called elementary or simple if there is a finite number of pairwise disjoint sets 


n 
A1,..., An € A and real numbers o5,...,0; such that f = X ακχε. where χι denotes the character- 
k=1 


istic function of the set Αμ. Since each characteristic function of a measurable set is measurable (see 
(12.210)), so every elementary function is measurable. It is interesting that each measurable function 
can be approximated arbitrarily well by elementary functions: For each measurable function f > 0 
there exists a monotone increasing sequence of non-negative elementary functions, which converges 
point-wise to f. 


Here and in the following parts “a.e.” is an abbreviation for “almost everywhere". 
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12.9.3 Integration 
12.9.3.1 Definition of the Integral 


Let (X, A, μ) be a measure space. The integral / f du (also denoted by / f du) for a measurable 
Jx i 
function f is defined by means of the following steps: 
1. If f is an elementary function f = Σ ακχκ. then 
k=1 

[tan = M a pt(Ag). (12.211 
: k=1 

2. If f: X — R (f > 0), then 


[ta = sup [ f gap : gis an elementary function with 0 € g(x) € f(x), Vr € x} . (12.212 


3. If f: X — Rand f}, f- are the positive and the negative parts of f, then 


[ 1n f fdp- [1-41 (12.213 


under the condition that at least one of the integrals on the right side is finite (in order to avoid the 
meaningless expression oo — oc). 


4. For a complex-valued function f = g + ih, if the integrals (12.213) of the functions g, h are finite, 
put 


[ fan [oan i [ han. (12.214) 


5. If for any measurable set A and a function f there exists the integral of the function [χα then put 
[fa i= / fa dp. (12.215) 
A 


The integral of a measurable function is in general a number from R. A function f: X — R is called 


integrable or summable over X with respect to p if it is measurable and / |f| du < oc. 


12.9.3.2 Some Properties of the Integral 
Let (X, A, μ) be a measure space, f, g : X — R be measurable functions and a, 8 € R. 
1. If f is integrable, then f is finite a.e., i.e., µία € X: | f(x)| = +20} = 0. 


| rns finas. 


f is integrable and f > 0, then i f du > 0. 


2. 


— 


f is integrable, then 


3. 


= 


4. If0 € g(x) € f(x) on X and f is integrable, then g is also integrable, and n < n 


5. 


— 


f. g are integrable, then af + 8g is integrable, and / (af + Bg)du=a / [du B J gdu. 


6. If f,g are integrable on A € A, i.e., there exist the integrals / f du and / gdu according to 
A JA 


(12.215) and f = g pra.e. on A, then f fdu- i gdp. 
A A 


If X = R” and λ is the Lebesgue measure, then the introduced integral is the (n-dimensional) Lebesgue 
integral (see also 8.2.3.1, 3., p. 507). In the case n = 1 and A = [a,b], for every continuous function 
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b 
f on [a,b] both the Riemann integral / f(x) da (see 8.2.1.1, 2., p. 494) and the Lebesgue integral 


f dX are defined. Both values are finite and equal to each other. Furthermore, if f is a bounded 
J [a,b] 


Riemann integrable function on (a, b], then it is also Lebesgue integrable and the values of the two 
integrals coincide. 

The set of Lebesgue integrable functions is considerably larger than the set of the Riemann integrable 
functions and it has several advantages, e.g., when passing to the limit under the integral sign and f, 
|f| are Lebesgue integrable simultaneously. 

12.9.3.3 Convergence Theorems 

Now Lebesgue measurable functions will be considered throughout. 

1. B.Levi's Theorem on Monotone Convergence 

Let { fr 4 be an a.e. monotone increasing sequence of non-negative integrable functions with values 
in R. Then 


dim / Te / lim f, dg. (12.216) 


2. Fatou's Theorem 
Let {fn}; be a sequence of non-negative R-valued measurable functions. Then 


[iim inf f, du € liminf ee dp. (12.217) 


3. Lebesgue's Dominated Convergence Theorem 
Let { fn } be a sequence of measurable functions convergent on X a.e. to some function f. If there exists 
an integrable function g such that | f,| € g a.e., then f = Jm f, is integrable and there holds 


Jim. f f. du — f lim f, dn. (12.218) 


4. Radon-Nikodym Theorem 
a) Assumptions: Let (X, A, μ) bea o-finite measure space, i.e., there exists a sequence {An}, An € A 


de 
such that X = U A, and μ( Αμ) < oo for V n. In this case the measure is called o finite. It is called 
n=1 


finite if μ(Χ) < oo, and it is called a probability measure if μ(Χ) = 1. A real function y defined on A is 
called absolutely continuous with respect to u if (A) = 0 implies y(A) = 0. This property is denoted 
by e < p. 

For an integrable function f, the function y defined on A by (A) = fa f duis ø additive and absolutely 
continuous with respect to the measure jz. The converse of this property plays a fundamental role in 
many theoretical investigations and practical applications: 

b) Radon-Nikodym Theorem: Suppose a c-additive function y and a measure ju are given on ao 
algebra A, and let y < u. Then there exists a jj-integrable function f such that for each set A € A, 


y(A) = fi dys. (12.219) 
A 
The function f is uniquely determined up to its equivalence class, and q is non-negative if and only if 
f 3 0 p-a.e. 
12.9.4 L?Spaces 


Let (X, A, p) be a measure space and p a real number 1 € p < oo. For a measurable function f, 
according to 12.9.2.2, p. 695, the function | f|? is measurable as well, so the expression 
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Αν) = (ην αμ)” (12.220) 


is defined (and may be equal to +00). A measurable function f: X — R is called p-th power integrable, 
or an L?-function if N,(f) < +00 holds or, equivalent to this, if | f|? is integrable. 
For every p with 1 € p < +00, the set of all L?-functions, i.e., all functions p-th power integrable with 
respect to u on X, is denoted by £^(j1) or by £?(X) or in full detail £? (X, A, 4i). For p = 1 the simple 
notation £(X) is used. For p = 2 the functions are called quadratically integrable. 
The set of all measurable µ-α.ο. bounded functions on X is denoted by £^ (x) and the essential supre- 
mum of a function f is defined as 

Noo(f) = ess. sup f = inf{a € R : |f(z)| € a µ-α.ο.}. (12.221) 
L"(u) (1 € p € œ) equipped with the usual operations for measurable functions and taking into 
consideration Minkowski inequality for integrals (see 1.4.2.13, p. 32), is a vector space and N,(-) is a 
semi-norm on Z?(pu). If f < g means that f(x) < g(x) holds µ-α.ο., then ϱΡ(μ) is also a vector lattice 
and even a K-space (see 12.1.7.4, p. 660). Two functions f,g € ϱΡ(μ) are called equivalent (or they 
are declared as equal) if f = g p-a.e. on X. In this way, functions are considered to be identical if they 
are equal j-a.e. The factorization of the set £?(X) modulo the linear subspace V; (0) leads to a set 
of equivalence classes on which the algebraic operations and the order can be transferred naturally. So 
a vector lattice (K-space) is obtained again, which is denoted now by L? (X, μ) or LP (u). Its elements 
are called functions, as before, but actually they are classes of equivalent functions. 
It is very important that || ||, = Np(f) is now a norm on L^(y) (f stands here for the equivalence class 


of f, which will later be denoted simply by f), and (L?(1), || |p) for every p with 1 € p € +œ isa 
Banach lattice with several good compatibility conditions between norm and order. For p — 2 with 


(f.g)— [τ dy. as a scalar product, [(μ) is also a Hilbert space (see [12.12]). 
Very often the space L” (Q) is considered for a measurable subset Q C R”. Its definition is not a problem 
because of step 5 in (12.9.3.1, p. 696). 


The spaces L?(Q, A), where A is the n-dimensional Lebesgue measure, can also be introduced as the 
completions (see 12.2.2.5, p. 668 and 12.3.2, p. 670) of the non-complete normed spaces C(Q) of all 


i 
continuous functions on the set Q C R” equipped with the integral norm ||v||; = ( / |α]Ρ dX) ας 


p < oo) (see [12.18]). 
Let X bea set with a finite measure, i.e., p(X) < +00, and suppose for the real numbers p1, po, 1 < pı < 
pa € +00. Then L?(X, u) C L" (X, i), and with a constant C = C(pi, po, i(X)) > 0 (independent 
of x), there holds the estimation |||]; € C||z||» for x € L”? (here ||| denotes the norm of the space 
L(X,u) (k — 1,2). 


12.9.5 Distributions 
12.9.5.1 Formula of Partial Integration 


For an arbitrary (open) domain Q C R”, ΟΣ (Q) denotes the set of all arbitrary many times in Q 
differentiable functions y with compact support, i.e., the set supp(y) = {a € Q : p(x) 4 0) is compact 
in R” and lies in Q. The set of all locally summable functions with respect to the Lebesgue measure in IR" 
is doted by Lj, (Q)), i.e., all the measurable functions f (equivalent classes) on Q such that f., |f| dÀ < 
-Foo for every bounded domain w C Q. 


Both sets are vector spaces (with the natural algebraic operations). 

There hold L?(Q) C Lj,.(Q) for 1 € p € oo, and Lj,,(Q) = L'(Q) for a bounded Q. If the elements 
of C*(Q) are considered as the classes generated by them in L?(Q), then the inclusion C*(Q) c L?(Q) 
holds for bounded Q, where C^(Q2) is at once dense. If Q is unbounded, then the set C5°(Q) is dense (in 
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this sense) in L?(Q). 


For a given function f € C*(Q) and an arbitrary function y € C§°(Q) the formula of partial integration 
has the form 


I {ία Ραφ(α) dd = (1) ri (a) D^ f(x) dd (12.222) 


Va with |a| € k (the fact that D°y|aq = 0 is used), and will be taken as the starting point for the 
definition of the generalized derivative of a function f € L1, (Q). 


12.9.5.2 Generalized Derivative 


Suppose f € Lj,.(Q). If there exists a function g € Lj,.(Q) such that V y € C§°(Q) with respect to 
some multi-index a the equation 


ΠΙΟΣ dì = (71)! f (yet) dy (12.223) 


holds, then g is called the generalized derivative (derivative in the Sobolev sense or distributional deriva- 
tive) of order a of f. It is denoted by g = D^ f as in the classical case. 
The convergence of a sequence {pr}? in the vector space C5* (Ω)ίοφ ερ (Ω) is defined as 


a) da compact set K C Q with supp(y;,) C K for any k, 


pr — 9 if and only if { b) Όσοι — D* uniformly on K for each multi-index a. 


(12.224) 
The set C° (NQ), equipped with this convergence of sequences, is called the fundamental space, and is 
denoted by D(Q). Its elements are often called test functions. 


12.9.5.3 Distributions 


A linear functional £ on D(Q) continuous in the following sense (see 12.2.3, p. 668): 
vx, P € D(Q) and vy — v imply Llyr) — Lle) (12.225) 
is called a generalized function or a distribution. 


E A: If f € LL, (Q), then 


tilp) = (F, p) = |, f(z)e(z)dA, ye DQ) (12.226) 


isa distribution. A distribution, defined by a locally summable function as in (12.226), is called regular. 
Two regular distributions are equal, i.e., ἔ() = 6,(v) Yy € D(Q), ifand only if f = g a.e. with respect 
to A. 

ΒΒ: Let a € 2 be an arbitrary fixed point. Then ές, (Φ) = v(a), v € D(Q) is a linear continuous 
functional on D(Q), hence a distribution, which is called the Dirac distribution, ὁ distribution or ὃ 
function. 
Since ls, cannot be generated by any locally summable function (see [12.11], [12.24]), it is an example 
for a non-regular distribution. 
The set of all distributions is denoted by D'(Q). From a more general duality theory than that discussed 
in 12.5.4, p. 681, D'(Q) can be obtained as the dual space of D(Q). Consequently, one should write 
D*(Q) instead. In the space D'((2), it is possible to define several operations with its elements and with 
functions from C? (Q2), e.g., the derivative of a distribution or the convolution of two distributions, 
which make D’(Q) important not only in theoretical investigations but also in practical applications in 
electrical engineering, mechanics, etc. 


For a review and for simple examples in applications of generalized functions see, e.g., [12.11], [12.24]. 
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Here, only the notion of the derivative of a generalized function is discussed. 


12.9.5.4 Derivative of a Distribution 


If £ is a given distribution, then the distribution D^/ defined by 


(D'A) = (-1)^4(D^e), ee D(Q), (12.227 
is called the distributional derivative of order a of £. 
Let f be a continuously differentiable function, say on R (so f is locally summable on R, and f can be 
considered as a distribution), let f’ be its classical derivative and D'f its distributional derivative o 
order 1. Then: 


(077,0) = [ fe) dz, (12.228a 
R 
from which by partial integration there follows 


(Difp) — / f(zye (x) dx = -(f,¢’). (12.228b 


In the case of a regular distribution £; with f € Lj,.(Q) by using (12.226) 
(DLN) = (CDU (Dg) = (71)* f f)D*edA (12.229 
ἃ 


is obtained, which is the generalized derivative of the function f in the Sobolev sense (see (12.223)). 
W A: For the regular distribution generated by the (obviously locally summable) Heaviside function 
Θ(α) = lo ror 7 = E (12.230) 
the non-regular ô distribution is obtained as the derivative. 
W B: In mathematical modeling of technical and physical problems one is faced with (in a certain sense 
idealized) influences concentrated at one point, such as a “point-like” force, needle-deflection, collision, 
etc., which can be expressed mathematically by using the ὃ or Heaviside function. For example, md, is 
the mass density of a point-like mass m concentrated at one point a (0 € a < l) of a beam of length I. 
The motion of a spring-mass system on which at time to there acts a momentary external force F is 
described by the equation ë + w?a = Fó,. With the initial conditions z(0) = <(0) = 0 its solution is 


x(t) = a sin(w(t — to))O(t — to). 


ω 


13 Vector Analysis and Vector Fields 


13.1 Basic Notions of the Theory of Vector Fields 


13.1.1 Vector Functions of a Scalar Variable 
13.1.1.1 Definitions 


1. Vector Function of a Scalar Variable t 
A vector function of a scalar variable is a vector ἃ whose components are real functions of t: 

ἃ = a(t) =a,(t)e, + a,(t)&, + a;(t)6.. (13.1) 
The notions of limit, continuity, differentiability are defined componentwise for the vector a(t). 
2. Hodograph of a Vector Function 
Considering the vector function a(t) as a position or radius vector ἵ = T(t) of a point P, then this 
function describes a space curve while t varies (Fig. 13.1). This space curve is called the hodograph of 
the vector function a(t). 


P, 
p d 30 

= t 
τι E 2 

Za 7 LLL ° 

0 μ.ο, 
P, 0 r+Ar ΖΡ» 
Figure 13.1 Figure 13.2 Figure 13.3 


13.1.1.2 Derivative of a Vector Function 

The derivative of (13.1) with respect to t is also a vector function of t: 
da, αι At)—a(t)  da,(t) , , da,(t) , , ἀα.(0., 
dr ~ Ah At oT AME WI 


(13.2) 


dr 
dt 
tion of the tangent of the hodograph at the point P (Fig. 13.2). Its length depends on the choice of 
the parameter t. If t is the time, then the vector r(t) describes the motion of a point P in space (the 


The geometric representation of the derivative — of the radius vector is a vector pointing in the direc- 


le 
space curve is its path), and = has the direction and magnitude of the velocity of this motion. If t = s 
d 
iz 
is the arclength of this space curve, measured from a certain point, then obviously «| zu 
ds 
13.1.1.3 Rules of Differentiation for Vectors 
d > dá , db , dé 
a+b ; 13.3a 
dt (a 8 dt dt dt’ (13:38) 
d ορια. (ip is a scalar function of 1) (13.3b) 
a (9 Hat PH o is a scalar function of t), 3b 
d = d> „db " 
T (ab) = T” FR (13.3c) 
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d ~ di > ib 
(a x b) xb+ax (the factors must not be interchanged), (13.3d) 
dt dt dt 
d da dg 
E a[o(t)] = a : ar (chain rule). (13.3e) 
m A 2 " E . NU „ dà . da 
If |a(t)| = const, i.e., a^ (t) = a(t) - a(t) = const, then it follows from (13.3c) that à - 3 0, 1.6. E 


and à are perpendicular to each other. Examples of this fact: 
W A: Radius and tangent vectors of a circle in the plane and 
W B: position and tangent vectors of a curve on the sphere. Then the hodograph is a spherical curve. 


13.1.1.4 Taylor Expansion for Vector Functions 
da eda h” d'a 
a(t +h) = a(t) + h— 4 + Een 13.4 
Verg c RU) or e ae πα psg 
The expansion of a vector function in a Taylor series makes sense only if it is convergent. Because the 
limit is defined componentwise, the convergence can be checked componentwise, so the convergence 
of this series with vector terms can be determined exactly by the same methods as the convergence of 
a series with complex terms (see 14.3.2, p. 751). So the convergence of a series with vector terms is 
reduced to the convergence of a series with scalar terms. 
The differential of a vector function a(t) is defined by: 
da 
da = —At. 13.5 
Ji (13.5) 


13.1.2 Scalar Fields 
13.1.2.1 Scalar Field or Scalar Point Function 


If a number (scalar value) U is assigned to every point P of a subset of space, then one writes 
U=U(P) (13.6a) 


and one calls (13.6a) a scalar field (or scalar function). 


W Examples of scalar fields are temperature, density, potential, etc., of solids. 
A scalar field U = U(P) can also be considered as 

U — U(r), (13.6b) 
where r is the position vector of the point P with a given pole 0 (see 3.5.1.1, 6., p. 182). 


13.1.2.2 Important Special Cases of Scalar Fields 
1. PlaneField 


One speaks of a plane field, if the function is defined only for the points of a plane in space. 

2. Central Field 

If a function has the same value at all points P lying at the same distance from a fixed point C(ri), 
called the center, then it is called a central symmetric field or also a central or spherical field. The 
function U depends only on the distance CP = |r|: 


U = f(|v ). (13.7a) 
W The field of the intensity of a point-like source, e.g., the field of brightness of a point-like source of 
light at the pole, can be described with |T| = r as the distance from the light source: 
c 


U-—-3 (cconst). (13.7b) 
n 
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3. Axial Field 
If the function U has the same value at all points lying at an equal distance from a certain straight line 
(axis of the field) then the field is called cylindrically symmetric or an axially symmetric field, or briefly 
an axial field. 
13.1.2.3 Coordinate Representation of Scalar Fields 
If the points of a subset of space are given by their coordinates, e.g., by Cartesian, cylindrical, or spher- 
ical coordinates, then the corresponding scalar field (13.6a) is represented, in general, by a function o 
three variables: 
U-d(r,y,z) U=W(p,p,z) or U= x(r,v,¢). (13.8a 
In the case of a plane field, a function with two variables is sufficient. It has the form in Cartesian and 
polar coordinates: 
U = (x,y) or U — V(p, q). (13.8b 
The functions in (13.8a) and (13.8b), in general, are assumed to be continuous, except, maybe, at some 
points, curves or surfaces of discontinuity. The functions have the form 


a) for a central field: U -U(z? + y? + 22) 2 U(y g? + 23) = U(r) (13.9a 
with the origin of the coordinate system as the pole of the field, 
b) for an axial field: U = U(y x? + y?) = U(p) = U(r sin ð) (13.9b 


with the z-axis as the axis of the field. 
Dealing with central fields is easiest using spherical coordinates, with axial fields using cylindrical co- 
ordinates. 


13.1.2.4 Level Surfaces and Level Lines ofa Field 


1. Level Surface 
A level surface is the union of all points P in space where the function (13.6a) has a constant value 

U — U(P) = const. (13.10a) 
Different constants Uo, Ui, U2, . . . define different level surfaces. There is a level surface passing through 
every point except the points where the function is not defined. The level surface equations in the three 
coordinate systems used so far are: 

U = (x,y,z) = const, U = W (p,p, z) = const, U = x(r, ù, p) = const. (13.10b) 
W Examples of level surfaces of different fields: 


A: U—cr-—cr-cy-c cz Parallel planes. 

B: U =a? + 2y? + 42?: Similar ellipsoids in similar positions. 
C: Central field: Concentric spheres. 

D: Axial field: Coaxial cylinders. 

2. Level Lines 


Level lines replace level surfaces in plane fields. They satisfy the equation 

U = const. (13.11) 
Level lines are usually drawn for equal intervals of U and each of them is marked by the corresponding 
value of U (Fig. 13.3). 
W Well-known examples are the isobaric lines on a synoptic map or the contour lines on topographic 
maps. 
In particular cases, level surfaces degenerate into points or lines, and level lines degenerate into separate 
points. 
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é , : 1 
he level lines of the fields a) U = xy, b) U = z c) U = x+y? = P, d) U are represented 
x p 


in Fig. 13.4. 
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Figure 13.4 
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.1.3 Vector Fields 


13.1.3.1 Vector Field or Vector Point Function 


Ifa 


anc 


vector V is assigned to every point P of a subset of space, then it is denoted by 


V = V(P) (13.124) 
calls (13.124) a vector field. 


W Examples of vector fields are the velocity field of a fluid in motion, a field of force, and a magnetic 
or electric intensity field. 


A vector field V = V(P) can be regarded as a vector function 


V = ViF), (13.12b) 


AY 
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where r is the position vector of the point P with a given pole 0. If all values of r as well as V lie ina 
plane, then the field is called a plane vector field (see 3.5.2, p. 190). 


13.1.3.2 Important Cases of Vector Fields 


1. Central Vector Field 
In acentral vector field all vectors V lie on straight lines passing through a fixed point called the center 
(Fig. 13.5a). 
Locating the pole at the center, then the field is defined by the formula 

V = fŒ), (13.13a) 
where all the vectors have the same direction as the radius vector r. It often has some advantage to 
define the field by the formula 


νο (r= |), (13.13b) 


Figure 13.5 


2. Spherical Vector Field 

A spherical vector field is a special case of a central vector field, where the length of the vector M depends 
only on the distance |r| (Fig. 13.5b). 

W Examples are the Newton and the Coulomb force field of a point-like mass or of a point-like electric 
charge: 


ds (c const) . (13.14) 
r 


M o 


C. 
Ve Le 
= 


The special case of a plane spherical vector field is called a circular field. 

3. Cylindrical Vector Field 

a) All vectors V lic on straight lines intersecting a certain line (called the axis) and perpendicular to 
it, and 


b) all vectors V at the points lying at the same distance from the axis have equal length, and they are 
directed either toward the axis or away from it (Fig. 13.5c). 
Locating the pole on the axis parallel to the unit vector C, then the field has the form 


V =(p)—, (13.15a) 
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where r* is the projection of r on a plane perpendicular to the axis: 

T =€ X(T XT): (13.15b) 
By intersecting this field with planes perpendicular to the axis, one always gets equal circular fields. 
13.1.3.3 Coordinate Representation of Vector Fields 
1. Vector Field in Cartesian Coordinates 
The vector field (13.12a) can be defined by scalar fields V; (r), Vo(r), and V3(r) which are the coordinate 
functions of V, i.e., the coefficients of its decomposition into any three non-coplanar base vectors 61, 
6», and ea: 


V = Vei + Vo + Vas. (13.164) 


With the coordinate unit vectors ij. K as base vectors and expressing the coefficients Vi, V2, V3 in 
Cartesian coordinates one gets 

V = για, y, z)i + νγ(α, y, ο] + ν.(α, y, z)k. (13.16b) 
So, the vector field can be defined with the help of three scalar functions of three scalar variables. 
2. Vector Field in Cylindrical and Spherical Coordinates 
In cylindrical and spherical coordinates, the coordinate unit vectors 


&, &, & (=k), and &(--)8&,&, (13.172) 
: 


are tangents to the coordinate lines at each point (Fig. 13.6, 13.7). In this order they always form a 
right-handed system. The coefficients are expressed as functions of the corresponding coordinates: 


v= V,(0, p, 2)8, + Volp: v, 2)8ρ + V-(p, p, z)8 (13.17b) 
V = V(r, 0, ϱ)δ, + Vo(r, 9, φ)6, + Volr, 0, )δν. (13.1Το) 


At transition from one point to the other, the coordinate unit vectors change their directions, but 
remain mutually perpendicular. 


Figure 13.6 Figure 13.7 Figure 13.8 


13.1.3.4 Transformation of Coordinate Systems 
See also Table 13.1. 
1. Cartesian Coordinates in Terms of Cylindrical Coordinates 
Vz = V, cosy — V; sing, V, = V,siny + V, cos p, V = Va (13.18) 
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2. Cylindrical Coordinates in Terms of Cartesian Coordinates 

V, = Vz cosy + V, sinc, V, = —Vz sin y + Vy cos p, Vz = Vz. (13.19) 
3. Cartesian Coordinates in Terms of Spherical Coordinates 

V, = V, sind cosy — V, sing + Vy cos e cos 0, 

V, = V.sinÜsin y + V, cosy + V; sin yp cos 0, 

V, = V, cos — V; sin. (13.20 
4. Spherical Coordinates in Terms of Cartesian Coordinates 

V. = V; sin) cos y + V, sin Ὁ sin y + V; cos 9, 

Və = Vz cos cos o + V, cos Ù sin  — V, sin 0, (13.21 

V, = — Vz sin y + Vy cosy. 
5. Expression of a Spherical Vector Field in Cartesian Coordinates 

V (ua? + y? 4 EXC | yj + zk). (13.22 
6. Expression of a Cylindrical Vector Field in Cartesian Coordinates 

V = e(/a? y?) (ri + yj). (13.23 


In the case of a spherical vector field, spherical coordinates are most convenient for investigations, 


i.e., the form V = V(r)e,; and for investigations in cylindrical fields, cylindrical coordinates are mos 


convenient, i.e., the form V = V(y)é,. In the case of a plane field (Fig. 13.8) 
V =V,(z, y)i+ Vv, y)j = Vole, P)E + Vole, ϱ)δρ, 


holds and for a circular field 
V = φ(ψαϑ 4- y?) (xi 4 


yi) = Φ(ο)8,. 


(13.24 


(13.25 


Table 13.1 Relations between the components of a vector in Cartesian, cylindrical, and spherical 


coordinates 


Cartesian coordinates 


Cylindrical coord. 


Spherical coordinates 


V =V,6, + V,&, + ν.δ. 


Vē, + V,&, + ν:6. 


V.&, + νυν + νρᾶ, 


—V, sin y + Vy cos p 


=V; 


=V; 


VW = V, cosp —V, sing | = V, sin Ù cosy + V; cos υ cos p 
— Vọsin y 

Vy = V, sin y + Vp cosy | = V, sin ὃ sin 9 + Vy cos ὐ sin p 
+ V, cos p 

Vz ΞΥ, = V, cos V — Vo sin Ü 

Vz cos + Vy sing =V, = V, sin Ü + V; cos Ü 

—V, sin y + Vy cos yp = V; = V 

Vz =V = V, cos Ü — V5 sin Ü 

V, sin Ý cosy + V, sin vsin y + V; cos | = V sin ð + V;cosü | = V, 

Vz cos 9 cosy + V, cos ü sin o — V; sing | = V cos ù — V.sind | = Vy 
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13.1.3.5 Vector Lines 

A curve C is called a line of a vector or a vector line of the vector field 
V (8) (Fig. 13.9) if the vector V (F) is a tangent vector of the curve 
at every point P. There is a vector line passing through every point 
of the field. Vector lines do not intersect each other, except, maybe, 


at points where the function V is not defined, or where it is the zero 
vector. The differential equations of the vector lines of a vector field Figure 13.9 


V given in Cartesian coordinates are 


dr d dz lr d 
a) in general: V = 7, τε! (13.364) b) for a plane field: v = - : (13.26b) 
'To solve these differential equations see 9.1.1.2, p. 542 or 9.2.1.1, p. 571. 
W A: The vector lines of a central field are rays starting at the center of the vector field. 


W B: The vector lines of the vector field V = c x r are circles lying in planes perpendicular to the 
vector C. Their centers are on the axis parallel to €. 


13.2 Differential Operators of Space 


13.2.1 Directional and Space Derivatives 


13.2.1.1 Directional Derivative of a Scalar Field 


The directional derivative of a scalar field U = U(r) at a point P with position vector r in the direction 
€ (Fig. 13.10) is defined as the limit of the quotient 


BE πα Leo e (13.27) 
Oc ε-λθ ε 


If the derivative of the field U = U (F) at a point r in the direction of the unit vector c? of € is denoted 


OU . us 9 . , ; 
by ——, then the relation between the derivative of the function with respect to the vector € and with 


` Oc? 
respect to its unit vector €° at the same point is 
OU „oU 
= = |e. (13.28) 
ὃς Oc? 
2.0. OU. : . . . 
The derivative ERO with respect to the unit vector represents the speed of increase of the function U in 
c 


20 


the direction of the vector c" at the point r. If n is the normal unit vector to the level surface passing 


er "me ο. ' . OU 
through the point r, and n is pointing in the direction of increasing U, then On has the greatest value 
ΠῚ 
among all the derivatives at the point with respect to the unit vectors in different directions. Between 
the directional derivatives with respect to ri and with respect to any direction c? holds the relation 


d = ~ cos(€°, fi) = d cosy = Οὐ. gradU (see (13.34), p. 710). (13.29) 


Hereafter, directional derivatives always mean the directional derivative with respect to a unit vector. 


13.2.1.2 Directional Derivative of a Vector Field 
The directional derivative of a vector field is defined analogously to the directional derivative of a scalar 


field. The directional derivative of the vector field V = Να) at a point P with position vector r 
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Figure 13.10 Figure 13.11 
(Fig. 13.11) with respect to the vector a is defined as the limit of the quotient 
oV Veoa)-wve 
PX guo οσα, (13.30) 
a Ε-»0 ε 


If the derivative of the vector field V = V(g) at a point F in the direction of the unit vector a° of a is 


ον 
denoted by ae then 


50 
ον δν 
ae = lalago (13.31) 


In Cartesian coordinates, i.e., for V= V,€, + Vey + ν.6,, ἃ = a,€, + ayēy + a;e, , holds: 
(8: grad )V = (&- grad V,)ez + (&- grad V,)e& + (8 - grad V;)e; , (13.32a) 
in general coordinates: 


1 m NC "AS 
(a- grad)V = zeot (V x &) + grad (&- V) + ädiv V — Vdiv& — a x rot V — V x rota. 


(13.32b) E 
13.2.1.3 Volume Derivative 
Volume derivatives of a scalar field U = U(r) or a vector field V at a point f are quantities of three 
forms, which are obtained as follows: 
1. Surrounding the point r of the scalar field or of the vector field by a closed surface X. This surface can 
be represented in parametric form F = F(u, v) = z(u, v)&, + y(u, v)&, + z(u, v)e; , so the corresponding 
vectorial surface element is 


τ dudv. (13.33a 
v 


2. Evaluating the surface integral over the closed surface Ὁ. Here, the following three types of integrals 
can be considered: 


fusa. divos, divas. (13.33b 
(2) (Ὁ) (Ὁ) 


8, νε. the limits (if n exist) 


lim — vf Ud$, lim vf d$, lim = vý V x d$. (13.33c 


v0 V 


Here V m the volume of ki region of space ias contains the point with the position vector Τ᾽ 
inside, and which is bounded by the considered closed surface X 
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The limits (13.33c) are called volume derivatives. The gradient of a scalar field and the divergence and 
the rotation of a vector field can be derived from them in the given order. In the following paragraphs, 
these notions will be discussed in details (even defining them again.) 


13.2.2 Gradient of a Scalar Field 

The gradient of a scalar field can be defined in different ways. 

13.2.2.1 Definition of the Gradient 

The gradient of a function U is a vector grad U, which can be assigned to every point P with the vector 
T of a scalar field U = U (F), having the following properties: 

1. The direction of grad U is always perpendicular to the direction of the level surface U = const, 
passing through the considered points P, 

2. grad U points always into the direction in which the function U is increasing, 


OU , : ; PE αι. : 
3. |grad U| = om i.e., the magnitude of grad U is equal to the directional derivative of U in the normal 
ΠῚ 

direction. 
If the gradient is defined in another way, e.g., as a volume derivative or by the differential operator, 
then the previous defining properties became consequences of the definition. 
13.2.2.2 Gradient and Directional Derivative 
The directional derivative of the scalar field U with respect to the unit vector €? is equal to the projec- 
tion of grad U onto the direction of the unit vector Οὔ; 

ου 

ae? 
i.e., the directional derivative can be calculated as the dot product of the gradient and the unit vector 
pointing into the required direction. 


= δῦ. grad U, (13.34) 


Remark: The directional derivative at certain points in certain directions may also exist if the gradient 
does not exist there. 

13.2.2.3 Gradient and Volume Derivative 

The gradient U of the scalar field U = U(r) at a point r can be defined as its volume derivative. If the 
following limit exists, then it is called the gradient of U at r: 


fp v as 


ΗΕ -- 
grad U = lim z (13.35) 


Here V is the volume of the region of space containing the point belonging to r inside and bounded by 
the closed surface X. (If the independent variable is not a three-dimensional vector, then the gradient 
is defined by the differential operator.) 

13.2.2.4 Further Properties of the Gradient 

1. The absolute value of the gradient is greater if the level lines or level surfaces drawn as mentioned 
in 13.1.2.4, 2., p. 703, are more dense. 


2. The gradient is the zero vector (grad U = 0) if U has a maximum or minimum at the considered 
point. The level lines or surfaces degenerate to a point there. 


13.2.2.5 Gradient of the Scalar Field in Different Coordinates 
1. Gradient in Cartesian Coordinates 
OU (x, y, 2) _ QU (o. y, z274 OU (x, y, De 


ra 1 P 
xd Ox Oy Oz 


(13.36) 
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2. Gradient in Cylindrical Coordinates (x = pcos o, y = p sin y, z = z) 


grad U = grad, 78, + grad, 76, + grad, 6. with (13.37a) 
1 , 

grad,U = -: grad,U = LAC grad,U — = : (13.37b) 
3. Gradient in Spherical Coordinates (x = rsin cos o, y = rsin sin p, z = 
r cos 9) 

grad U = grad, Ue, + grad; U&5 + grad, ΤΕ, with (13.38a) 

ου 1ου 1 οὐ 
rad, U =, gradjU =-=, grad,U = — , 13.38l 
ρτας Bp? B "I Pag: Brac eU raind Hu (13.38b) 


4. Gradient in General Orthogonal Coordinates (£, η, C) 
For F(E, η, €) = 2(£ n. QI ψ(ξνη, Οἱ + z(6 m. OK 


grad U = grad, δε + grad, Ue, + grad; U&, where (13.394) 
1 ðU 1 OU 1 OU ] 

grade U = [θε] oc grad,U = Jor On’ grad,U = BF] OC (13.39b) 
oE δη ὃς 


13.2.2.6 Rules of Calculations 


Assuming in the followings that € and c are constant, the following equalities hold: 


gradc — 0, grad (ὔι + U2) = grad U} + grad U2, | grad (cU) = cgrad U. (13.40 
αφ 

grad (Οτο) = Uigrad Uz + U;grad Ui, grady(U) = ered. (13.41 

grad (νι . V2) = (Vi . grad )V» + (νο - grad )JVi + V, x rot Vo + Vo x rot Vi. (13.42 

grad (T - c) = c. (13.43 
1. Differential of a Scalar Field as the Total Differential of the Function U 

f 

dU = grad U - dr = a dx + " dy + a dz. (13.44 
2. Derivative of a Function U along a Space Curve r(t) 

1 QU dx di dz 

dU 9U dz QU PEU (13.45 


dt Ox dt ` Oy dt z dt 


3. Gradient ofa Central Field 


grad U(r) Ut) (spherical field), (13.464) gradr = E (field of unit vectors). (13.465) 
F 
13.2.3 Vector Gradient 
The relation (13.32a) inspires the notation 
ὃν . " z "EN 5 
--- = &: grad (V,é, + V,&, + V.&;) = à- grad V (13.472) 


δα 
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where grad V is called the vector gradient. It follows from the matrix notation of (13.47a) that the 
vector gradient, as a tensor, can be represented by a matrix: 


ὃν, ὃν, OV, OV, OV, OV, 
Ox Oy Oz Ox Oy Oz 
20 qx | OV, 9V, ὃν, | (^) a, ss | OV, ὃν, ὃν, m 
(ā- grad )V = δα Oy Be ἂν (13.475) grad V = a oy oz | (13.47c) 
OV. OV, OV. OV, OV, OV. 
Ox Oy Oz Ox Oy Oz 


These types of tensors have a very important role in engineering sciences, e.g., for the description of 
tension and elasticity (see 4.3.2, 4., p. 282. 

13.2.4 Divergence of Vector Fields 

13.2.4.1 Definition of Divergence 


'To a vector field Viz) a scalar field can be assigned which is called its divergence. The divergence is 
defined as a space derivative of the vector field at a point r: 


py. ds 


div V = lim ' pr (13.48) 


If the vector field V is considered as a stream field, then the divergence can be considered as the fluid 
output or source, because it gives the amount of fluid given in a unit of volume during a unit of time 


flowing by the considered point of the vector field V. In the case div V > 0 the point is called a source, 
in the case div V < 0 it is called a sink. 

13.2.4.2 Divergence in Different Coordinates 

1. Divergence in Cartesian Coordinates 


OV, OV, ὄν. 


divV = δν "Oy | Oz (13.49a) with ναι y, z) = Veit Vij + Vk. (13.49b) 


The scalar field div V can be represented as the dot product of the nabla operator V and the vector V 
as 


divV 2 V. V (13.49c) 
and it is translation and rotation invariant, i.e., scalar invariant (see 4.3.3.2, p. 283). 
2. Divergence in Cylindrical Coordinates 
10(pV,) | 10V, ὃν. 


NN CE Ip | 2» | ὃς (13.504) with V(p,y,z) =V,é,+ V;&; + V.&;. (19.500 


3. Divergence in Spherical Coordinates 
10(?V. , 1 O(snóV;), 1 OV, 


livV = 
cud r2 Or rsinü  δὺ ' rsind Og ( 


3.51a 


with | V(r, 0, i) = ν.δ, + Voto + V, 3.51b 


«Ji (13.52a 


~ 


4. Divergence in General Orthogonal Coordinates 


.o Lf od /fjor|jar|,.\ ὃ (Ər, ὃ 
div V 5 ac {5 δε D } 5 oc E v) | xí 


or 
ὃς 


or 
On 
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with F(E,n,¢) = α(ξ,η, Qf y(n, 0] + 26.0, Ok, (13.52b) 
OF OF OF Or| |Or| | ar : " z z 3 
D= [5 δη x) = ls : i : E (13.52c) and V (E, n, C) = Veče + V,&, + Vee. (19.524) 


13.2.4.3 Rules for Evaluation of the Divergence 


dive —0, div (V + V3) — div Vi + div Vo, div (cV) =cdivV. (13.53 
div (UV) = U div V + V - grad U (socias div(rc) = zJ 9 (13.54 
m 
div (Vi x V3) = Vs. rot V; — Vi -rot Va. (13.55 
13.2.4.4 Divergence of a Central Field 
div? —3, divg(r)¥ = 3y(r) + ro (r). (13.56 


13.2.5 Rotation of Vector Fields 
13.2.5.1 Definitions of the Rotation 


1. Definition 7 p , 
The rotation or curl of a vector field V at the point T is a vector denoted by rot V, curl V or with the 


nabla operator V x V, and defined as the negative space derivative of the vector field: 


pv xas fpasx¥ 


() 


rot V = — lim 
v0 


2. Definition τ 
The vector field of the rotation of the vector field V (r) can be defined in the following way: 


a) Putting a small surface sheet S (Fig. 13.12) 
through the point r and describing this surface sheet 
by a vector S whose direction is the direction of the 
surface normal ñ and its absolute value is equal to the 
area of this surface region. The boundary of this sur- 
face is denoted by C. 
b) Evaluating the integral f V. dra ong the closed 

©) 
boundary curve C of the surface (the sense of the curve 


is positive looking to the surface from the direction of 
the surface normal (see Fig. 13.12). 


1 E 
c) Determining the limit (if it exists) lim 5 f V. dr, 
20$. 
(C) 


while the position of the surface sheet remains un- 
Figure 13.12 changed. 


d) Changing the position of the surface sheet in order to get a maximum value of the limit. The surface 
area in this position is Smax and the corresponding boundary curve is Cmax- 
e) Determining the vector rot f at the point r, whose absolute value is equal to the maximum value 
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found above and its direction coincides with the direction of the surface normal of the corresponding 


surface. 


Then one gets: 
V. di 
T o (max 
|rot v| = lim ΄ ) 
Smax—0 


max 


(13.58a) 


The projection of rot V onto the surface normal ñ of a surface with area S, i.e., the component of the 


vector rot V in an arbitrary direction n — lis 


I. 


= = NE 
rot V — rot; V — lim 


—0 


V.dr 
) 


The vector lines of the field rot V are called the curl lines of the vector field V. 


13.2.5.2 Rotation in Different Coordinates 


1. Rotation in Cartesian Coordinates 

=> ὃν, OV, (OV, OV, " 

V =i Z 51} - Z] +k 
! ( Oy Oz ) J ( Oz Ox ) ( 


or field rot V can be represented as the cross product of the nabla operator V and the vector 


8 
[5] 


The vec 
ME 

TO 
2. Ro 


TO 


3. Ro 


av, 


V=VxV. 


tation in Cylindrical Coordinates 


V — rot Vē, +rot Vē, +rot Vē, with 
> 10V m ƏV, 
εν ο he. 
pop Oz Oz p 
tation in Spherical Coordinates 
rot V = rot , V&, + rot Vē + rot nu with 


rot, V — S (ay eina - τ) ; 


| 


rsind | Ov Op 
= 1 ὄν. 13, 
tota V = rsind ðp 79; Yo) 


Ox 


OV,N — 
ὃν 


4. Rotation in General Orthogonal Coordinates 


rot V = τος Vē + rot " Vë, + rot ¢ Vë 


with 


COE ον en 


Qm 


= 


DUE|ec-a 
Q 
N 


(13.58b) 


(13.59a) 


(13.59b) 


(13.60a) 


(13.60b) 


(19.614) 


(19.618) 


(13.024) 
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Y ð ð (δή 
t ; η 
ασ ο) 
ὃ (95 ð (195 
ty V= Ve 13.62] 
ο ο. Son 
ὃ ð (95 
t, V = a ; 
τα i 
FEM) ο αμα (13.626 
13.2.5.3 Rules for Evaluating the Rotation 
rot (Vi + V2) = rot Vi + rot Va, τοῦ (eV) = crot V. (13.63 
rot (UV) = Urot V + gradU x V. (13.64 
ro (νι x V2) = (V2 - grad Vi = (νι - grad V2 + V, div Vz — Vo div Vi. (13.65 
13.2.5.4 Rotation of a Potential Field 
This also follows from the Stokes theorem (see 13.3.3.2, p. 725) that the rotation of a potential field is 
identically zero: 
rot V = rot (grad Ὁ) — 0. (13.66) 


This also follows from (13.594) for V= grad U, if the assumptions of the Schwarz interchanging theo- 
rem are fulfilled (see 6.2.2.2, 1., p. 448). 
Bb For? = ri + yj + 2k with r = |?| = 
vr) is a differentiable function of r. 
13.2.6 Nabla Operator, Laplace Operator 
13.2.6.1 Nabla Operator 


The symbolic vector V is called the nabla operator. Its use simplifies the representation of and calcu- 
lations with space differential operators. In Cartesian coordinates holds 


LE ὃ» O0. 
δα 


νο; - y? FZ holds: rot f = 0 and rot (y(r)#) = 0, where 


V= (13.67) 


The components of the nal 


5: means partial differen 
z 


stants. 


zla operator are considered as partial differential operators, i.e., the symbol 


jation with respect to x, where the other variables are considered as con- 


The formulas for spatial differential operators in Cartesian coordinates can be obtained by formal mul- 


tiplication of this vector o 
operators gradient, vector 


;erator by the scalar U or by the vector V. For instance, in the case of the 
gradient, divergence, and rotation: 


grad U = VU (gradient of U (see 13.2.2, p. 710)), (13.654) 
grad V=VV vector gradient of V (see 13.2.3, p. 711)), (13.68b) 
div V — V. V (divergence of V (see 13.2.4, p. 712)) (13.68c) 
rot V — V x V (rotation or curlof V (see 13.2.5, p. 713)). (13.68d) 
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13.2.6.2 Rules for Calculations with the Nabla Operator 


1. If V stands in front of a linear combination X a; X; with constants a; and with point functions X;, 
then, independently of whether they are scalar or vector functions, we have the formula: 


VY a Xi) = 31a; V Xi. (13.69) 


2. If V is applied to a product of scalar or vector functions, then it has to be applied to each of these 
functions after each other and the results are to be added. There is a | above the symbol of the function 
submitted to the operation 


4 + 4 
V(XYZ)=V(XYZ)+V(XY Z)+V(XY Z), ie., (13.70) 
V(XY Z) 2 (VX)YZ + X(VY)Z) - XY (VZ). 
Then the products have to be transformed according to vector algebra so as the operator V is applied 
to only one factor with the sign |. Having performed the computation one omits that sign. 
; , ΠΝ — "UN > 
B A: div(UV) - V(UV) =V(UV)+V(UV)=V:-VU+UV-V=V-gradU + U div V. 
-- ΜΞ ae "E « " , 
WB B: grad (ViV5) = V(ViV2) = V(Vi V2) + V(Vi V2). Because (ας) = (ab)c + à x (b x c) 
follows: grad (V1 V3) = (VV) Vi + Va x (V x Vi) + (Vi V)Va + Vi x (V x V3) 


-- 


= (Vagrad ) V: + V3 x rot νι + (Vigrad γν, + MZ x rot Vo. 
13.2.6.3 Vector Gradient 
The vector gradient grad V is represented by the nabla operator as 

grad V = VV. (13.71a) 
The expression occurring in the vector gradient (& - V)V (see (13.32b), p. 709) has the form: 

2(8. V)V = rot (V x &) + grad (AV) + ádiv V — Vdiva— ax rot V — V x rota. — (13.71b) 
In particular one gets for r — αἲ }- yj + 2k: 

(a. V)r =a. (13.71c) 
13.2.6.4 Nabla Operator Applied Twice 
For every field V: 


V(V x V) = divrot V = 0, (13.72) V x (VU) = rot grad U = 6, (13.73) 
V(VU) — divgradU — AU. (13.74) 


13.2.6.5 Laplace Operator 


1. Definition 
'The dot product of the nabla operator with itself is called the Laplace operator: 

A-V.V-V? (13.75) 
'The Laplace operator is not a vector. It prescribes the summation of the second partial derivatives. It 
can be applied to scalar functions as well as to vector functions. The application to a vector function, 
componentwise, results in a vector. 
The Laplace operator is an invariant, i.e., it does not change during translation and/or rotation of the 
coordinate system. 
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2. Formulas for the Laplace Operator in Different Coordinates 


Here the Laplace operator is applied to the scalar point function U(r). Then the result is a scalar. The 


application of it for vector functions Να) results in a vector AV with components AV,, AV,, AV.. 


1. Laplace Operator in Cartesian Coordinates 
Ox? Oy 202 

2. Laplace Operator in Cylindrical Coordinates 
18 / ƏU 19 OU 
pop ο») pog tas 


3. Laplace Operator in Spherical Coordinates 


; ; " 
AU(r,9, p) = 1? Ga t : B (sino 55) | l : Lm 


AU (x,y,z) = 


AU (0, 9,2) = 


r? Or Or r? sin 0 OV r? sin? Ὁ Oy? 


4. Laplace Operator in General Orthogonal Coordinates 


1|9 D OU ὃ D οὐ ὃ D οὐ ] 
ANG =D [ag | orp ae | δη | [δες δη | * aC | fare ac | ““ 


8€ δη ὃς 


or 
ὃς 


OF 
On 


or 
ὃς 


(En, 0) -- x(&m Oit yE n O+ En CK, (13.79 D= 


3. Special Relations between the Nabla Operator and Laplace Operator 


V(V - V) = graddiv V, 
V x (V x V) = rotrot V, 
V(V.V)-Vx(Vx V)— AV, | where 


2 217 92V.N 5 
AV = (V. V)V = AV,i + AV + AVE (= LEN yi 


Ox? | Oy? | Oz 

ev, By 21 Ν., 2y, 2y, ΣΝ, 

998 0 j+ age k. 
Ox? Oy? 0z? 


(o3 | Oy? 82 


13.2.7 Review of Spatial Differential Operations 
13.2.7.1 Rules of Calculation for Spatial Differential Operators 
U,U,, Uz and F are scalar functions; c is a constant; V, Vi, V, are vector functions: 
grad (U; + U2) = grad Ui + grad U2. (13.84) grad (cU) = cgrad U. 
grad (U1U5) = Ui grad Us4-U» grad U,.(13.86) grad F(U) = F'(U) grad U. 


div (Vi + V3) = div Vi + div Vo. (19.88) div (cV) — cdiv V. 


(13.76 


(13.792) 


(13.79c) 


(13.80) mn 


(13.81) 
(13.82) 


(13.83) 


(13.85) 
(13.87) 


(13.89) 
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div (UV) = 
rot (cV) = crot V. 
divrot V = 0. 


div grad U = AU. 


V. grad U + U div V. 


(13.90) rot (Vi + νο) = rot Vi --rot Va. — (13.91) 
(13.92) rot (UV) = U rot V — V x grad U. (13.93) 
(13.94) rot grad U = 0 (zero vector). (13.95) 
(13.96) rot rot V = grad div V — AV. (13.97) 


div (Vi x V3) = Vo-rot Vı— νι τοι V3.(13.98) 


13.2.7.2 Expressions of Vector Analysis in Cartesian, Cylindrical, and 
Spherical Coordinates (see Table 13.2) 


Table 13.2 Expressions of vector analysis in Cartesian, cylindrical, and spherical coordinates 


Cartesian coordinates | Cylindrical coordinates | Spherical coordinates 
ds = dr | &,dx + é,dy + 6.42 Edp + €,pdyp + 6.12 Edr + éyrdv + €r sin Ydy 
rady | g, 2U pe 9U pe 9U ες g 19U g 3U a 3U qa 2U cs 1 OU 
B "Or | Oy * Az " ap ρθρο 9; "Or | "rod ! "rsinó Op 
> | OV, Ον, OV. 1 ὃ 10V, OV, |10,. L9 
liv rer / t ry, Vy sin ὃ 
ΡΝ Ox Oy Oz 585 V») p Op τ Oz r? rV ΤΉ T rand 85 pnt) 
1 ὃν, 
rsinó ὃρ 
Ὁ OV, ,[19V, OV, y di ο. OV, 
rotV «(S Ὃν 2 8, E Jp Oz ) oad [ig sin) Jg 
γα. ƏV; ο. (OV, OV: a L| τ Ον ὃν. 
i a (5e - x] ως ( ðz τ) Ὃς m Op Or eva) 
,1[90VW, OV adl 9. 10V, aTa OV, 
Tes ( δα - m) i Grae 25 er πο o 
AU OU OU OU ð ( W\ _10U 10 / ,0U 
Ox? δν 02 pop P 8p pog Por \" Or 
PU 1 9 ος poe 
Oz? r? sin 0 00 δὺ 
1 OU 
r2 sin? 0 Oy? 
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13.2.7.3 Fundamental Relations and Results (see Table 13.3) 


Table 13.3 Fundamental relations for spatial differential operators 
Operator Symbol | Relation | Argument | Result Meaning 
Gradient grad U VU scalar vector maximal increase 
Vector gradient | gradV | VV vector tensor second order 
Divergence div V V-V vector scalar source, sink 
Rotation rot V Vxv vector vector curl 
Laplace operator | AU (V- V)U | scalar scalar potential field source 
Laplace operator | AV (V-V)V | vector vector 
13.3 Integration in Vector Fields 
Integration in vector fields is usually performed in Cartesian, cylindrical or in spherical coordinate 


systems. Usually one integrates along curves, surfaces, or volumes. The line, surface, and volume 
elements needed for these calculations are collected in Table 13.4. 


Table 13.4 Line, surface, and volume elements in Cartesian, cylindrical, and spherical coordinates 


Cartesian coordinates Cylindrical coordinates Spherical coordinates 
dr | €,dx + é,dy + €.dz Edp + &;pdq + €.dz &,dr + €yrdd + €,r sin Vdp 
dS &,dydz + €,dxdz + €,daxdy | €,pdpdz + &,dpdz + €.pdpdy | €,r? sin Vdddy 

-Feyr sin Vdrdy 
+é6,rdrdvdp 
dv* | dxdydz pdpdydz 1? sin Vdrdddy 

ᾱ, = Ej xe €, = €, χε, E, = 689 X δρ 

Ey = &, x ër é, = €, E; δ = δρ X 6, 

é, = €, X ἂν 6, = &p x & Ep = €, x Ey 

wi-fi jee- 6; Jn 

The indices i and j take the place of x, y, z or p, p, z or r, Ù, φ. 
% The volume is denoted here by v to avoid confusion with the absolute value of the vector 
function |V| = V. 


13.3.1 Line Integral and Potential in Vector Fields 
13.3.1.1 Line Integral in Vector Fields 


1. Definition The scalar-valued curvilinear integral or line integral of a vector function Viz) along 
a rectificable curve AB (Fig. 13.13) is the scalar value 
P= J VE) dz. 


AB 


(13.99a) 
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2. Evaluation of this Integral in Five Steps 

a) Dividing the path AB (Fig. 13.13) by division points 

Αι(τι), A2(f2),..-,An-1(Fn-1) (A = Ao B = An) 

into n small arcs which are approximated by the vectors 

r;— Pa = ΔΙ; 1. 

b) Choosing arbitrarily the points P; with position vec- 

tors & lying inside or at the boundary of each small arc. 

c) Calculating the dot product of the value of the func- 

tion νε ) at these chosen points with the corresponding 
δν 1. 

d) Taking the sum of all the n products. 


e) Calculating the limit of the sums got in this way 


n x 


YVE). ΔΕ, ι for |Ar; 4| — 0, while n — oo obvi- 


i-i 
Figure 13.13 ously. 
If this limit exists independently of the choice of the points A; and P;, then it is called the line integral 
V.dr- li VÈ) Afia. 13.99t 
! dr πας Σ (6) 1 (13.990) 


A sufficient condition for the existence of the line integral (13.99a,b) is that the vector function V (F) 


~ 
and the curve AB are continuous and the curve has a tangent varying continuously. 


C A vector function V(z) is continuous if its components, the three scalar functions, 
are continuous. 
= B 13.3.1.2 Interpretation ofthe Line Integral in Mechanics 
If Να) is a field of force, i.e., V(r) = F(r), then the line integral (13.99a) 
m represents the work done by F while a particle m moves along the path AB 


Figure 13.14 (Fig. 13.13,13.14). 


13.3.1.3 Properties of the Line Integral 


[IE æ= [Gare [ Vi) dg (Fig. 1344). (sein 
| ΠΣ We at [ver ee [Sm a -— 
J ee acf An am 
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13.3.1.4 Line Integral in Cartesian Coordinates 
In Cartesian coordinates the following formula holds: 
m το d= / (Vide + V; dy V; de). (13.104) 


AB 


13.3.1.5 Integral Along a Closed Curve in a Vector Field 
A line integral is called a contour integral if the path of integration is a closed curve. If the scalar value 
of the integral is denoted by P and the closed curve is denoted by C, then the following notation is used: 
= je di. (13.105) 
(C) 


13.3.1.6 Conservative Field or Potential Field 


1. Definition 

If the value P of the line integral (183.904) in a vector field depends only on the initial point A and the 
endpoint B, and is independent of the path between them, then this field is called a conservative field 
or a potential field. 

The value of the contour integral in a conservative field is always equal to zero: 


/νῶ: di = 0. (13.106) 


A conservative field is always irrotational: 

rot V = 0, (13.107) 
and conversely, this equality is a sufficient condition for a vector field to be conservative. Of course, it 
is to be supposed that the partial derivatives of the field function V are continuous with respect to the 


corresponding coordinates, and the domain of V is simply connected. This condition, also called the 
integrability condition (see 8.3.4.2, p. 521), has the following form in Cartesian coordinates 


ὃν. OV, OV, 08V, AV, OV, 


, ; : 13.108 
Oy ὃς Oz Oy Ox Oz ( ) 
2. Potential of a Conservative Field, 
or its potential function or briefly its potential is the scalar function 

U(#) = /νῶ di. (13.109a) 


In a conservative field it is calculated with a fixed initial point Α(Τῃ) and a variable endpoint B(r) as 
the line integral 


Ὁ - | νῷ a. (13.109b) 


Remark: In physics, the potential U* (F) of a function V (F) at the point F is often considered with the 
opposite sign: 


U*(#) = pi . di = -U (È). (13.110) 
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3. Relations between Gradient, Line Integral, and Potential 
If the relation V (T) = grad U (F) holds, then U (F) is the potential of the field V(#), and conversely, V (F) 
is a conservative or potential field. In physics often the negative sign is used corresponding to (13.110). 
4. Calculation of the Potential in a Conservative Field 7 
If the function V(r) is given in Cartesian coordinates V = V,i+ Vj + Vk, then for the total differen- 
tial of its potential function U 

dU = V, dx + V, dy + V. dz (13.11 1a) 
holds. Here, the coefficients V,, V,, V. must fulfill the integrability 
condition (13.108). The determination of U follows from the equa- 
tion system 


OU OU OU 
=V,, =V,, - V. 13.111} 
Ox V Oy v 92 vz is ) 


In practice, the calculation of the potential can be done by perform- 
ing the integration along three straight line segments parallel to the 
Figure 13.15 coordinate axes and connected to each other (Fig. 13.15): 


E E c y z 
PE [V dr = U (xo, yo, zo) «f Vi (zx, yo, zo) dx +/ Vy (x. Y, zo) dy «f V.(a, y, z) dz . (13.112) 
E J £O “YO 420 


To 


13.3.2 Surface Integrals 
13.3.2.1 Vector ofa Plane Sheet 


The vector representation of the surface integral of general type (see 8.3.4.2, p. 537) requires to assign 
a vector S toa plane surface region S, which is perpendicular to this region and its absolute value is 
equal to the area of S. Fig. 13.16a shows the case of a plane sheet. The positive direction in S is 
given by defining the positive sense along a closed curve C according to the right-hand law (also called 
right-screw rule): Looking from the initial point of the vector into the direction of its final point, then 
the positive sense is the clockwise direction. By this choice of orientation of the boundary curve one 
fixes the exterior side of this surface region, i.e., the side on which the vector lies. This definition works 
in the case of any surface region bounded by a closed curve (Fig. 13.16b,c). 


ny 


interior side | | 
pos 


exterior side 


do 


Pd 


a) b) C 


Figure 13.16 


13.3.2.2 Evaluation of the Surface Integral 


The evaluation of a surface integral in scalar or vector fields is independent of whether the surface S is 
bounded by a closed curve or is itself a closed surface. The evaluation is performed in five steps: 

a) Dividing the surface region S on the exterior side defined by the orientation of the boundary curve 
(Fig. 13.17) into n arbitrary elementary surfaces AS; so that each of these surface elements can be 
approximated by a plane surface element. Assigning the vector AS; to every surface element AS; as 
given in (13.33a). In the case of a closed surface, the positive direction is defined so that the exterior 
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side is where AS; should start. 
b) Choosing an arbitrary point P; with the position vector rj inside or on the boundary of each surface 
element. 


c) Producing the products U (£j) AS; in the case of a scalar field and the product V (r4) - AS; or V (&) x 
AS; in the case of a vector field. 

d) Taking the sum of all these products. 

e) Evaluating the limit while the diameters of AS; tend to zero, i.e., |AS;| - 0 for n — oo. So, the 


surface elements tend to zero in the sense given in 8.4.1, 1., p. 524, for double integrals. 
If this limit exists independently of the partition and of the choice of the points rj, then one calls it the 


surface integral of V on the given surface. 


yz 


0 ae y 
x 
Figure 13.17 Figure 13.18 
13.3.2.3 Surface Integrals and Flow of Fields 
1. Vector Flow of a Scalar Field 
P= lim σσ) AŠ; = i U(#) dS. (13.113) 
|AS;|20 j=1 (S) 
2. Scalar Flow of a Vector Field 
Q= lim Σύ): AS; = [ve dS. (13.114) 
ΙΔΘ8;|-»0 i—l ὦ) 
3. Vector Flow of a Vector Field 
R= lim YV(E)xAS;- [vw x d$. (13.115) 
|AS;|20 j=1 (S) 
13.3.2.4 Surface Integralin Cartesian Coordinates as 
Surface Integrals of Second Type 
|υ48- f U dy dei + [ [0a axi [ [ vacas (13.116) 
5) (Sys) (Sza) (35) 


[V-&=- ΠΟΙ NT f [ Veae dy. (13.117) 
(S) (Sys) (3.2) (Sey) 
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| ved ᾱ- [[(Vi-Vk)dydz+ ffVk-Vňdzdr+ | [Qvi -- νὰ) de ἄν 03.118) 
(S) (Syz) (S22) ( 


Bye Sey) 


The existence theorems for these integrals can be given similarly to those in 8.5.2, 4., p. 537. 

In the formulas above, each of the integrals is taken over the projection S on the corresponding coor- 
dinate plane (Fig. 13.18), where one of the variables x, y or z should be expressed by the others from 
the equation of S. 


Remark: Integrals over a closed surface are denoted by 
gud = duas. jv: a8 = Gv - d5, j V x dB = BV x a8. (13.119) 
(S) (S) (8) (5) (5) (5) 
W A: Calculate the integral P= / zyz dŠ, where the surface is the plane region r4-y4-z = 1 bounded 
(S) 
s. the coordinate planes. The upward side is the positive side: 


ua z)yz dy dzi+ a ., J [Q-2=weyae dyk; 


(S4) d 
τος 
ΓΚ f 1—y-z)yzdydz = f Á (1— y — z)yzdydz = a We get the two further integrals 
TE 0 Jo 
analogously. The result is: P- —( + j + k). 


τ 
W B: Calculate the integral Q = 1 . d$ = JJ udydz+ {ν΄ dx + η dy over the 


(Sz) 


(Sy 
an 1 
same plane region as in A: l / adydz = [ i (l-—y-—z)dydz = 5 Both other integrals are 


1 oi 1 1 
alculated similarly. The result is: Q = = += +4 ==. 
calculated similarly. The result is: Q stats 3 
E C: Calculate the integral R = Iz x dS = | (xy zk) x (dy dzi-- dz da j + dx dy K), where 
(5) (5) 


the surface region is the same as in A: Performing the computations gives R = 0. 


13.3.3 Integral Theorems 


13.3.3.1 Integral Theorem and Integral Formula of Gauss 


1. Integral Theorem of Gauss or the Divergence Theorem 


The integral theorem of Gauss gives the relation between a volume integral of the divergence of V over 
a volume v, and a surface integral over the surface S surrounding this volume. The orientation of the 
surface (see 8.5.2.1, p. 535) is defined so that the exterior side is the positive one. The vector function 
V should be continuous, their first partial derivatives should exist and be continuous. The integral 
theorem of Gauss reads as follows: 


Pv -a8= [ff aiv v av. (13.120a) 


(S) (0) 
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i.e., the scalar flow of the field V through a closed surface S is equal to the integral of divergence of ν 
over the volume v bounded by δ. In Cartesian coordinates one gets: 


po dy dz + V, dz dz + V; dy dy) = Wh = 4 Be | τα) dx dy dz . (13.120b) 
Oy 2 


2. Integral Formula of Gauss 

In the planar case, the integral theorem of Gauss restricted to the x, y plane becomes the integral for- 
mula of Gauss. It represents the correspondence between a line integral and the corresponding surface 
integral. The integral formula of Gauss reads as follows: 


JI Ee v. y) ane d dz dy nas x,y) dz + Q(x, y) ἀν]. (13.121) 


B mo a plane region which is bounded by C. P and Q are continuous functions with continuous 
first partial derivatives. 

3. Sector Formula 

The sector formula is an important special case of the Gauss integral formula to calculate the area of 
plane regions. For Q = x, P = —y it follows that 


F- [fe dy = : je — y dr]. (13.122) 
(B) (C) 


13.3.3.2 Integral Theorem of Stokes 

The integral theorem of Stokes gives the relation between a surface integral over an oriented surface 
region S, in which the vector field V is defined, and the integral along the closed boundary curve C of 
the surface S. The sense of the curve C is chosen so that the sense of traverse forms a right-screw with 
the surface normal (see 13.3.2.1, p. 722). The vector function V should be continuous and it should 
have continuous first partial derivatives. The integral theorem of Stokes reads as follows: 


I rot V .d8 = fv - dé, (13.123a) 


i.e., the vector flow of the rotation through a surface S bounded by the closed curve C is equal to the 


contour integral of the vector field V along the curve C. 
In Cartesian coordinates 


ὃν. OV, ὃν, ὃν. ον, Ve) ,, 
LG we) duds (5 ac) dete (FE πι 


= jv. dx + V, dy + Vz dz) (13.123b) 
(C) 


holds. In the planar case, the integral theorem of Stokes, just as that of Gauss, becomes into the integral 
formula (13.121) of Gauss. 


13.3.3.3 Integral Theorems of Green 
The Green integral theorems give relations between volume and surface integrals. They are the applica- 


tions of the Gauss theorem for the function V — U, grad Us, where U; and Us are scalar field functions 
and v is the volume surrounded by the surface S. The following theorems hold: 


1. [[[ vov. + grad Us - grad U;) dv = φυ. grad Us - dS, (13.124) 
(v) (8) 
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9. [[[ Sv. — U3AU;) dv = pu: grad Uz — Us grad U;) - dS. (13.125) 
(v) (S) 


In particular for U} = 1, U = U 


3. JII AU dv = (bead U - a8 (13.126) 
(v) (8) 


holds. In Cartesian coordinates the third Green theorem has the following form (compare (13.120b)): 


OU OU PU ου ου au 

JII (G5 t δι t 5) du $ (5 dy dz 4 δι dz dx 4 E? dedy) : (13.127) 
(S) 

M A: Calculating the line integral 7 = ϕ (ay? da 4- dy +2 dz) with a circle C as the intersection curve 
(C) 


of the cylinder z? + y? = a? and the plane z = 0. With the Stokes theorem (13.123a) one gets: 


2π a 6 

T= jv. dr = "EZ d$ = -- NEL dx dy = —3 J / τὸ cos? ysin? y dr dp = -r with 

ζ S) (S*) p=0r=0 
rot V = —3a??k, dS = k dx dy and the circle 6”: £? + y? < a2. 
E B: Determine the flux J = f V - dS in the drift space V = zï + 5j + 23k through the surface S 

(5 
of the sphere 27 + y? + 2? = a?. The theorem of Gauss yields: 
2π 7 a 
——— > F , , 12 . 

I= fv. dš = EST = 3 f [[ Go) de dy dz = J / / r' sind dr dd ἀρ = San 

(S) (v) (v) φ-θ 9=0 r=0 ` 


W C: Heat conduction ri The rad in time of the heat Q of a space region v containing 


no heat source is given by Ž — a i Teo? «ση dv (specific heat-capacity c, density o, temperature T), 


w the corresponding time- ο change of the heat flow through the surface S of v is given 
yi » = fh grad Τ᾽: dS (thermal conductivity A). Applying the theorem of Gauss for the surface 
(8) 
| OT : , : 
integral (13.120a) one gets from / ij / cea div (A grad T)| dv = 0 the heat conduction equation 
(0) 


OT T ; 
at = div (A grad T), which has the form = a^AT in the case of a homogeneous solid (c, o, À 


constants). 


13.4 Evaluation of Fields 
13.4.1 PureSource Fields 


A field V, is called a pure source field or an irrotational source field when its rotation is equal to zero 
everywhere. If the divergence is q(r), then 


divV,—q(F, ^ rot V; 20 (13.128) 
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holds. In this case, the field has a potential U, which is defined at every point P by the Poisson differ- 
ential equation (see 13.5.2, p. 729) 


Vi=eradU, — divgradU = AU = q(ř), (13.129a) 


where T is the position vector of P. (In physics most often V, = —gradU is used.) The evaluation of 
U comes from 


--i - [j| ο ΤΊ (13.129b) 


The integration is taken over the whole of space (Fig. 13.19). The divergence of V must be differen- 
tiable and be decreasing sufficiently rapidly for large distances. 


Tr "gore Coulomb or D Eq 
gravitational field 


NT oulc I 
A πό ἄν A P OR a 


(symbolically) 
0 
Figure 13.19 Figure 13.20 


13.4.2 Pure Rotation Field or Zero-Divergence Field 
A pure rotation (or curl) field or a solenoidal field is a vector field V, whose divergence is equal to zero 
everywhere; this field is free of sources. With w(r) as the rotation density 

divV2=0, το Vp = w(r) (13.130a 
hold. The rotation density w(r) cannot be arbitrary; it must satisfy the equation div w = 0. With the 
approach 


Va(F)-rotA(f) ^ divA —0, ie, rotrotÀ — w (13.130b 
follows according to (13.97) 
graddivA — AA — W, ie, ΔΑ =-—w. (13.130c 


So, A(T) formally satisfies the Poisson differential equation (see (13.135a), p. 729) just as the potentia. 


U of an irrotational field V, and that is why it is called a vector potential. For every point P, then 


5 2 > 1 w(r* 
V,-rotÁ holdswith A= — / i] / W αι, (13.130d 
4π |r — r*| 
The meaning of F is the same as in (13.129b); the integration is taken over the whole of space. 


13.4.3 Vector Fields with Point-Like Sources 
13.4.3.1 Coulomb Field of a Point-Like Charge 


The Coulomb field is an example of an irrotational field, which is also solenoidal, except at the location 
of the point charge q, the point source (Fig. 13.20). For the Coulomb force 
Ëe = 1 qq ο — qı "Mm: ,.9 
4περ rT? " — 4m&g n rà i r3? 4πεῃ 
holds. This force affects attractively for electric charges q1, q2 with different signs and repulsively for 
charges with equal signs. ερ is the electric constant (see Table 21.2, p. 1053), e is the intensity or 


(13.131a) 
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source strength of the source. The electric field strength and the electrostatic potential, generated in 
the space around the charge qı and affecting to the charge q» are given as 


oo 


Eo = © = -r--gadU, U-- (13.131b 
pr 


U denotes the electrostatic potential of the field. The scalar flow in accordance with the theorem o 
Gauss (see (13.120a), p. 724) is equal to 47e or 0, depending on whether the surface S encloses the 
point source or not: 


Β. dg = 4πε, if S encloses the point source, (13.131c 
0, otherwise. 
(S) 
Because of the irrotationality of the electrostatic field 
rot Éc =0. (13.131d 


13.4.3.2 Gravitational Field of a Point Mass 


The field of gravity of a point mass or the Newton field is a second example of an irrotational and at the 
same time solenoidal field, except at the point of the center of mass. For the Newton mass attraction 
mimo» 


Fy -y—.: 6. (13.132) 
E 


holds, where y is the gravitational constant (see Table 21.2, p. 1053). Every relation valid for the 
Coulomb field is valid analogously also for the Newton field. 


13.4.4 Superposition of Fields 
13.4.4.1 Discrete Source Distribution 


Analogously to superposition of fields in physics, vector fields superpose each other. The superposition 
lawis: If the vector fields V, have the potentials U,, then the vector field 

V = XV, has the potential U = XU,. (13.133a) 
For n discrete point sources with source strength e, (v = 1,2,...,n), whose fields are superposed, the 
resulting field can be determined by the algebraic sum of the potentials U,: 
Εν 


V (E) = —grad Συ, with U, = 


v=1 


(13.133b) 


|r = r,| i 
Here, the vector r is again the position vector of the point under consideration, r, are the position 
vectors of the sources. 

If there is an irrotational field V4 and a zero-divergence field Va together and they are everywhere 
continuous, then 


V=Vi+ ΠΠ; WES αυ (τῇ) — rot he? E αυ(α e. (13.133c) 


If the vector field is extended to infinity, then the decomposition of V (1) is unique if |V (z)| decreases 
sufficient rapidly for r = |f| — oo. The integration is taken over the whole of space. 


13.4.4.2 Continuous Source Distribution 


If the sources are distributed continuously along lines, surfaces, or in domains of space, then, instead of 
the finite source strength e,, there are infinitesimals corresponding to the density of the source distri- 
butions, and instead of the sums, we have integrals over the domain. In the case of a continuous space 


distribution of source strength, the divergence is q(F) = div V. 
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Similar statements are valid for the potential of a field defined by rotation. In the case of a continuous 
space rotation distribution, the * rotation density” is defined by W(£) = rot V. 
13.4.4.3 Conclusion 


A vector field is determined uniquely by its sources and rotations in space if all these sources and rota- 
tions located inside a finite space. 


13.5 Differential Equations of Vector Field Theory 
13.5.1 Laplace Differential Equation 


The problem to determine the potential U of a vector field Vig grad U containing no sources, leads 
to the equation according to (13.128) with q(r) = 0 
div V, = div grad U = AU = 0, (13.134a) 
i.e., to the Laplace differential equation. In Cartesian coordinates holds: 
OU OU PU 
AU + zz —=0. 13.134b 
03 * Oy? 02 ( ) 
Every function satisfying this differential equation and which is continuous and possesses continuous 
first and second order partial derivatives is called a Laplace or harmonic function (see also 14.1.2.2,2., 
p. 732). 
There are to distinguish three basic types of boundary value problems: 


1. Boundary value problem (for an interior domain) or Dirichlet problem: A function U(x, y, 2) is 
determined, which is harmonic inside a given space or plane domain and takes the given values at the 
boundary of this domain. 


2. Boundary value problem (for an interior domain) or Neumann problem: A function U (x, y, z) is 


: es ΜΟΝ . . ο. ὃν . 

determined, which is harmonic inside a given domain and whose normal derivative E takes the given 
n 

values at the boundary of this domain. 


3. Boundary value problem (for an interior domain): A function U (x,y,z) is determined, which is 


harmonic inside a given domain and the expression aU -- 9—— (α,β const, | o? ἠ- 8? #0) takes the 


On 


given values at the boundary of this domain. 


13.5.2 Poisson Differential Equation 


The problem to determine the potential U of a vector field V, = grad U with given divergence, leads 
to the equation according to (13.128) with q(r) 4 0 
div V, = div grad U = AU = (τ τ) £0, (13.135a 


i.e., to the Poisson differential equation. Since in Cartesian coordinates: 


2 2 Ω2 
AU = ΗΝ (13.135b 
i y? ο 


the Laplace differential equation (13.134b) is a special case of the Poisson differential equation (13.135b). 
The solution is the Newton P (for point masses) or the Coulomb potential (for point charges) 


q(r*) dv(r* 
; 4π αμ r-r ᾿ (13.135c 


The integration is taken over the whole of space. U(r) tends to zero sufficiently rapidly for increasing 
|r| values. 
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One can discuss the same three boundary value problems for the Poisson differential equation as for the 
solution of the Laplace differential equation in 13.5.1. The first and the third boundary value problems 
can be solved uniquely; for the second one there are to prescribe further special conditions (see [9.5]). 


14 Function Theory 


14.1 Functions of Complex Variables 


14.1.1 Continuity, Differentiability 
14.1.1.1 Definition of a Complex Function 


Analogously to real functions, complex values can be assigned to complex values, i.e., to the value 
z = x + iy one can assign a complex number w = u + iv, where u = u(r, y) and v = v(x, y) are 
real functions of two real variables. This relation is denoted by w = f(z). The function w = f(z) isa 
mapping from the complex z plane to the complex w plane. 

The notions of limit, continuity, and derivative of a complex function w = f(z) can be defined analo- 
gously to real functions of real variables. 


14.1.1.2 Limit of a Complex Function 
The limit of a function f(z) is equal to the complex number wo if for z approaching zo the value of the 
function f(z) approaches wo: 
Wo = Jim f(z). (14. 1a) 


In other words: For any positive ε there is a (real) 6 > 0 such that for every z satisfying (14.1b), except 
maybe 2p itself, the inequality (14.16) holds: 
[zo — z| < ὃ, (14.1b) [wo — f(z)| < €. (14.1c) 
The geometrical meaning is as follows: Any point z in the circle with center zo and radius ὃ, except 
maybe the center zo itself, is mapped into a point w = f(z) inside a circle with center wo and radius 
£ in the w plane where f has its range, as shown in Fig. 14.1. The circles with radii ô and £ are also 


called the neighborhoods U,(wo) and Us(zo). 
yA vA 
0 I τ 0 1 u 
z plane X w plane u 
aj à b) P 


Figure 14.1 
14.1.1.3 Continuous Complex Functions 


A function w = f(z) is continuous at 2 if it has a limit there, and a substitution value, and these two 
are equal, i.e., if for an arbitrarily small given neighborhood U.(wo) of the point wọ = f(zo) in the w 
plane there exists a neighborhood U;(zo) of zo in the z plane such that w = f(z) € U-(wo) for every 
z € Us(zo). As represented in Fig. 14.1, U-(wo) is, e.g., a circle with radius £ around the point wo. 
'The continuity of f is expressed by the usual notation 


Jim f(z) το) ox — lim f(zo + 9) = f(zo). (14.2) 
14.1.1.4 Differentiability of a Complex Function 


A function w = f(z) is differentiable at z if the difference quotient 
Aw ία Δα) f(z) 


A A (14.3) 
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has a limit for A z — 0, independently of how A z approaches zero. This limit is denoted by f'(z) and 
it is called the derivative of f(z). 

Β The function f(z) = Rez = x is not differentiable at any point z = zo, since approaching zo parallel 
to the x-axis the limit of the difference quotient is one, and approaching parallel to the y-axis this value 
is zero. 


14.1.2 Analytic Functions 
14.1.2.1 Definition of Analytic Functions 


A function f(z) is called analytic, regular or holomorphic on a domain G, if it is differentiable at every 
point of G. The boundary points of G, where f'(z) does not exist, are singular points of f(z). 
The function f(z) = u(x, y) 4-iv(z, y) is differentiable in G if u and v have continuous partial derivatives 
in G with respect to x and y and they also satisfy the Cauchy-Riemann differential equations: 


ðu Ov Qu Ov 


= —, . 14.4 
Ox ὃν Oy Ox [8 
The real and imaginary parts of an analytic function satisfy the Laplace differential equation: 
u Pu Ov Av 
Au(x,y) = 55 + oy = 0, (14.5a) Av(xr,y) = ΕΙΣ + dy? = 0. (14.5b) 


The derivatives of the elementary functions of a complex variable can be calculated with the help of 
the same formulas as the derivative of the corresponding real functions. 


MA: f(z)=2, f(z) =32?; WB: f(z) =sinz, f'(z)- cosz. 
14.1.2.2 Examples of Analytic Functions 


1. Elementary Functions 

The elementary algebraic and transcendental functions are analytic in the whole z plane except at 
some isolated singular points. If a function is analytic on a domain, i.e., it is differentiable, then it is 
differentiable arbitrarily many times. 

ΒΑ: The function w = 2? with u = 2? — y?, v = 2zy is everywhere analytic. 

W B: The function w = u + iv, defined by the equations u = 2x + y, v = x + 2y, is not analytic at 
any point. 
Bl C: The function f(z) = z? with f'(z) = 32? is analytic. 

E D: The function f(z) = sin z with f'(z) = cos z is analytic. 


2. Determination of the Functions u and v 

If both the functions u and v satisfy the Laplace differential equation, then they are harmonic functions 
(see 13.5.1, p. 729). If one of these harmonic functions is known, e.g., u, then the second one, as 
the conjugate harmonic function v, can be determined up to an additive constant with the Cauchy— 
Riemann differential equations: 


ðu ð [Ou ) l (146) 


j= [Stay +(x) with P = (5 ! az! 8x dy 
Analogously u can be determined if v is known. 
14.1.2.3 Properties of Analytic Functions 
1. Absolute Value or Modulus of an Analytic Function 
The absolute value (modulus) of an analytic function is: 

[w| = |f(2)] = y uz. y)? + [v(x v)? = ee. v)- (14.7) 


The surface |w| = y(x, y) is called its relief, i.e., 


w| is the third coordinate above every point z = +i y. 
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E A: The absolute value of the function sin z = sin x cosh y+i cos 2; sinh yis | sin z| = ysin? x + sinh? y. 
The relief is shown in Fig. 14.2a. 

E B: The relief of the function w = e!/? is shown in Fig. 14.2b. 

For the reliefs of several analytic functions see [14.10]. 
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Figure 14.2 
2. Roots 
Since the absolute value of a function is positive or zero, the relief is always above the z plane, except 
the points where | f(z)| = 0 holds, so f(z) = 0. The z values, where f(z) = 0 holds, are called the roots 
of the function f(z). 
3. Boundedness 
A function f(z) is bounded on a certain domain G, if there exists a positive number N such that | f(z)| < 
N for all z in G. In the opposite case, if no such number N exists, then the function is called unbounded 
in G. 
4. Theorem about the Maximum Value 


If w = f(z) is an analytic function on a closed domain, then the maximum of its absolute value is 
attained on the boundary of the domain. 


5. Theorem about the Constant (Theorem of Liouville) 
If w = f(z) is analytic in the whole z plane and also bounded, then this function is a constant: f(z) = 


const. 


14.1.2.4 Singular Points 

If a function w = f(z) is analytic in a neighborhood of z = a, i.e., in a small circle with center a, except 
a itself, then f has a singularity at a. There exist three types of singularities: 

1. f(z) is bounded in the neighborhood. Then there exists w = lim f(z), and setting f(a) = w the 
function becomes analytic also at a. In this case, f has a removable singularity at a. 

2. If lim |f(z)| = oo, then f has a pole at a. About poles of different orders see 14.3.5.1, p. 753. 

3. If f has neither a removable singularity nor a pole at a, then f has an essential singularity at a. In 


this case, for any complex w there exists a sequence zn — a such that f(z,) > w. 


W A: The function w = 


has a pole at a. 
z—a 
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E B: The function w = e!/ has an essential singularity at 0 (Fig. 14.2b). 


14.1.3 Conformal Mapping 
14.1.3.1 Notion and Properties of Conformal Mappings 
1. Definition 


A mapping from the z plane to the w plane is called a conformal mapping if it is analytic and injective. 
In this case, 

w= f(z) =utiv, f'(z) #0. (14.8) 
The conformal mapping has the following properties: 
The transformation dw = f'(z) dz of the line element dz = t is the composition of a dilatation by 
o = |f'(z)| and of a rotation by a = arg f'(z). This means that infinitesimal circles are transformed 
into almost circles, triangles into (almost) similar triangles (Fig. 14.3). The curves keep their angles 
of intersection, so an orthogonal family of curves is transformed into an orthogonal family (Fig. 14.4). 


» 

x ol 
b) 

Figure 14.3 Figure 14.4 


Remark: Conformal mappings can be found in physics, electrotechnics, hydro- and aerodynamics and 
in other areas of mathematics. 
2. The Cauchy-Riemann Equations 


'The mapping between dz and dw is given by the affine differential transformation 


Qu Qu Ov Ov 
du = — dx + — dy, dv = — dx + —d 14.9a 
u as By Y, v= g eE - y (14.9a) 
and in matrix form 
dw=Adz with A= is "d (14.9b) 
Uz Uy 


According to the Cauchy-Riemann differential equations, A has the form of a rotation-stretching ma- 
trix (see 3.5.2.2, 2., p. 192) with σ as the stretching factor: 


Ae es pus o Gay pow : (14.102) 


Up Ug sina cosa 
Uy = v,-—ccosa, (14.10b) o —|f'(z) = γ +u? = v3 ευ, (14.100) 
—Uy = v, =osina, (14.10d) a = arg [ (2) = arg (u, + ivr). (14.10e) 


3. Orthogonal Systems 
The coordinate lines x = const and y = const of the z plane are transformed by a conformal mapping 
into two orthogonal families of curves. In general, a bunch of orthogonal curvilinear coordinate systems 
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can be generated by analytic functions; and conversely, for every conformal mapping there exist an 
orthogonal net of curves which is transformed into an orthogonal coordinate system. 
W A: In the case of u = 2x + y, v = x + 2y (Fig. 14.5), orthogonality does not hold. 
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Figure 14.5 
E B: In the case w = z? the orthogonality is retained, except at the point z = 0 because here w’ = 0. 
The coordinate lines are transformed into two confocal families of parabolas (Fig. 14.6), the first 
quadrant of the z plane into the upper half of the w plane. 
yA vA 
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Figure 14.6 
14.1.3.2 Simplest Conformal Mappings 


In this paragraph, some transformations are discussed with their most important properties, and there 
are given the graphs of isometric nets in the z plane, i.e., the nets which are transformed into an or- 
thogonal Cartesian net in the w plane. The boundaries of the z regions mapped into the upper half 
of the w plane are denoted by shading. Black regions are mapped by the conformal mapping onto the 
square in the w plane with vertices (0,0), (0, 1), (1,0), and (1, 1) (Fig. 14.7). 
1. Linear Function 
For the conformal mapping given in the form of a linear function 

w=az+b (a,bcomplex, const; a Z 0), (14.11a) 
the transformation can be done in three steps: 


a) Rotation of the z plane by the angle a = arga according to: w, = ez. 
b) Stretching of the w, plane by the factor |a|: w = |a|w;. (14.11b) 
c) Parallel translation of the w plane by b : w — w+ b. 


Altogether, every figure of the z plane is transformed into a similar one of the w plane. The points 


zı = oo and 21 = I for a # 1 are transformed into themselves, and they are called fixed points. 
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Fig. 14.8 shows the orthogonal net which is transformed into the orthogonal Cartesian net. 


y yA 
v 
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Figure 14.7 Figure 14.8 Figure 14.9 


2. Inversion 
The conformal mapping 


1 
(14.12 


w = = 
represents an inversion with respect to the unit circle and a reflection in the real axis, namely, a poini 
z = re? of the z plane with the absolute value r and with the argument q is transformed into a poin 


w= tev of the w plane with the absolute value 1/r and with the argument —¢ (see Fig. 14.10). 


Circles are transformed into circles, where lines are considered as limiting cases of circles (radius — oc). 
Points of the interior of the unit circle become exterior points and conversely (see Fig. 14.11). The 
point z = 0 is transformed into w = oo. The points z = —1 and z = 1 are fixed points of this conforma 
mapping. The orthogonal net of the transformation (14.12) is shown in Fig. 14.9. 

Remark: In general a geometric transformation is called inversion with respect to a circle with radius 
r, when a point P, with radius rz inside the circle is transformed into a point P, on the elongation o 


τοπ ΜΕ 
the same radius vector OP, outside of the circle with radius OP; = τι = r?/ra. Points of the interior 
of the circle become exterior points and conversely. 


VA 
1 

x 0 a 

b) 

Figure 14.10 Figure 14.11 
3. Linear Fractional Function 
For the conformal mapping given in the form of a linear fractional function 
az +b 

w=——— (a,b, c, d complex, const; bc — ad # 0; c 4 0) (14.13a) 
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the transformation can be performed in three steps: 
a) Linear function: w = cz + d. 


b) Inversion: w = a (14.13b) 
. | a  bc— ad 
c) Linear function: w =- + ———— ως. 
ο ο 
Circles are transformed again into circles (circular transformation), where straight lines are considered 
as limiting cases of circles with radius r — oo. Fixed points of this conformal mapping are the both 
points satisfying the quadratic equation 


az+b 


(14.14) 


If the points zı and z» are inverses of each other with respect to a circle Ky of the z plane, then their 
images w, and wy in the w plane are also inversions of each other with respect to the image circle K 
of Κι. 

The orthogonal net which has the orthogonal Cartesian net as its image is represented in Fig. 14.12. 
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4. Quadratic Function 
The conformal mapping described by a quadratic function 
w= 23 (14.154) 


has the form in polar coordinates and as a function of x and y: 
w -- pre’, (14.15b) w=u+iv = r? — y? +2 ry. (14.15c) 


It is obvious from the polar coordinate representation that the upper half of the z plane is mapped onto 
the whole w plane, i.e., the whole image of the z plane will cover twice the whole w plane. 
The representation in Cartesian coordinates shows that the coordinate lines of the w plane, u = const 
and v = const, come from the orthogonal families of hyperbolas z? — y? = u and 2xy = v of the z plane 
(Fig. 14.13). 
Fixed points of this mapping are z — 0 and z — 1. This mapping is not conformal at z — 0. 
5. Square Root 
The conformal mapping given in the form as a square root of z, 

w= yz, (14.16) 
transforms the whole z plane whether onto the upper half of the w plane or onto the lower half of it, 
i.e., the function is double-valued, i.e., to every value of z (z 7 0) belong two values of w (see 1.5.3.6, 
p. 38). The coordinate lines of the w plane come from two orthogonal families of confocal parabolas 
with the focus at the origin of the z plane and with the positive or with the negative real half-axis as 
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their axis (Fig. 14.14). 

Fixed points of the mapping are z = 0 and z = 1. The mapping is not conformal at z = 0. 
6. Sum of Linear and Fractional Linear Functions 

The conformal mapping given by the function 


k 1 
iub (- + =) (k areal constant, k > 0) (14.17a 
z 


can be transformed by the polar coordinate representation of z = p e!? and by separating the real and 
imaginary parts according to (14.8): 


k 1 k 1 
u-- (» + *) cosy, υξτς (» — *) sin y. (14.17b 
2 p 2 p 
Circles with p — po — const of the z plane (Fig. 14.15a) are transformed into confocal ellipses 
u? v k 1 k 1 
— +5 =l with a | πι. 14.17 
a? p iu: 2 i Poj 2 " Po ( n 


in the w plane (Fig. 14.15b). The foci are the points +k of the real axis. For the unit circle with 
p = po = 1 one gets the degenerated ellipse of the w plane, the twice overrunning segment (—k, +k 
of the real axis. Both the interior and the exterior of the unit circle are transformed onto the entire w 
plane with the cut (—k, +k), so its inverse function is double-valued: 


w+ Jw — k2 


eS — SD (14.17d 
The lines o = qo of the z plane (Fig. 14.15c) become confocal hyperbolas 

Ae ait 

dmg 1 with a= kcosqo, Jf — ksin qo (14.17e 

o? (p 


with foci tk (Fig. 14.15d). The hyperbolas corresponding to the coordinate half-axis of the z plane 


T 3 ; : : : ; 
(e =0, 5:7, 57) are degenerate in the axis u = 0 (v axis) and in the intervals (--οο, —k) and (k, oo 


of the real axis running there and back. 
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Figure 14.15 
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7. Logarithm 


The conformal mapping given in the form of the logarithm function 


w=Lnz (14.18a) 
has the form for z given in polar coordinates: 
u=Inp, v= y~+2kr (eS 0; SETS SED. sso) (14.18b) 


One can see from this representation that the coordinate lines u — const and v — const come from 
concentric circles around the origin of the z plane and from rays starting at the origin of the z plane 
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(Fig. 14.16). The isometric net is a polar net. 

The logarithm function Ln z is infinitely many valued (see (14.74c), p. 758). 

Restricting the investigation to the principal value In z of Lnz (—7 < v € +r), then the whole z plane 
is transformed into a stripe of the w plane bounded by the lines v = +7, where v = π belongs to the 
stripe. 


8. Exponential Function 
The conformal mapping given in the form of an exponential function (see also 14.5.2, 1., p. 758) 


w = e? (14.19a) 
has the form in polar coordinates: 

w = pel”. (14.19b) 
From z = x + iy follows: 

p=e" and wv-y. (14.19c) 


If y changes from —7 to +7, and x changes from —oo to Ἴ-οο, then p takes all values from 0 to oo and 
w from —7 to 7. A 2r wide stripe of the z plane, parallel to the x-axis, will be transformed into the 
entire w plane (Fig. 14.17). 
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9. The Schwarz-Christoffel Formula 
By the Schwarz-Christoffel formula 
n dt 
220, / me? (14.20a) 
| (C wi) 7 wy wn) 
the interior of a polygon of the z plane can be mapped onto the upper half of the w plane. The polygon 
has n exterior angles αιπ, o7, ..., Qnr (Fig. 14.18a,b). The points of the real axis in the w plane 
assigned to the vertices of the polygon are denoted by w; (i = 1,..., n), and the integration variable is 


denoted by t. The oriented boundary of the polygon is transformed into the oriented real axis of the w 
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plane by this mapping. For large values of |t|, the integrand behaves as 1/t? and is regular at infinity. 
Since the sum of all the exterior angles of an n-gon is equal to 27, consequently 


(14.20b) 


X b 


Figure 14.18 


The complex constants C, and C^ yield a rotation, a stretching and a translation; they do not depend 
on the form of the polygon, only on the size and the position of the polygon in the z plane. 


Three arbitrary points w1, we, ws of the w plane can be assigned to three points 21, 2ο, 23 of the polygon 
in the z plane. Assigning a point at infinity in the w plane, i.e., w; = coo to a vertex of the polygon 
in the z plane, e.g., to z = 2, then the factor (t — wi)?! is omitted. If the polygon is degenerated, 
e.g., a vertex is at infinity, then the corresponding exterior angle is 7, so ax = 1, i.e., the polygon is a 
half-stripe. 
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Figure 14.19 
W A: Mapping of a certain region of the z plane (shaded region in Fig. 
14.194): Considering ΣῚ a, = 2 one chooses three points as the table shows 
(Fig. 14.19a,b). The formula of the mapping is: 
w dt 2d 
2Ξ6ι | ( —; = 2C; (vw — arctan vw) =i— (vw — arctan w) s 
Jo π 


t+ lt? 


NET 
. i - . 0 (-1 + pe) 1ρε dp 
To determinate C4 one substitutes t = ρε — 1, giving id = Οι lim / - 
p30 Ja pe? 
d 


i.e., Οἱ =i-. 
π 


For the constant C follows C5 = 0, considering that the mapping assigns “z = 0 > w = 0". 


WB B: Mapping of a rectangle. Let the vertices of the rectangle be z214 = +K, 293 = +K +ik’. The 
points σι and zy should be transformed into the points w; = 1 and w = /k ( 0 « k « 1) of the real 
axis, 24 and 24 are reflections of zı and zg with respect to the imaginary axis. ae ding to the Schwarz 
reflection principle (see 14.1.3.3, p. 741) they must correspond to the points w4 = —1 and ws = —1/k 
(Fig. 14.20a,b). So, the mapping formula for a rectangle (αι = ay = a3 = o4 = 1/2) of the position 
w w 
sketched above is: z = C1 [ Ht =C, [ τ . The 
«Ὁ yt w)(t — wa)(t — w3) (t — wa) 0 
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Figure 14.20 


point z = 0 has the image w = 0 and the image of z = iK is w = oo. With Οι = 1/k follows 
. " dt e dv 
^" — Jo ya - &)a - kt) Jo V1 — k2 sin? Y 


Εφ, k) is the elliptic integral of the first kind (see 8.1.4.3, p. 490). 
For the constant C» one gets C5 = 0, considering that the mapping assigns “ z = 0 > w=0”. 


14.1.3.3 Schwarz Reflection Principle 


1. Statement 

Suppose f(z) is an analytic complex function in a domain G, and the boundary of G contains a line 

segment gı. If the function is continuous on gı and it maps the line gı into a line gj, then the points 

fe nr with respect to gı are transformed into points lying symmetric with respect to gj 
Fig. 14.21). 


F(y,k) (substituting t = sin’, w = sing). 
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Figure 14.21 Figure 14.22 Figure 14.23 


2. Application bm 


'The application of this principle makes it easier to perform calculations and the representations of 
plane regions with straight line boundaries: If the line boundary is a stream line (isolating boundary 
in Fig. 14.22), then the sources are reflected as sources, the sinks as sinks and curls as curls with 
the opposite sense of rotation. If the line boundary is a potential line (heavy conducting boundary in 
Fig. 14.23), then the sources are reflected as sinks, the sinks as sources and curls as curls with the 
same sense of rotation. 


14.1.3.4 Complex Potential 
1. Notion of the Complex Potential 
A field V = V (a, y) is considered in the z, y plane for the zero-divergence and the irrotational case 


with continuous and differentiable components V; (r, y) and V,(x, y) of the vector V. 


a) Zero-divergence field with div V = 0, i.e., ae I ^ 0: That is the integrability condition 
T y 
for the differential equation expressed with the field or stream function V (x, y) 
V Ow 
dv = —V,dr--V;dy —0, (14.314) andthen V, = : ; Vy = i s (14.21b) 


Oy’ 
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For two points P, , P» of the field V the difference W(P) — V(P,) is a measure of the flux through a 
curve connecting the points P, and P», in the case when the whole curve is in the field. 


"NM" OV, ; 
b) Irrotational field with rot V = 0, i.e., E y a = 0: That is the integrability condition for the 
x y 


differential equation with the potential function (x, y) 

Od . OD 
—, V; = —. 
δα ” δν 
If the field is free of sources and rotations, then the functions Φ and V satisfy the Cauchy-Riemann 


differential equations (see 14.1.2.1, p. 732) and both satisfy the Laplace differential equation (A® = 
0, Av — 0). Combining the functions and V into the analytic function 


W = f(z) = O(a, y) - iV(z, y), (14.23) 


then this function is called the complex potential of the field V. 


αφ = V, dx + V, dy = 0, (14.22a) and then V, = (14.22b) 


Then —é(z, y) is the potential of the vector field V in the sense of the usual notation in physics and 
electrotechnics (see 13.3.1.6, 2., p. 721). The level lines of V and @ form an orthogonal net. For the 


derivation of the complex potential and the field vector V the following equations hold: 


W 9Φ δν iy 7 
1 az 


Em or we f'(z) = Ve + iV. (14.24) 


2. Complex Potential of a Homogeneous Field 
The function 

W =az, (14.25) 
with real a is the complex potential of a field whose potential lines are parallel to the y-axis and whose 


direction lines are parallel to the x-axis (Fig. 14.24). A complex a results in a rotation of the field 
(Fig. 14.25). 
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Figure 14.24 Figure 14.25 


3. Complex Potential of Source and Sink 
The complex potential of a field with a strength of source s > 0 at the point z = zp has the equation: 


W = 2 In(z — a). (14.26) 


2m 
A sink with the same intensity has the equation: 


W — -Ê In(z — ~). (14.27) 
2π 


The direction lines run away radially from z = zo, while the potential lines are concentric circles around 
the point 20 (Fig. 14.26). 
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4. Complex Potential of a Source-Sink System 

One gets the complex potential for a source at the point z; and for a sink at the point 25, both having 

the same intensity, by superposition: 
S, z= 


wW (14.28) 


= —]n : 
2*7 z—29 


The potential lines Φ — const form Apollonius circles with respect to z; and 2ο; the direction lines 


V = const are circles through z and 2ο (Fig. 14.27). 
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Figure 14.26 Figure 14.27 
5. Complex Potential of a Dipole 


The complex potential of a dipole with dipole moment M > 0 at the point zo, whose axis encloses an 
angle a with the real axis (Fig. 14.28), has the equation: 


Mei* 


W-2———. 
2n(z — zo) 


(14.29) 
6. Complex Potential of a Curl 
If the intensity of the curl is |I| with real I’ and its center is at the point zo, then its equation is: 


T 
W = — ln(z — 29). (14.30) 


27i 
In comparison with Fig. 14.26, the roles of the direction lines and the potential are interchanged. For 
complex I’ (14.30) gives the potential of a source of curl, whose direction and potential lines form two 
families of spirals orthogonal to each other (Fig. 14.29). 
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14.1.3.5 Superposition Principle 


1. Superposition of Complex Potentials 

A system composed of several sources, sinks, and curls is an additive superposition of single fields, 1.6., 
one gets its function by adding their complex potential and stream functions. Mathematically this is 
possible because of the linearity of the Laplace differential equations A 9 = 0 and AW = 0. 

2. Composition of Vector Fields 

1. Integration Beside addition from complex potentials new fields can be constructed also by inte- 
gration with the application of weight functions. 

W A curl-covering is given on a line segment | with density function o(s). The derivative of the complex 
potential is given by an integral of Cauchy-type (see 14.2.3, p. 748) 


dd - B mS - = E: O ae, (14.31) 
2 7i ας G(s πο z—C¢ 

where ¢(s) is the complex parameter representation of the curve | with arc length parameter s. 
2. Maxwell Diagonal Method If one wants to compose the superposition of two fields with the 
potentials Φι and Φο, then one can draw the potential line figures [[Φι]] and [[»]] so that the value of 
the potential changes by the same amount h between two neighboring lines in both systems. Then the 
lines are directed so that the higher Φ values are on the left-hand side. The lines lying in the diagonal 
direction to the net elements formed by [[Φι]] and [[Φο]] give the potential lines of the field [[2]], whose 
potential is ᾧ -- Φι + by or 6 = Sı — Py. One gets the figure of {[; + Φ.]] if the oriented sides of the net 
elements are added as vectors (Fig. 14.30a), and one gets the figure of |[P, — Φρ]] by subtracting them 
(Fig. 14.30b). The value of the composed potential changes by h at transition from one potential line 
to the next one. 
W Vector lines and potential lines of a source and a sink with an intensity quotient |e;|/|e2| = 3/2 (see 
Fig. 14.31a,b). 


®,+h ®,+h 


o, 9, 
@,+h ®,+h 
a) o, b) [n 


Figure 14.30 


Figure 14.31 
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14.1.3.6 Arbitrary Mappings of the Complex Plane 


A function 

w = f(z=a+iy) = u(x, y) +iv(z,y) (14.32a) 
is defined if the two functions u = u(x, y) and v = v(x, y) with real variables are defined and known. 
The function f(z) must not be analytic, as it was required in conformal mappings. The function w 
maps the z plane into the w plane, i.e., it assigns to every point z, a corresponding point wy. 
a) Transformation of the Coordinate Lines 

y—-c —uc-u(zc, v-vw(zc) x is a parameter; 

z—c—u-u(a,y) υ--υ(ει,υ). y is a parameter. (14.32b) 
b) Transformation of Geometric Figures Geometric figures as curves or regions of the z plane are 
usually transformed into curves or regions of the w plane: 

x= x(t), y — w(t) Σαξ u(z(t),u(t), v =v(z(t) y(t)), tis a parameter. (14.32c) 
W For u = 2x + y, v = x + 2y, the lines y = c are transformed into u = 2x + c, v = x + 2c, hence into 

E u 3 : i : ; 

the lines v = a + 3° The lines x = c, are transformed into the lines v = 2u — 3c, (Fig. 14.5). The 
shaded region in Fig. 14.5a is transformed into the shaded region in Fig. 14.5b. 
c) Riemann Surface Getting the same value w for several different z of the mapping w = f(z), 
then the image of the function consists of several planes “lying on each other”. Cutting these planes 
and connecting them along a curve gives a many-sheeted surface, the so-called many-sheeted Riemann 
surface (see [14.13]). This correspondence can be considered also in an inverse relation, in the case of 
multiple-valued functions as, e.g., the functions V/z, Ln z, Arcsin z, Arctan z. 


Bl w = 2?: While z = re? overruns the entire z plane, i.e., 
0 X y < 2r, the values of w = pe” = r?e?”, cover twice 
the w plane. One can imagine to place two w planes one on 
each other, and cut and connect them along the negative 
real axis according to Fig. 14.32. This surface is called 
the Riemann surface of the function w = z?. The zero 
point is called a branch point. The image of the function e? 
Figure 14.32 (see 14.69) is a Riemann surface of infinitely many sheets. 


(In many cases the planes are cut along the positive real axis. It depends on whether the principal value 
of the complex number is defined for the interval (—7, +7] or for the interval [0, 2π).) 


14.2 Integration in the Complex Plane 


14.2.1 Definite and Indefinite Integral 
14.2.1.1 Definition of the Integral in the Complex Plane 


y 1. Definite Complex Integral 

Suppose f(z) is continuous in a domain G, and the curve C 
is rectifiable, it connects the points A and B, and the whole 
curve is in this domain. The curve C is decomposed between 
the points A and B by arbitrary division points z; into n sub- 
arcs (Fig.14.33). 

A point C; is chosen on every arc segment and forms the sum 


5 f(G)Az with Azi = ži- zi- (14.33a) 
Figure 14.33 = 
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If the limit 
jim, Σ f(G) A zi (14.33b) 


exists for Az; — 0 and n — oo independently of the choice of the points z; and G;, then this limit is 
called the definite complex integral 


r= ft) dz = (©) fro dz (14.33c) 


AB 
along the curve C between the points A and B. The value of the integral usually depends on the path 
of the integral. 
2. Indefinite Complex Integral 
If the definite integral is independent of the path of the integral (see 14.2.2, p. 747), then holds: 


F(z) = / f()dz-C with F(z) = f(z). (14.34) 
Here C is the integration constant which is complex, in general. The function F(z) is called an indefinite 
complex integral. 


The indefinite integrals of the elementary functions of a complex variable can be calculated with the 
same formulas as the integrals of the corresponding elementary function of a real variable. 


ΒΛ: [sinzdz =- cosz + C. HB: [ede=e +0. 


3. Relation between Definite and Indefinite Complex Integrals 
If the function f(z) is analytic (see 14.1.2.1, p. 732) and has an indefinite integral, then the relation 
between its definite and indefinite integral is 


/ flz)dz = EO dz = F(zg) — F(z4). (14.35) 


14.2.1.2 Properties and Evaluation of Complex Integrals 


1. Comparison with the curvilinear integral of the second type 

The definite complex integral has the same properties as the curvilinear integral of the second type (see 
8.3.2, p. 517): 

a) Reversing the direction of the path of integration the integral changes its sign. 

b) Decomposing the path of integration into several parts the value of the total integral is the sum of 
the integrals on the parts. 

2. Estimation of the Value of the Integral 

If the absolute value of the function f(z) does not exceed a positive number M at the points z of the 


path of integration AB which has the length s, then: 


[id «Με if |f(2) x M. (14.36) 
AB 


3. Evaluation of the Complex Integral in Parametric Representation 


If the path of integration AB (or the curve C) is given in the form 
2 — c(t), y = y(t) (14.37) 
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where x and y are differentiable functions of t and the t values for the initial and endpoint are t4 and 
tg, then the definite complex integral can be calculated with two real integrals. Separating the real and 
the imaginary parts of the integrand the integral is 


(ο) f f) gs fua — v dy) +i f(vde+ u dy) 
= / [u(£)z () — v(£)y (£)] dt +i i [v(£)a"(£) + u(t)y (6)] dt (14.382) 
with f(z)-— u(z,y)-civ(rm,y) z-—c-ciy. (14.38b) 


B 

The notation (C) Jj f(z) dz means that the definite integral is calculated along the curve C between 
A 

the points A and B. The notation | f(z) dz is also often used, and has the same meaning. 


AB 
_ /[- J (z — το) dz (n € Z). Let the curve C be a circle around the point zo with radius ro: 
(C) 


X = To + rocost, y = yo + rosint with 0 < t < 27. For every point z of the curve C: z = x +i y = 
zo + ro(cost +isint), dz = ro(— sint + icost)dt. After substituting these values and transforming 


2π 
according to the de Moivre formula follows: J = rj. / (cos nt + isin nt)(— sint + icost) dt 
0 


2π EX 
=at [i cos(n + 1)t — sin(n + 1)t] dt 5 or : f Ei 


4. Independence ofthe Path of Integration 

Suppose a function f(z) of a complex variable is defined in a simply connected domain. The integral 
(14.33c) of the function can be independent of the path of integration, which connects the fixed points 
A(za) and B(zg). A sufficient and necessary condition is that the function f(z) is analytic in this 
domain, i.e., u and v satisfy the Cauchy-Riemann differential equations (14.4, p. 732). Then also the 
equality (14.35) is valid. If a domain is bounded by a simple closed curve, then the domain is simply 
connected. 


5. Complex Integral along a Closed Curve 

Suppose f(z) is analytic in a simply connected domain G. Integrating the function f(z) along a closed 
curve C which is the boundary of this domain, the value of the integral according to the Cauchy integral 
theorem is equal to zero (see 14.2.2, p. 747): 


(ο) { f) z — 0. (14.39) 


If f(z) has singular points in this domain, then the integral is calculated by using the residue theorem 
(see 14.3.5.5, p. 754), or by the formula (14.384). 


R 1 : : : : 
W The function f(z) = ———, with a singular point at z = a has an integral value for the closed curve 
z—a 
dz 


C directed counterclockwise around a (Fig.14.34) Of = 2ni Resf (z)|,-, = 27i. 
z 


--α ο 
14.2.2 Cauchy Integral Theorem 
14.2.2.1 Cauchy Integral Theorem for Simply Connected Domains 


If a function f(z) is analytic in a simply connected domain G, then two equivalent statements hold: 
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a) The integral is equal to zero along any closed curve C inside of G: 
(C) ϕ f(z)dz — 0. (14.40) 


B 
b) The value of the integral / f(z) dz is independent of the curve connecting the points A and B and 
JA 


running inside of G, i.e., it depends only on A and B. 
This is the Cauchy integral theorem. 


14.2.2.2 Cauchy Integral Theorem for Multiply Connected Domains 
If C, C, Co, .. ., C, are simple closed curves such that the curve C encloses all the C, (v = 1,2,...,n), 
none of the curves C, encloses or intersects another one, and furthermore the function f(z) is analytic 


in a domain G which contains the curves and the region between C and Ον, i.e., at least the region 
shaded in Fig.14.35, then 


jioe- ἡ fae b fav 
(C) ( (C2) 


σι) 


ϕ f(z) dz, (14.41) 
(Cn) 

if the curves C, Ci, ..., Cn are oriented in the same direction, e.g., counterclockwise. 

This theorem is useful for the calculation of an integral along a closed curve C, if it also encloses singular 
points of the function f(z) (see 14.3.5.5, p. 754). 


ya 


c 
T E ND 
0 x 
Figure 14.34 Figure 14.35 Figure 14.36 
r z—1 . ' - 5 
W Calculate the integral f (3-1) dz, where C is a curve enclosing the origin and the point z = —1 

J z(z 
(C) 


(Fig. 14.36). Applying the Cauchy integral theorem, the integral along C is equal to the sum of the 
integrals along C4 and C5, where ΟἹ is a circle around the origin with radius rj = 1/2 and C» is a circle 


around the point z = —1 with radius το = 1/2. The integrand can be decomposed into partial fractions. 
z—1 2d 2dz d. d. 
Then follows: f dz = ϕ ce | f f E f m 0 + 47i — 22i — 0 = 2πι. 
J z(z+1) z+1 z+1 P 2 
C (C1) C2) (C1) (C2) 


(Compare the integrals with the example in 14.2.1.2, 3., p. 747.) 


14.2.3 Cauchy Integral Formulas 
14.2.3.1 Analytic Function on the Interior of a Domain 


If f(z) is analytic on a simple closed curve C and on the simply connected domain inside it, then the 
following representation is valid for every interior point z of this domain (Fig. 14.37): 


MEN E 


~ Oni —z 
ne t 


d¢ (Cauchy integral formula) (14.42) 
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where ¢ traces the curve C counterclockwise. With this formula, the values of an analytic function in 
the interior of a domain are expressed by the values of the function on the boundary of this domain. 
The existence and the integral representation of the n-th derivative of the function analytic on the 
domain G follows from (14.42): 


fO) = £ j eras (14.43) 
©) 


Consequently, if a complex function is differentiable, i.e., it is analytic, then it is differentiable arbi- 
trarily many times. In contrast to this, in the real case differentiability does not include repeated 
differentiability. 

The equations (14.42) and (14.43) are called the Cauchy integral formulas. 

14.2.3.2 Analytic Function on the Exterior of a Domain 

If a function f(z) is analytic on the entire part of the plane outside of a closed curve of integration 
C, then the values and the derivatives of the function f(z) at a point z of this domain can be given 
with the same Cauchy formulas (14.42), (14.43), but the orientation of the curve C is now clockwise 
(Fig. 14.38). 

Also certain real integrals can be calculated with the help of the Cauchy integral formulas (see 14.4, 
p. 754). 

y 


Οὔ f: x 
Figure 14.37 Figure 14.38 


14.3 Power Series Expansion of Analytic Functions 


14.3.1 Convergence of Series with Complex Terms 


14.3.1.1 Convergence of a Number Sequence with Complex Terms 
An infinite sequence of complex numbers 21, z2,...,2n,--- has a limit z (z = Jim. Zn) if for every 
arbitrarily given positive ε there exists an no such that the inequality |z — z,| < & holds for every 
n > no, i.e., from a certain no the points representing the numbers Zn, 2,41, ... are inside of a circle 
with radius ε and center at z. 
W If the expression { {/a} means the root with the smallest non-negative argument, then the limit 
jim, (Va) = 1 is valid for arbitrarily complex a 4 0 (Fig. 14.39). 
14.3.1.2 Convergence of an Infinite Series with Complex Terms 
A series αι + a2+:+++a,+-++ with complex terms a; converges to the number s, the sum of the series, 
if the sequence of the partial sums s,, with 

Sn =A, +a2 +: +an (n=1,2,...) (14.44) 


converges. Connecting the points corresponding to the numbers s, = αι + a2 +-+ + an in the z plane 
by a broken line, then convergence means that the end of the broken line approaches the point s. 


2 p j 2 2 
MA: i+—+2+4+—4+:::. WB:i----—--- (Fig. 14.40). 
ud hu τ. Et (Fig ) 
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Figure 14.39 Figure 14.40 


A series is called absolutely convergent (sce W B), if the series of absolute values of the terms |ai| + 
[a2] + |aa| +--+ is also convergent. The series is called conditionally convergent (sce W A) if the series 
is convergent but not absolutely convergent. If the terms of a series are functions f;(z), like 

Fale) + falz) Εντ fh) (14.45) 


then its sum is a function defined for the values z for which the series of the function values is convergent. 


14.3.1.3 Power Series with Complex Terms 


1. Convergence 
A power series with complex coefficients has the form 
P(z — zo) = ao + ay(z — zo) + aa(z — zo +++ + an(z — Ζω)" + °°, (14.46a) 


where zy is a fixed point in the complex plane and the coefficients αν are complex constants (which can 
also have real values). For zo = 0 the power series has the form 


P(z) = ag + az + αρα) - + an2” ee. (14.46b) 


If the power series P(z — zo) is convergent for a value 24, then it is absolutely and uniformly convergent 
for every z in the interior of the circle with radius r = |z, — zo| and center at zo. 


2. Circle of Convergence 


The limit between the domain of convergence and the domain of divergence of a complex power series 
is a uniquely defined circle. One determines its radius just as in the real case, if the limits 


1 g 
r= —— or r= lim 


lim n, janl n—oo 
noo 


απ 


(14.47) 


An+1 


exist. If the series is divergent everywhere except at z = 29, then r = 0; if it is convergent everywhere, 
then r = oo. The behavior of the power series on the boundary circle of the domain of convergence 
should be investigated point by point. 


oo z^ 

W The power series P(z) = >> — with radius of convergence r = 1 is divergent for z = 1 (harmonic 
n=1 TL 

series) and convergent for z — —1 (according to the Leibniz criteria for alternating series (see 7.2.3.3,1., 


p. 463)). This power series is convergent for all further points of the unit circle |z| = 1 except the point 
gp 

3. Derivative of Power Series in the Circle of Convergence 

Every power series represents an analytic function f(z) inside of the circle of convergence. One gets the 
derivative by a term-by-term differentiation. The derivative series has the same radius of convergence 
as the original one. 
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4. Integral of Power Series in the Circle of Convergence 
The power series expansion of the integral / | f (C) ας can be obtained by a term-by-term integration 
PE 


of the power series of f(z). The radius of convergence remains the same. 


14.3.2 Taylor Series 


Every function f(z) analytic in a domain G can be expanded uniquely into a power series of the form 
oo 
f(z) = ΣῚ an(z — zo)" (Taylor series), (14.48a) 
n=0 
for any zo in G, where the circle of convergence is the greatest circle around 20 which belongs entirely 
to the domain G (Fig. 14.41). The coefficients an are complex numbers in general; for them one gets: 


(n) 
νο μυς (14.48b) 
n! 
The Taylor series can be written in the form 
ily "ία . (m) (». 
f(z) = f(z) 4 Po) (z — 29) 4 f Cog 20) + f E ο); 20)" ee. (14.456) 


Every power series is the Taylor expansion of its sum function in the interior of its circle of convergence. 


W Examples of Taylor expansions are the series representations of the functions e^, sin z, cos z, sinh z, 
and cosh z in 14.5.2, p. 758. 


Figure 14.41 Figure 14.42 Figure 14.43 


14.3.3 Principle of Analytic Continuation 


Let consider the case when the circles of convergence Ko around 20 and Kı around 2ι of the two power 
series 
oo oo 
e — n " -— κ. "S n a 
fol2)= ΣΣ al- and f(z) = D> hle- ει) (14.49a) 
n=0 n=0 


have a certain common domain (Fig. 14.42) and in this domain they are equal: 


folz) = fil2). (14.49b) 
Then both power series are the Taylor expansions of the same analytic function f(z), belonging to the 
points zo and σι. The function fi(z) is called the analytic continuation into K, of the function fo(z) 
defined only in Ko. 
oo 
Β The geometric series fo(z) = Y; 2” with the circle of convergence Ko (ro = 1) around z = 0 and 
n=0 


1 oo z— i n . . . ; 
filz) = pes Σ E Ὁ) with the circle of convergence K, (rı = V2) around z; = i have the same 
m n=0 T 
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analytic function f(z) = 1/(1 — z) as their sum in their own circle of convergence, consequently also 
on the common part of them (doubly shaded region in Fig. 14.42) for z Z 1. So, fi(z) is the analytic 
continuation of fo(z) from Ko into Ky (and conversely). 


14.3.4 Laurent Expansion 


Every function f(z), which is analytic in the interior of a circular ring between two concentric circles 
with center 20 and radii rj and rs, can be expanded into a generalized power series, into the so-called 
Laurent series: 


x a-k a-k+1 
αὶ -- : £a)" teed } fi 
f(z) p» as(z o) t (=a)  (z— zw) 1 
a , . 
tZ + 4 a+ a(z- zo) + a(z — zo) - + ak(z — zo) ἠ-:--.. (14.50a) 
= 20 


The coefficients αμ are usually complex and they are uniquely defined by the formula 


ru fc (n = 0, +1,+2,...), (14.50b) 


ἽΝ ri E -- n+l 
πο 20) 


where C denotes an arbitrary closed curve which is in the circular ring rı < |z| < r2, and the circle 
with radius Τι is inside of it, and its orientation is counterclockwise (Fig. 14.43). If the domain G of 
the function f(z) is larger than the circular ring, then the domain of convergence of the Laurent series 
is the largest circular ring with center zp lying entirely in G. 


W The Laurent series expansion of the function f(z) = σπα is determined around 2 = 0 in 
the circular ring 1 < |z| < 2 where f(z) is analytic. First the function f(z) is decomposed into partial 
fractions: f(z) = F 1 2 — Since |1/z| < 1 and |z/2| < 1 holds in the considered domain, the two 
terms of this decomposition can be written as the sums of geometric series being absolutely convergent 
in the entire circular ring 1 < |z| < 2. One gets: 


1 1 1 = d κ Καντ 
ο-τ-ης-ῃ 1 rums ο 
α[]-- 211 n=1 n=0 
z 2 — —— 
[2| >1 [2| « 2 
> ie for n= —1,-2,..., 
= anz” with ap, = u 
S στι for m. —0,1,2,.... 


14.3.5 Isolated Singular Points and the Residue Theorem 
14.3.5.1 Isolated Singular Points 


If a function f(z) is analytic in the neighborhood of a point zo but not at the point zo itself, then zo is 
called an isolated singular point of the function f(z). If f(z) can be expanded into a Laurent series in 
the neighborhood of zo 
oo 
18) - YS a(z— πο)", (14.51) 
n-—-—oo 

then the isolated singular point can be classified by the behavior of the Laurent series: 
1. If the Laurent series does not contain any term with a negative power of (z — zo), i.e., an = 0 for 
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n < 0 holds, then the Laurent series is a Taylor series with coefficients given by the Cauchy integral 
formula 
1 =n— f? (zo 
a fc ~ 9) ¢(C) de = PE (eoi, ). (14.52) 
2πὶ . n! 
(K) 

In this case, the function f(z) itself is either analytic at the point zo and f (zo) = αρ or zo is a removable 
singularity. 

2. Ifthe Laurent series contains a finite number of terms with negative powers of (z — 0). 1.6., dm # 0, 
an = O forn < m < 0, then zo is called a pole, a pole of order rn, or a pole of multiplicity m. Multiplying 
by (z — 29)’, and not by any lower power, then f(z) is transformed into a function which is analytic at 
zo and in its neighborhood. 


1 1 
Β /(:)-— 5 c + 5) has a pole of order one at z = 0. 


3. If the Laurent series contains an infinite number of terms with negative powers of (z — zo), then zo 
is an essential singularity of the function f(z). 
Approaching a pole, | f(z)| tends to oo. Approaching an essential singularity, f(z) gets arbitrarily close 


to any complex number c. 
oo 


Bl The function f(z) = e!/*, whose Laurent series is f(z) = 5 
=Ó; n=0 
14.3.5.2 Meromorphic Functions 


If an otherwise holomorphic function has only isolated singularities which are poles, then it is called 
meromorphic. A meromorphic function can always be represented as the quotient of two analytic func- 
tions. 


σι has an essential singularity at 
nl z 


W Examples of functions meromorphic on the whole plane are the rational functions which have a finite 
, sinz COS Z 

number of poles, and also transcendental functions such as tan z — and cotz — 
COS z 


sin 2 


14.3.5.3 Elliptic Functions 
Elliptic functions are double periodic functions whose singularities are poles, i.e., they are meromorphic 
functions with two independent periods (see 14.6, p. 762). If the two periods are w, and w», which are 
in a non-real relation, then 


f(z + mw + nw) = f(z) (m,n = 0,41, +2,...;Im (=) # 0). (14.53) 


The range of f(z) is already attained in a primitive period parallelogram with the points 0, w1, w1 + 
W2, W2. 


14.3.5.4 Residue 


With zp as an isolated singularity of the function f(z) the coefficient a..; of the power (z — z9) 1η the 
Laurent expansion of f(z) valid in the neighborhood of zg is called the residue of the function f(z) at 
the point 20. According to (14.50b) 


1 
a1 ο. ϕ ΚΟ ἀξ (14.54) 
(K) 
holds. The residue belonging to a pole of order m can be calculated by the formula 
1 d" 
a_, = Res f(z)],—4, = lim [f(z)(z — z9)""]. (14.54b) 


z>% (m — 1)! dz m-1 
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If the function f(z) can be represented as a quotient f(z) = φ(α)/ψ(α), where the functions y(z) and 
p(z) are analytic at the point z = zo and zo is a simple root of the function v(z), i.e., (zo) = 0 and 
('(zo) # 0 holds, then the point z = zo is a pole of order one of the function f(z). It follows from 
(14.54b) that 


ze] v (20) 

Res = T 14.54c 
be EC ( ) 
If z is a root of multiplicity m of the function ψ(α), i.e., (zo) = w'(zo) = ::: = v-9(z) = 


0, Y™ (z0) 4 0 holds, then the point z = zo is a pole of order m of f(z). 
14.3.5.5 Residue Theorem 


With the help of residues one can calculate the integral of a function along a closed curve enclosing 
isolated singular points (Fig. 14.44). 

If the function f(z) is single valued and analytic in a simply connected domain G except at a finite 
number of points zo, 21, 22, ... , Zn, and the domain is bounded by the closed curve C, then the value of 
the integral of the function along this closed curve in a counterclockwise direction is the product of 27i 
and the sum of the residues in all these singular points: 


j Fe) de = S Res f(e) icu (14.55) 


ὦ) k=0 


Β The function f(z) = e*/(z? + 1) has poles of order one at z;» = +i. The corresponding residues 
have the sum sin 1. If K is a circle around the origin with radius r > 1, then 


e ο ο ον fè e! TA 
y aya l > (s x) emn 


y y 


Figure 14.44 Figure 14.45 


14.4 Evaluation of Real Integrals by Complex Integrals 
14.4.1 Application of Cauchy Integral Formulas 


The value of certain real integrals can be calculated with the help of the Cauchy integral formula. 


W The function f(z) = e^, which is analytic in the whole z plane, can be represented with the Cauchy 
integral formula (14.42), where the path of integration C' is a circle with center z and radius r. The 
equation of the circle is À = z + re^. From (14.43) follows 


, πι e πι qe qe) O fF 2 create rinsing 
e = : dc = - — ire? dip μα e cos gir sin p—iny dy, so that 
27i τ (6 — z)" 2miJo-o r'tleie(nt1) 2mr^ Jo 
a 


Irr” 


n! 


2π "M 27 2m 
= / er cose H(rsingne) dy = | e" ***[cos(r sin y — ng)] dp + if e" *?[sin(r sin y — ng)] dy. 
0 0 0 
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2m 2rr” 
Comparing real and imaginary part gives e”? cos(r sin y — ny) dp = ——— , since the imaginary 
A Y pi ) 
0 n! 


part is equal to zero. 


14.4.2 Application ofthe Residue Theorem 


Several definite integrals of real functions with one variable can be calculated with the help of the 
residue theorem. If f(z) is a function which is analytic in the whole upper half of the complex plane 
including the real axis except the singular points 21, 22, ... , Zn above the real axis (Fig. 14.45), and if 
one of the roots of the equation f(1/z) = 0 has multiplicity m > 2 (see 1.6.3.1, 1., p. 43), then 


Too 4 
/ f(x) dz = mi Y Res f (2)].—... (14.56) 
—oo i=l 
Calculation of the int 1/7 d: μα ti ΠΕ ! P. gi 
'alculation of the integra ——.: The equatio = z= = 0 has 
u. 101 16 1ntegr P" ü + ση L quation T ( I D (22 d E 1: 
DT 
x 
a root of order six at x = 0. The function w = UF AF has a single singular point z = i in the upper 


half-plane, which is a pole of order 3, since the equation (1 + 2°)? = 0 has two triple roots at i and —i. 
The residue is according to (14.54b): 


1 1 @ | (z-i)8 d 2-11} αἲ "E ΜΉ. 

ατομα” lé σσ] s Benq (Fea) = {πῶ 1’ eni 
5 +00 

follows that Res ü vsu = 6(z i) ?L. ap i and with (14.56): f. f(x)dx = 


3 3 
2πὶ (-5i) =" For further applications of residue theory see, e.g., [14.12]. 


14.4.3 Application of the Jordan Lemma 
14.4.3.1 Jordan Lemma 
In many cases, real improper integrals with an infinite interval of integration can be calculated by 


complex integrals along a closed curve. To avoid the always recur- 


= de C rent estimations, the Jordan lemma is used about improper inte- 
ae XR grals of the form 
f X / f(zé dz, (14.57a) 
i \ (Cx) 
-R g οἱ τ Rx where C is the half-circle arc with center at the origin and with the 
radius R in the upper half of the z plane (Fig. 14.46). The Jordan 
Figure 14.46 lemma distinguishes the following cases: 


a) o > 0: If f(z) tends to zero uniformly in the upper half-plane and also on the real axis for |z| — oo 
and if a > 0 holds, then for R — oo is valid: 

/ f(z)e** dz 30. (14.57b) 

(Cn) 

b) o = 0: If the expression z f(z) tends to zero uniformly for |z| — oo, then the above statement 
(14.57b) is also valid in the case a = 0. 
c) a < 0: Ifthe half-circle Cs is now below the real axis, then the statement corresponding to (14.57b) 
is also valid for a < 0. 
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d) The statement to (14.57b) is also valid if only an arc segment is considered instead of the complete 
half-circle. 


e) When Ο is a half-circle or an arc segment in the left half-plane with a > 0, or in the right one with 
a « 0, then the statement corresponding to (14.57b) is valid for the integral in the form 


/ f(z)e* dz. (14.57c) 
(Cp) 


14.4.3.2 Examples of the Jordan Lemma 


1. Evaluation of the Integral / "I dr (a>0,a>0). (14.58a) 


The ee complex integral is assigned to the above real E d 


R R 
T [5 sin aw | A =i f zsinor dira / COS OME ο... IE: e (14.58b) 
x ere p +a s 23 + a? gg 


even en fonction aa RE 
= 0 (odd integrand) 


ACT 
The very last of these integrals is part of the complex integral j στα dz. The curve C contains the 
z+a 


Cr half-circle defined above and the part of the real axis between the values bx and R (R > Jaj). 
The complex integrand has the only singular point in the upper half-plane z = ai. From the residue 


zelo? zez zelaz 
theorem follows: J = $3 zdz = 2nilim | —,(z — ai)! = 2ri lim - = mie ^", hence 
r^ 22 +a? zai | 2” + 4 zi z + αἱ 
eine rer 
1 / - dz 4 / - ; dr = mie ^". From lim J and from the Jordan lemma follows 
s 22 +a? 5 x +a? Ro 
OR - 
OG. 95) 
z sinar T 
[ode —-e"" (a>0, a>0). (14.58c) 
2 2 2 QUSS 
Q9 +a 


Several further integrals can be evaluated in a similar way (see Table 21.8, p. 1098). 


2. Sine Integral (see also 8.2.5,1., (8.95), p. 513) 


y i 
eee ee ; Ῥρῖῃ a ; "UP ; T 
2n "se Ca The integral / — dx is called the sine integral or the integral sine 
y a E 
΄ ^ (see also 8.2.5, 1., p. 513). Analogously to the previous example, 
1 F | e 
R > t 0 ; > Ro here the complex integral J = f — dz is investigated with the curve 
ή ey 
Cr according to Fig. 14.47. The integrand of the complex integral 
Figure 14.47 has a pole of first order at z — 0, so 
ε΄ sin £ T e 
I = 2m lim ilz ‘| = = Οπὶ, hence I = 2i f= dx + ene dip + [te = 2mi. This 
T 7 = 


limit is evaluated as R — oo, r — 0, where the second integral tends to 1 uniformly for r — 0 with 
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respect to y, i.e., the limiting process r — 0 can be done behind the integral sign. With the Jordan 
lemma follows: 
sina sina π 

2ἱ / απ + πὶ = 275i, hence J dx = 5 


(14.59) 


0 0 


3. Step Function 
Discontinuous real functions can be represented as complex integrals. The so-called step function (see 
also 15.2.1.3, p. 774) is an example: 


1 E 1 for t>0, 
re) = / ode =} 1/2 for t=0, (14.60) 
2mi 2. e 0  fort«0. 


The symbol ——— denotes a path of integration along the real axis (| R | o6) going round the origin 
(Fig. 14.47). 
If t denotes time, then the function (t) = cF (t — to) represents a quantity which jumps at time t = to 
from 0 through the value c/2 to the value c. It is called a step function or also a Heaviside function. It 
is used in the electrotechnics to describe suddenly occurring voltage or current jumps. 
4. Rectangular Pulse 
A further example of the application of complex integrals and the Jordan lemma is the representation 
of the rectangular pulse (see also 15.2.1.3, p. 774): 
1 gi-0: 1 eila—t)z 0 for t € a and t » b, 
v= | ΛΙ dz=}1 for a<t<b, (14.61) 
2πὶ E 2πὶ 8 1/2 for t=a andt=b. 


——— ——— 


5. Fresnel Integrals 


y 'To derive the Fresnel integral 
III Ῥ : Ῥ ; 1 
I [ n6 dx = [ 56d TE 1/2 (14.62) 
Ὁ Ὁ 


the integral J = J e~* dz has to be investigated on the closed path 
K 
of integration shown in Fig. 14.48. According to the Cauchy inte- 
Figure 14.48 gral " " 
theorem holds: J = Jy + (yy + Aqyy = 0 with 7p = / e” dz, Ig = iR f e R'(cos2ptisin 2) +iv dy, 
0 0 
in 9. 1 RO. RO. 
lyp-e4 f e" dr= 5V2(1 +i) i [ sinr? dr — i cos r? arl. Estimation of ΠΠ: Since |i] = |e] = 
R Jo 0 


π/4 P R re - R m? ΜΗ 
1 (7 real) holds, one gets: [1| < RÍ g FP cos 2e qiu =f es? qo + =f ης: 
0 0 4 Ja 


i e E cosa 


R f% moa R 7/2 sing πι aR og cos π 
cfg SG τ/ ne * "ST dec το ot : (o< «5).Ρ form- 
2 / ‘ oe 2Ja sin α΄ Ld E ' 2Rsina απο πο. 


1 
ing the limiting process jm I yields the values of the integrals Jy and ΠΠ: jm. = gv" ; Jim. Πιτ 


0. The given formulas (14.62) can be obtained by separating the real and imaginary parts. 
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14.5 Algebraic and Elementary Transcendental Functions 


14.5.1 Algebraic Functions 
1. Definition 


A function which is the result of finitely many algebraic operations performed with z and maybe also 
with finitely many constants, is called an algebraic function. In general, a complex algebraic function 
w(z) can be defined in an implicit way as a polynomial, just as its real analogue 

az w + agg 2492 +e tape” w™ = 0. (14.63) 
It happens that w cannot be expressed explicitly. 


2. Examples of Algebraic Functions 


: : 1 
Linear function: w = az +b. (14.64) Inverse function: w= -. (14.65) 
z 
Quadratic function: w= 2”. (14.66) Square root function: w= V2? — a?. (14.67) 
2-1 


(14.68) 


Fractional linear function: w = 


z—i 


14.5.2 Elementary Transcendental Functions 


'The complex transcendental functions have definitions corresponding to the transcendental real func- 
tions, just as in the case of the algebraic functions. For a detailed discussion of them see, e.g., [21.1] or 
(21.12]. 

1. Natural Exponential Function 


ec -- 1 l + | Fen (14.69 
The series is absolutely convergent in the whole z plane. 
a) Pure imaginary exponent iy: This is valid according to the Euler relation (see 1.5.2.4, p. 36): 

e” = cosy +isiny with e" = —1. (14.70 
b) General case z = x + iy: 


e = eT HY = etel! = e" (cos y + isiny), (14.71a 
Re (e7) = e” cosy, Im(e*)=e*siny, |e*| =e", arg(e*) =y. (14.71b 
The function οὔ is periodic, its period is 2ri: e? = ο τι (k= 0,41.+2,...). (14.71c 
In particular: e° = e?*7i = 1, ePk+lmi = 1 (14.71d 


Exponential form of a complex number (see 1.5.2.4, p. 36): 
a+ib= pe. (14.72 
d) Euler relation for complex numbers: 


ς 


— 


e? = cos z + isin z, (14.73a) e™ = cos z — isin 2. (14.73b 
2. Natural Logarithm 
w=Lnz, if z=e”. (14.74a 
Since z = pel, one can write: 
Lnz=Inp+i(p+2k7) and (14.74b 


Re(Lnz)=Inp, Im(Lnz) = y+ 2k (b= 0,41, 2,...). (14.74c 
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Since Ln z is a multiple-valued function (see 2.8.2, p. 86), we usually give only the principal value of the 
logarithm In z: 


Inz=np+iy (=r < p < +r). (14.74d) 
The function Ln z is defined for every complex number, except for z = 0. 


3. General Exponential Function 


aoe. (14.75a) 
a? (a zz 0) is a multiple-valued function (see 2.8.2, p. 86) with principal value 
αἲ = gna, (14.75b) 


4. Trigonometric Functions and Hyperbolic Functions 


. iè e? 29 αὖ 
sin 2 = X —4— t 5i (14.762) 
ez 3 e E E 
COS Z 5 1 9i + rie (14.76b) 
. ος - ο-3 αὖ 25 
sinh z 5 2 3i 5i μμ, (14.77a) 
am 5—£ oe 4 
ete est = ο. (14.77b) 


All four series are convergent on the entire z plane and they are all periodic. The period of the functions 
(14.76a,b) is 27, the period of the functions (14.77a,b) is 27 i. 
The relations between these functions for any real or complex z are: 
siniz = isinh z, (14.78a) cosiz = cosh z, (14.78b) 
sinhiz = isin z, (14.79a) coshiz = cos z. (14.79b 
The transformation formulas of the real trigonometric and hyperbolic functions (see 2.7.2, p. 81, and 
2.9.3, p. 91) are also valid for the complex functions. The values of the functions sin z, cos z, sinh z, and 
cosh z for the argument z = x +i y can be calculated with the help of the formulas sin(a +b), cos(a +b), 
sinh(a + b), and cosh(a + b) or by using the Euler relation (see 1.5.2.4, p. 36). 


E cos(x + iy) = cosacosiy — sina siniy = cos x cosh y — isin sinh y. (14.80 
Therefore: 
Re (cos z) = cos Re (z) cosh Im (2). (14.81a 
Im (cos z) = — sin Re (z) sinh Im (2). (14.81b 
The functions tan z, cot z, tanh z, and coth z are defined by the following formulas: 
tanz = 2n? y ob τας ; (14.82a) tanhz = ΞΕ coth z = cosh .  (14.82b 
z sin z cosh z sinh z 


5. Inverse Trigonometric Functions and Inverse Hyperbolic Functions 


These functions are many-valued functions, and one can express them with the help of the logarithm 
function: 


Arcsin z = —iLn (iz + Vl— 2), (14.83a) Arsinh z = Ln (z + Vz? + 1), (14.83b) 

Arccos z = —i Ln (z + vz? --1), (14.84a) Arcosh z = Ln (z + V2? — 1), (14.84b) 
1 1112 dn 

Arctanz = j;Ln 1 = τ (14.854) Artanh z = jLn 1 i : ; (14.85b) 
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1 iz+1 
Arccot z = —— Ln tee 
2i 


by the same formulas using the principal value of the logarithm In z: 
arcsin z = —i In(iz + v1 — z?), 14.872) 
arccos z = —i ln(z + Vz? — 1), 14.884) 


ig= 1’ 


14.86a) Arcoth z = sin : + : ; 4.86b 
The principal values of the inverse trigonometric and the inverse hyperbolic functions can be expressed 
arsinh z = ln(z + V2? + 1), 4.87b 

arcosh z = ln(z + V2? — 1), 4.88b 

14.89a) artanh z = In ; = - ; 4.89b 

14.90a) arcoth z — Lin Z + : ; 4.90b 


6. Real and Imaginary Part of the Trigonometric and Hyperbolic Functions 


Table 14.1 Real and imaginary parts of the trigonometric and hyperbolic functions 


1, l-ciz 
arctanz = — ln — 
2i 1-12 
1 iz 
arccot z = —— mi = E 
2i 1-1 
(See Table 14.1) 
Function 
w = f(x + iy) 


Real part Re (w) 


Imaginary part Im (w) 


sin(x + iy) 
cos(x + iy) 


tan(x + iy) 
sinh(x + iy) 
cosh(a + i 


tanh(xz + iy) 


sin x cosh y 
cos x cosh y 
sin 2x 
cos 2x + cosh 2y 
sinh x cos y 
cosh x cos y 
sinh 2x 
cosh 2a + cos 2y 


T cosh 2x + cos 2y 


+ cos x sinh y 

+ sin x sinh y 
sinh 2y 

cos 2x + cosh 2y 
+ cosh x sin y 
+sinh z sin y 
sin 2y 


7. Absolute Values and Arguments of the Trigonometric and Hyperbolic Func- 


tions (See Table 14.2) 


Table 14.2 Absolute values and arguments of the trigonometric and hyperbolic functions 


cos(r + iy) 
sinh(x + iy) 


cosh(x + iy) 


\/cos? x + sinh? y 


sinh? z + sin? y 


H A D 
sinh” x + cos? y 


Function 
Absolute value |w Argument arg w 
"OR |w] g g 
sin(x + iy) ysin? z + sinh? y c arctan(cot x tanh y 


T arctan 


( 
(t 

ΞΕ arctan(coth x tany 
(t 


+ arctan(tanh z tan y 


anz tanhy 


SoS. Ss Ss 


14.5.3 Description of Curves in Complex Form 


A complex function of one real variable t can be represented in parameter form: 


z = z(t) + iy(t) = f(t). 


(14.91) 


As t changes, the points z draw a curve z(t). Now, the equations and the corresponding graphical 
representations of the line, circle, hyperbola, ellipse, and logarithmic spiral are represented. 
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1. Straight Line 
a) Line through a point (σι, ϱ) (φ is the angle with the x-axis, Fig. 14.49a): 


z= z + te. (14.92a 
b) Line through two points σι, 2» (Fig. 14.49b): 
z = z1 + t(z2 — 21). (14.92b 
2. Circle 
a) Circle with radius r, center at the point zọ = 0 (Fig. 14.504): 
z=re (|z| 5r). (14.93a 
b) Circle with radius r, center at the point zy (Fig. 14.50b): 
z= z + ret (\z — zo| =r). (14.93b 
y ya y 
21 22 
a 
ol x 0 X 0 x 
a) b) b) 
Figure 14.49 Figure 14.50 
3. Ellipse 
2 2 
a) Ellipse, Normal Form -. + i = ] (Fig. 14.514): 
a 
"- P it Lit . a+b a—b 
z = acost + ibsin t (14.94a) or z = ce* c de" (14.94b) with c = , d= m (14.94c) 
, € and d are arbitrary real numbers. [Ρ 
b) Ellipse, General Form (Fig. 14.511): The center is at z;, the axes are rotated by an angle. 
z—z 4 ce* 4 de“, (14.95) 
Here c and d are arbitrary complex numbers, they determine the length of the axis of the ellipse and 
the angle of rotation. 
Y y! 
» 
x 
» 
0 x 
a) b) 
Figure 14.51 Figure 14.52 


2 γῇ 
4. Hyperbola, Normal Form a = a = 1 (Fig. 14.52): 
a 
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z = acosht + ibsinht (14.964) 
or 
z — ce 4 ce *, (14.97) y 
where c and € are conjugate complex numbers: 
c=, ε-5- (14.98) z 
5. Logarithmic Spiral (Fig. 14.53): 
z=ae™, (14.99) 
where a and b are arbitrary complex numbers. Figure 14.53 


14.6 Elliptic Functions 
14.6.1 Relation to Elliptic Integrals 


Integrals in the form (8.22) with integrands R(x, γ P(x)) can not be integrated in closed form if P(x) is 
a polynomial of degree three or four, except in some special cases, but they are calculated numerically as 
elliptic integrals (see 8.1.4.3, p. 490). The inverse functions of elliptic integrals are the elliptic functions. 
They are similar to the trigonometric functions and they can be considered as their generalization. As 
an illustration the special case 
/α- θά τα (|u| < 1). (14.100) 
0 
is considered. 
a) There is a relation between the trigonometric function u = sin x and the principal value of its inverse 
function 
π π 
ποστς 
2^ 2 
b) the integral (14.100) is equal to arcsin u. The sine function can be considered as the inverse function 
of the integral (14.100). Analogies are valid for the elliptic integrals. 


u=sing $ x = arcsinu for palaut (14.101) 


W The period of a mathematical pendulum, with mass m, hanging on a non-elastic weightless thread 
of length | (Fig. 14.54), can be calculated by a second-order non-linear differential equation. This 
equation follows from the balance of the forces acting on the mass of the pendulum: 
Πο] 
dt? 


g l d | (do? d 
+2 sind =0 with 0(0) = vo, (0) — 0 or — 22 “(cos 0). (14.102a) 
1 dt | \ dt 
The relation between the length | and the amplitude s from the normal position is s = lù, so ὁ = | Å and 
S = lV hold. The force acting on the mass is F = mg, where g is the acceleration due to gravity (see 
Table 21.2, p. 1053), and it is decomposed into a normal component Fy and a tangential component 
Fr with respect to its path (Fig. 14.54). The normal component Fy = mg cos% is balanced by the 
thread stress. Since it is perpendicular to the direction of motion, it has no effect to the equation of 
motion. The tangential component Fr yields the acceleration of the motion. Fr = ms = ml) = 
—mgsinv. The tangential component always points in the direction of the normal position. 
By separation of variables follows: 


— {Ef do 
° g J0 \/2(cos Θ — cos vo) 


(14.102b) 
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Here, to denotes the time for which the pendulum is in the deepest position for the first time, i.e., 
where U(to) = 0 holds. © denotes the integration variable. After some transformations and with the 


7 
substitutions sin — = ksinw, k = sin — ? follows 


t He --γε (k, ϱ) (14.102c 
9 J1— k2 sin? 


Here F(k, p) is an elliptic integral of the first kind (see (8.25a), p. 490). The angle of deflection 0 = V(t 
is a periodic function of period 2T with 


T= "E (v. 5) = "s (14.102d 
g 2 g 


where K represents a complete elliptic integral of the first kind (Table 21.9, p. 1103). T denotes the 


. . : . a .. dù 
period of the pendulum, i.e., the time between two consecutive extreme positions for which — = 0. I 


the amplitude is small, i.e., sin’? πε Ὁ, then T = 27\/1/g holds. 


-1 


cnx snx 


Figure 14.54 Figure 14.55 Figure 14.56 


14.6.2 Jacobian Functions 
1. Definition 


It follows for 0 < k < 1 from the representation (8.24a) and (8.25a) (see 8.1.4.3, p. 490) for the elliptic Γ- 


integral of the first kind F (k, p) that 
dF 


= (1—k? sin? y)~? > 0, (14.103) 
dp 
i.e., F(k, p) is strictly monotone with respect to p, so the inverse function 
ig αφ 
p =am(k,u) = y(u) (14.104a) of u= I — —— =u(y) (14.104b) 
y1 — k2 sin? y 
exists. It is called the amplitude function. The so-called Jacobian functions are defined as: 
snu = sin y = sinam(k,u) (amplitude sine), (14.105a) 
cnu = cosy = cosam(k, u) (amplitude cosine), (14.105b) 
dnu = v1 — k?sn?u (amplitude delta). (14.105c) 


2. Meromorphic and Double Periodic Functions 
The Jacobian functions can be continued analytically in the z plane. The functions snz, cnz, and dnz 
are then meromorphic functions (see 14.3.5.2, p. 753), i.e., they have only poles as singularities. Besides, 
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they are double periodic: Each of these functions f(z) has exactly two periods w; and w» with 


δία Έωι) = f(z), f(z +e) = f(z). (14.106) 
Here, ὧι and w» are two arbitrary complex numbers, whose ratio is not real. The general formula 
f(z + mw, + nw) = f(z) (14.107) 


follows from (14.106), and here m and n are arbitrary integers. Meromorphic double periodic functions 
are called elliptic functions. The set 

{zo + awi + agwo: 0 X 04,03 < 1}, (14.108) 
with an arbitrary fixed 20 € C, is called the period parallelogram of the elliptic function. If this function 
is bounded in the whole period parallelogram (Fig. 14.55), then it is a constant. 
W The Jacobian functions (14.105a) and (14.105b) are elliptic functions. The amplitude function 
(14.104a) is not an elliptic function. 


3. Properties of the Jacobian functions 
The properties of the Jacobian functions given in Table 14.3 can be got by the substitutions 


i-i, K = F(R), K-F(RZ). ;.͵; 


where m and n are arbitrary integers. 


Table 14.3 Periods, roots and poles of Jacobian functions 
Periods w , w2 Roots Poles 
snz 4K, 21’ 2mK + 2π1 κ’ 
cnz | AK,2(K +ik’) (2m + 1) + 2nik’ 2mK + (2n + 11K" 
dnz 2K,4iK' (2m +1)K + (2n + 1)ik’ 


The shape of snz, cnz, and dnz can be found in Fig. 14.56. The following relations are valid for the 
Jacobian functions except at the poles: 


1. sn?z+cn?z=1, K?sn?z 4 di?z — 1, (14.110 
ρα... Gnu)(env)(dnv) + (snv)(enu)(dnu) . 
2. sn(u+v) 1— E (enu) (env) ; (14.111a 
"m (cnu)(cnv) — (snu)(dnu)(snv)(dnv) 
cn(u + v) = T — k?(en2u)(en%v) (14.111b 
_ (dnu)(dnv) — k?(snu)(cnu)(snv)(cnv) 
dniu w) 1 — K?(sn2u)(snv) | ος 
3. (snz) = (cnz)(dnz), (14.112a) (cnz)' = —(snz)(dnz), (14.112b 
(dnz)! = —k?(snz)(cnz). (14.112c 


For further properties of the Jacobian functions and further elliptic functions see [14.8], [14.12]. 


14.6.3 Theta Functions 


The theta functions can be used to evaluate the Jacobian functions: 


θι(σ,ϱ) = 293 Y (—1)" g sin(2n + 1)z, (14.1132) 


n=0 
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ϑο(α, α) = PIE Sg?) cos(2n + 1)2, (14.113b) 
n=0 
03(z,q) 511525 q” cos 2nz, (14.113c) 
n-i 
Πο -- 1-- 2 Y (-1)"^q" cos 2nz. (14.113d) 
n=1 


If |g| < 1 (q complex) holds, then the series (14.113a)-(14.113d) are convergent for every complex 
argument z. In the case of a constant q one uses the following brief notations: 


O,(z) = Ux(mz,q) (k= 1,2,3,4). (14.114) 


Then, the Jacobian functions have the representations: 


WIR (14.1152) xis 00) (zx) 


ς 9. = 2 
va (0) ? bac) Κ' 95(0) 
— <A 14.115c 7 = c= = 5 
(0) " A | ; ( 5c) with q-exp|—- x E 3.0) (14.115d) 
2K 


and ΜΚ, K’ are as in (14.109). 


snz = 2K 


(14.115b) 


dnz — 


= 
A 


14.6.4 Weierstrass Functions 


The functions 


p(z) = p(z, 01,2), (14.116a) C(z) = ¢(z, wi, we), (14.116b 

σ(5) = a(z,01,w2), (14.116c 
were introduced by Weierstrass, and here w; and w» represent two arbitrary complex numbers whose 
quotient is not real. One substitutes 

Wmn = 2(mwi + nua), (14.117a 
where m and n are arbitrary real numbers, and defines 

Q(z,01,02) = z ? 4 y IG Us) ^ usa] à (14.117b 

mn 

The accent behind the sum sign denotes that the value pairm = n = 0 is omitted. The function 


p(z, W1, w2) has the following properties: 

1. It is an elliptic function with periods ωι and wy». 

2. The series (14.117b) is convergent for every z Z Wmn- 

3. The function p(z, wi, w2) satisfies the differential equation 


Q9? = 463 — gp- g (14.118) with g — 60S ωρα. gs = 140 wn (14.118b) 


mn' 
m,n m,n 


The quantities go and gs are called the invariants of p(z, wi, we). 
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4. The function u = ¢(z,w1, w2) is the inverse function of the integral 


dt 
40) = OT a- oe) 


The Weierstrass functions 


Qz)—21-M [ο Umn) F Us d ος 


5. plu + v) i| 


4 


2 


(14.119 


(14.120 


(14.121a 


a(z) = zexp (f KO τη at) -II : (1 "n ) exp (= 


202 


2 
) (14.121b 


mn 


are not double periodic, so they are not elliptic functions. The following relations are valid: 


1. C(z) 2 —g(z), ¢(z) = (Ina(z)), 
6 Q(z). σί--2) = --σ(2). 


(5 + 2w) = C(z) + (wi), Cle + 2wa) = Cz) 202), 
ΗΟ; 


(14.122) 
(14.123) 
(14.124) 


(14.125) 


5. Every elliptic function is a rational function of the Weierstrass functions p(z) and ¢(z). 


15 Integral Transformations 


15.1 Notion of Integral Transformation 


15.1.1 General Definition of Integral Transformations 


AnIntegral transformation is a correspondence between two functions f(t) and F(p) in the form 
+00 
Fp) = | Kp.) sat. (15.1a) 
—o6 


The function f(t) is called the original function, its domain is the original space. The function F(p) is 
called the transform, its domain is the image space. 

The function K(p,t) is called the kernel of the transformation. In general, { is a real variable, and 
p =o + iw isa complex variable. 

The shorter notation can be used by introducing the symbol 7 for the integral transformation with 
kernel K (p, t): 


F(p) = Τ{/(0}. (15.1b 


Then, (15.1b) is called 7 transformation. 


15.1.2 Special Integral Transformations 

Different kernels K (p, t) and different original spaces yield different integral transformations. The mos 
widely known transformations are the Laplace transformation, the Laplace-Carson transformation, 
and the Fourier transformation. In this book an overview is given about the integral transformations 
of functions of one variable (see also Table 15.1). More recently, some additional transformations 
have been introduced for use in pattern recognition and in characterizing signals, such as the Wavele 
transformation, the Gabor transformation and the Walsh transformation (see 15.6, p. 800ff.). 


15.1.3 Inverse Transformations 


The inverse transformation of a transform into the original function has special importance in applica- 
tions. With the symbol 7 -! the inverse integral transformation of (15.1a) is 


f(t) =T (EF). (15.2a 
The operator T~! is called the inverse operator of T, so 
T (TUQY = f(0. (15.2b 


The determination of the inverse transformation means the solution of the integral equation (15.14), 
where the function F(p) is given and function f(t) is to be determined. If there is a solution, then i 
can be written in the form 

f(t) = T^ 0p. (15.2c 
The explicit determination of inverse operators for different integral transformations, i.e., for differen 
kernels K (p, t), belongs to the fundamental problems of the theory of integral transformations. The 
user can solve practical problems by using the given correspondences between transforms and origina 
functions in the corresponding tables (Table 21.13, p. 1109, Table 21.14, p. 1114, and Table 21.15, 
p. 1128). 


15.1.4 Linearity of Integral Transformations 
If f; (t) and fa(t) are transformable functions, then 

T Us fi (t)  kafo(t)) = FT CR T + keT Ufo (03. (15.3) 
where kı and kə are arbitrary numbers. That is, an integral transformation represents a linear operation 
on the set T' of the 7 -transformable functions. 
© Springer-Verlag Berlin Heidelberg 2015 767 
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Table 15.1 Overview of integral transformations of functions of one variable 
Transformation | Kernel K (p, t) Symbol Remark 
σο 
Laplace m for t «0 u / E = i 
E tjj = 2 t)dt = C 
transformation e? for t» 0 uidi 0 diis PME 
Lud f(t) 1(t)} = LAFE) 
Two-sided E Here 
Laplace e Pt Lyi f(t j= = j e f (t)dt where 
transformation -oo 1(t) = 0 for t<0 
1 for t>0 
Finite 0 for t<0 
Laplace e for 0<t<a |£,(f(t)) fe 7? ftat 
transformation 0 fort>a ὁ 
E 'The Carson transforma- 
Laplace-Carson i for t <0 C | —pt tion can also be a two- 
Ν e t= 2 t)dt . s 
transformation pe” for t» 0 (0) J 98 fe sided and finite transfor- 
9 mation. 
+00 
Fourier i F / —iwt 
1 z tjj = 2 t)dt =0 + = 
ransformation ᾽ {FO} "S ft) HT Me d 
—oo0 
One-sided 0 for 
: or t<0 -iw 
Fourier , (eua for t»0 Fi ft} = fe edt |p=ot+iw σ 
ransformation 0 
Finite 0 for t <0 a 
Fourier eo for 0<t<a|Fiff(t)} = [poate p=ot+iw σ- 
ransformation 0 for t>a A 
Fouri i 0 for t <0 T 
ourier cosine Or u D = _ 
ransformation Re [εν for t > 0 FAf()- f fos wtdt |p =0 +iw σ 
0 
Fourier si 0 for t<0 T 
ourier sine or = n MN " _ 
transformation Im [e^] for t > 0 Ff} = / σι ee dimid 
0 
0 for t<0 f 
Mellin or t< J -1 
i= 117 t)dt 
transformation t" for t>0 MIT) " fe) 
p=otiw w=0 
Hankel ο. 
. 0 fort«0 J,(ot) is the v-th or- 
ransformatio V r= tJ, ( t) f t)dt jd j ; 
νη, τὸν (στ) for t > 0 HALO} = / G Ja der Bessel function of the 
of order v A 
first kind. 
ΠΝ 0 for t<0 oo 
Stieltjes | 1 M δ{0] = f(t) dt 
transformati for t>0 ἔπ“ Í 
ransformation "EE. JP +t 
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15.1.5 Integral Transformations for Functions of Several Variables 


Integral transformations for functions of several variables are also called multiple integral transforma- 
tions (see [15.13]). The best-known ones are the double Laplace transformation, i.e., the Laplace trans- 
formation for functions of two variables, the double Laplace-Carson transformation and the double 
Fourier transformation. The definition of the double Laplace transformation is 


F(p.a) = £(f(,y) = f. [ efi y) de dy. (15.4) 
r—0y-0 


The symbol £ denotes the Laplace transformation for functions of one variable (see Table 15.1). 


15.1.6 Applications of Integral Transformations 
1. Fields of Applications 


Besides the great theoretical importance that integral transformations have in such basic fields of math- 
ematics as the theory of integral equations and the theory of linear operators, they have a large field of 
applications in the solutions of practical problems in physics and engineering. Methods with applica- 
tions of integral transformations are often called operator methods. 'They are suitable to solve ordinary 
and partial differential equations, integral equations and difference equations. 

2. Scheme of the Operator Method 

'The general scheme to the use of an operator method with an integral transformation is represented 
in Fig. 15.1. One gets the solution of a problem not directly from the original defining equation; first 
an integral transformation is applied. The inverse transformation of the solution of the transformed 
equation gives the solution of the original problem. 


Equation of Solution of 
Problem > ——— : ——» Result 
the problem the equation 
Transformation Inverse transformation 
Solution by using | Transformed Solution of 
the transformation equation > the transformed 
equation 


Figure 15.1 


The application of the operator method to solve ordinary differential equations consists of the following 
three steps: 
1. Transition from a differential equation of an unknown function to an equation of its transform. 

2. Solution of the transformed equation in the image space. The transformed equation is usually no 
longer a differential equation, but an algebraic equation. 
3. Inverse transformation of the transform with help of 7 ^! into the original space, i.e., determination 
of the solution of the original problem. 


The major difficulty of the operator method is usually not the solution of the transformed equation, 
but the transformation of the function and the inverse transformation. 
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15.2 Laplace Transformation 


15.2.1 Properties of the Laplace Transformation 
15.2.1.1 Laplace Transformation, Original and Image Space 


1. Definition of the Laplace Transformation 
The Laplace transformation 
oo 
cut = | emf at = Fo) (15.5) 
0 
assigns a function F(p) of a complex variable p to a function f(t) of a real variable t, if the given 
improper integral exists. f(t) is called the original function, F(p) is called the transform of f(t). In 
the further discussion it is assumed that the improper integral exists if the original function f(t) is 
piecewise smooth in its domain t > 0, in the so called original space, and for t — oo, |f(t)| € Κο” 
holds with certain constants K > 0, œ > 0. The domain of the transform F(p) is called the image 
space. 
In the literature the Laplace transformation is often found also in the Wagner or Laplace-Carson form 


£w4f(0) =p [ e" f(04 = p FQ). (15.6 


2. Convergence 
The Laplace integral C f (t) } converges in the half-plane Rep > a (Fig. 15.2). The transform F(p 
is an analytic function with the properties: 


1. lin F(p)=0. (15.7a 
Re poo 
This property is a necessary condition for F(p) to be a transform. 
2. lim pF(p)—4A, (15.7b 
Po 
(poo) 


if the original function f(t) has a finite limit lim f(t) = A. 


(t-+0) 


IM .. | 
= f(t)=sin t f(t)=sin(at) 
1 1 
α z 2 2π 
0 Re E Em a > 
ET 0 t 0 t 
a) b) 
Figure 15.2 Figure 15.3 
3. Inverse Laplace Transformation 
One can retrieve the original function from the transform with the formula 
c-Fioo 
zj m. „pt _{ff(t) for t>0, κ 
C (FO) = i | dii jo for t « 0. (15:8) 


c—ioo 
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The path of integration of this complex integral is a line Re p = c parallel to the imaginary axis, where 
Rep = c > a. Ifthe function f(t) has a jump at t = 0, i.e., lim, F(t) Æ 0, then the integral has the 
> 


1 
value gi tte) there. 


15.2.1.2 Rules for the Evaluation of the Laplace Transformation 
The rules for evaluation are the mappings of operations in the original domain into operations in the 
transform space. 
Hereafter the original functions will be denoted by lowercase letters, the transforms are denoted by the 
corresponding capital letters. 
1. Addition or Linearity Law 
The Laplace transform of a linear combination of functions is the same linear combination of the 
Laplace transforms, if they exist. With constants À1,...,À, that is 

LAA failt) + M fo (t) - + An falt) y} = AA) + A2Fo(p) + +++ + λε Ει(ψ)- (15.9) 


2. Similarity Laws 
The Laplace transform of f (at) (a > 0,a real) is the Laplace transform of the original function divided 
by a and with the argument p/a: 


1 
L{f(at)} = =F (2) (a > 0, real). (15.10a) 
Analogously for the inverse transformation 
1 t 
£^ F(ap)} =—f (-) (a> 0). (15.10b) 
a a 


Fig. 15.3 shows the application of the similarity laws for a sine function. 
W Determination of the Laplace transform of f(t) = sin(wt). The transform of the sine function is 
F ji 
L{sin(t)} = F(p) = 1/(p? + 1). Application of the similarity law gives C(sin(wt)) = —F(p/w) = 
ω 
1 1 00 
w (pw +1 γω. 


3. Translation Laws 


1. Shifting to the Right The Laplace transform of an original function shifted to the right by a 


(a > 0) is equal to the Laplace transform of the non-shifted original function multiplied by the factor 
e αρ: 


L{f(t—a)} =e? F(p). (15.11a) 


2. Shifting to the Left The Laplace transform of an original function shifted to the left by a is 
equal to e°? multiplied by the difference of the transform of the non-shifted function and the integral 


Jo f(t) e dt: 
L{f(t+a)} =e 


F(p) - fe* t) a (15.11b) 
0 
Figs. 15.4 and 15.5 show the cosine function shifted to the right and a line shifted to the left. 

4. Frequency Shift Theorem 

The Laplace transform of an original function multiplied by e~ 
the argument p +b (bis an arbitrary complex number): 


L{e™ f(t)) = F(p +b). (15.12) 


èt is equal to the Laplace transform with 
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110 " f(t) f(t) f(t) 
0 als hot 0 t7 Fo t 


Figure 15.4 Figure 15.5 


5. Differentiation in the Original Space 
If the derivatives f'(t), f"(t), .. ., f"? (t) exist for t > 0 and the highest derivative of f(t) has a trans- 
form, then the lower derivatives of f(t) and also f (t) have a transform, and: 

Ef (0) = p F(p) — f(+0), 

EC" (0) = pP F(p) - f(+0)p — f'(+0), 


LEF()} =v" Ε() - $40) pm? - P0)? - -Ὃ VAS 
{λαο p — f”) (+0) with 
fOO) = lim. (0). 
Equation (15.13) gives the following representation of the Laplace integral, which can be used for ap- 
proximating the Laplace integral: 
{140 40, 490 , £^ RUD 


= τ. l ερ) 
£f) ONG mol mdr ty (t)). (15. 


c 
S 


6. Differentiation in the Image Space 
LAE f(0) — (71) F™ (p). (15. 


The n-th derivative of the transform is equal to the Laplace transform of the (—t)"-th multiple of the 
original function f(t): 


£(-1ymf(0)- FM)  (n-12..). ( 
7. Integration in the Original Space 


The transform of an integral of the original function is equal to 1/p" (n > 0) multiplied by the trans- 
form of the original function: 


i τι Τπι--1 t 
1 
: - ο... = —F( 5.17a 
εἰ dn | ὅτ... / ΠΝ CERT cf fu T) ie] wee (15.17a) 


In the special case of the ordinary simple integral 


£ μ f(T) «| = FO) (15.17b) 
0 


holds. In the original space, differentiation and integration act in converse ways if the initial values are 
Zeros. 


σι 
σι 


σι 
lor) 


8. Integration in the Image Space 


= j= 1 Ε 
c(t 7 2) = Jon Jam J Fn) dpm = c i F(z) dz. (15.18) 


Pn-1 
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This formula is valid only if f(t)/t” has a Laplace transform. For this purpose, f(x) must tend to zero 
fast enough as t — 0. The path of integration can be any ray starting at p, which forms an acute angle 
with the positive half of the real axis. 


9. Division Law 
In the special case of n — 1 of (15.18) 


BEI lore : 
ε[55) - {19 dz (15.19) 


t 
holds. For the existence of the integral (15.19), the limit lim Es must also exist. 
— 


10. Differentiation and Integration with Respect to a Parameter 


Of (t, a) OF (p, a) 
2 Oa } τ, ϑα ^" 


az ag 
(15.20a) £ | [ft a) in} = nho a) da. (15.20b) 
αι αι 
Sometimes one can calculate Laplace integrals from known integrals with the help of these formulas. 
11. Convolution 


1. Convolution in the Original Space The convolution of two functions fı(t) and f(t) is the 
integral 


fix fa= [ flr): f - nar. (15.21) 


Equation (15.21) is also called the one-sided convolution in the interval (0, t). A two-sided convolution 
occurs for the Fourier transformation (convolution in the interval (--οο, oo) see 15.3.1.3, 9., p. 789). 
The convolution (15.21) has the properties 


a) Commutative law: fi * fa= fax fi: (15.22a) 
b) Associative law: (fi fa) * fs = fa * (fo * fs). (15.22b) 
c) Distributive law: (fit fo) * fs = fi * fat fo * fs. (15.22c) 

^g(t) In the image domain, the usual multiplication 


corresponds to the convolution: 
+ L{fi * fo} = FA(p): Fo(p). (15.23) 


n 'The convolution of two functions is shown in 
f(t) Fig. 15.6. One can apply the convolution theo- 


rem to determine the original function: 
a) Factoring the transform 
λάμα F(p) = Fi(p) + Fa(p). 

(£«g)(t) b) Determining the original functions fi(t) and 
fo(t) of the transforms F\(p) and F(p) (from a 
DAS table). 

c) Determining the original function associated 
to F(p) by convolution of fı(t) and f2(t) in the 
Figure 15.6 original space (f(t) = fi(t) « fo(t)). 


=y 


=y 


=y 
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2. Convolution in the Image Space (Complex Convolution) 


1 σι Γἶοο 
zg | BG: δρ ϱ) dz 
CA) ΜΘ} 51 prs (15.24) 
su Fi(p— 5): F(z) dz 


The integration is performed along a line parallel to the imaginary axis. In the first integral, xı and p 
must be chosen so that z is in the half plane of convergence of £( fı} and p — z is in the half plane of 
convergence of L{ f2}. The corresponding requirements must be valid for the second integral. 


15.2.1.3 Transforms of Special Functions 


1. Step Function 
The unit jump at t = to is called a step function (Fig. 15.7) (see also 14.4.3.2, 3., p. 757); it is also 
called the Heaviside unit step function: 


1 for t > to, 
ut- = {orte 029. 


— 
fan 
σι 
N 
σι 

wa 


top ὦ cosw to + p sin w to 
pP tw 


BA: f(t) = u(t — to) sinwt, F(p) =e (Fig. 15.8). 


W B: /(t) = u(t — to) sinw (t — to), F(p) = et? — — (Fig. 15.9). 


Kt) 


ff u(t-t,) sinot-t) 


Figure 15.7 Figure 15.8 Figure 15.9 


2. Rectangular Impulse 


Α rectangular impulse of height 1 and width T' (Fig. 15.10) is composed by the superposition of two 
step functions in the form 


O0 for t < to, 
ur(t — to) = u(t — to) — u(t — t9 — T) 24 1 for tp ct «to T, (15.26 
0 fort>t)+T; 


ερ] = e-T») 


p 


L{ur(t — to)} = 


3. Impulse Function (Dirac 6 Function) 


(See also 12.9.5.4, p. 700.) The impulse function δ({ — to) can obviously be interpreted as a limit ο 
the rectangular impulse of width T and height 1/T at the point t = ty (Fig. 15.11): 


ó(t to) = Jim, u(t — to) — u(t — to — T)]. (15.28 


1 
gl 
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t) QT t ο, t tT t 
Figure 15.10 Figure 15.11 
For a continuous function h(t), 
b E ΓΝ P 
[056-i)a {Qe iE is outside (a 8). (15.29) 
Relations such as 
δ(ί — to) = m L{d(t — to)} =e” (to > 0) (15.30) 


are investigated generally in distribution theory (see 12.9.5.3, p. 699). 
4. Piecewise Differentiable Functions 


The transform of a piecewise differentiable function can be determined easily with the help of the 6 


function: If f(t) is piecewise differentiable and at the points t, (v = 1,2,...,n) it has jumps αν, then 
its first derivative can be represented in the form 
df (t 
Da = fi(t) + a(t — th) + a26(t — Ὁ) +-+- + αμδί -- tn) (19.51) 
dt 


where f/(t) is the usual derivative of f(t), where it is differentiable. 

If jumps occur first in the derivative, then similar formulas are valid. In this way, one can easily de- 
termine the transform of functions which correspond to curves composed of parabolic arcs of arbitrarily 
high degree, e.g., curves found empirically. In formal application of (15.13), the values f (+0), f’(+0),... 
should be replaced by zero in the case of a jump. 


ΒΑ: 
t+bfor0<t<to, pp; 
f(t) = (qe t Ofer <t < to (Pig, 15.12); / (0 = a tuolt) + bE) — (ato + b) AlE — to); £U'() = 
a lja a 
τα e P) +b — (ato + be; C(f(t)) » |; + b— ο ορ E + ato 4 Ju 
HB: 
t for 0 € t € to, 1 for 0<t< to, 
f(t) = 4 2to — t for to « t < 2to, (Fig. 15.13); f(t) = 4 —1 for to <t< 2to, (Fig. 15.14); 
0 for t > 2to, 0 for t > 2to, 
I I —t —2t ( pug WP Y 
f" (t) = 5(t)-4(t-to) -6(t-to) +6(t-2to); LLP" (Ef 21-2e Pe?" LA F(t} = — 3 
Et/to for 0 «t « to, 
: = for tp « t « T — to, " . 
E Of A ms for T—t<t<T, (Fig. 15.15); 


0 otherwise, 
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f(t) 
b > 
in 
» 
0| bh. t 
Figure 15.12 Figure 15.13 Figure 15.14 
E/ty for 0 « t « to, 
ta) Jo for to « t« T — to, (t » T), ; | 
FO=) _ pity for T—t) <t<T, (Fig. 15.16): 
0 otherwise, 
E E E E E 
f" (t) = FOH- (tto) — 6 (t-T +to) + —9(t- T); L{f"(t)} = = [1 — et — eto? + e»: 
to to to to to 
E(1- e top \a- e-Q-to) 
PORE B | 
0 p 


Figure 15.15 Figure 15.16 


B D: 
| ft- P for0<t<1, πι - µία. [1-26 forO<t<1, su. . 
fi) = i otherwise, (Fig. 15.17); f'(t)— T otherwise, (Fig. 15.18); 


E f" (t) = —2ui(t) + d(t) + ó(t — 1); i i 
L{f"(t)} = —21—e*) 41467; rtf) - e zs ) 
p p p 


f(t) 


1/2 1 


--Υ 


Figure 15.17 Figure 15.18 


5. Periodic Functions 
The transform of a periodic function f *(t) with period T', which is a periodic continuation of a function 
f (t), can be obtained from the Laplace transform of f(t) multiplied by the periodization factor 


(1—e7»)71, (15.32) 
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W A: The periodic continuation of f(t) from example B (see above) with period T = 2to is f *(t) with 


: (1 — eter)? 1 1—e to 
Lif (t)} p? i 1 — e—2top p + e-top) 


WB D: The periodic continuation of f(t) from example C (see above) with period T is f*(t) with 


— e-top — eg-(T-to)p 
e) το. 


15.2.1.4 Dirac 6 Function and Distributions 

In describing certain technical systems by linear differential equations, functions u(t) and ó(t) often 
occur as perturbation or input functions, although the conditions required in 15.2.1.1, 1. p. 770, are 
not satisfied: u(t) is discontinuous, and ó(t) cannot be defined in the sense of classical analysis. 
Distribution theory offers a solution by introducing so-called generalized functions (distributions), so 
that with the known continuous real functions 6(¢) can also be examined, where the necessary differen- 
tiability is also guaranteed. Distributions can be represented in different ways. One of the best known 
representations is the continuous real linear form, introduced by L. Schwartz (see 12.9.5, p. 698). 
Fourier coefficients and Fourier series can be associated uniquely to periodic distributions, analogously 
to real functions (see 7.4, p. 474). 

1. Approximations of the 6 Function 

Analogously to (15.28), the impulse function 6(¢) can be approximated by a rectangular impulse of 
width £ and height 1/e (e > 0): 


gel es (15.334) 


Further examples of the approximation of ô(t) are the error curve (see 2.6.3, p. 73) and Lorentz function 
(see 2.11.2, p. 95): 


fte) = UO (e > 0), (15.33b 
f(t,s)— gi (e > 0). (15.33¢ 
These functions have the common properties: 
1. j f(t,¢) dt =1. (15.34a 
2. "m €) = f(t,e), i.e., they are even functions. (15.34b 
ο 7 


2. Properties of the 6 Function 
Important properties of the ὃ function are: 


α γα 
1. / f(t)ó(x —t)dt = f(x) (fis continuous, a > 0). (15.35) 
2. d(ax) = —ó(x) (a> 0). (15.36) 


3. δ(ο(α)) = E ΠῚ 


δία --αι) with g(z;) = 0 and g'(z;) Z 0 (i =1,2,...,n). (15.37) 
= gui) 
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Here all roots of g(a) are considered and they must be simple. 
4. n-th Derivative of the 6 Function: After n repeated partial integrations of 
rca 
fa) = ji f(t) δία — t) dt, (15.38a) 
z—a 
a rule is obtained for the n-th derivative of the ó function: 
rca 


(- ΠΡ} f(s i=] K t) δ (η (x = t) dt. (15.38b) 


15.2.2 Inverse Transformation into the Original Space 


To perform an inverse transformation, there are the following possibilities: 

1. Using a table of correspondences, i.e., a table with the corresponding original functions and trans- 
forms (see Table 21.13, p. 1109). 

2. Reducing to known correspondences by using some properties of the transformation (see 15.2.2.2, 
p. 778, and 15.2.2.3, p. 779). 

3. Evaluating the inverse formula (see 15.2.2.4, p. 780). 


15.2.2.1 Inverse Transformation with the Help of Tables 


The use of a table is shown here by an example with Table 21.13, p. 1109. 
Further tables can be found, e.g., in [15.3]. 


πο ο Fle), cn - e? 


1 
LH Fo(p)} = L7 {; 7 :) = e * = f,(t). Applying the convolution theorem (15.23) yields: 


1 
p) cto? 


B F(p)= [= L snot = fi(t), 


f) = LHF (p): Fa(p)} 
t t 1 "ql πὰ Wi 
= [ filt) + falt — τ) dr = [ gree SOUT V d (== ώς p 
“Ὁ “Ὁ 


w e +w? w 


15.2.2.2 Partial Fraction Decomposition 


1. Principle 

In many applications, there are transforms in the form F(p) = H(p)/G(p), where G(p) is a polynomial 
ofp. Ifthe original functions for H (p) and 1/G(p) are already known, then the required original function 
of F(p) can be got by applying the convolution theorem. 

2. Simple Real Roots of G (p) 

If the transform 1/G(p) has only simple poles p, (v = 1,2,..., n), then it has the following partial 
fraction ΕΑ; 


j"À Gon Y» — p) 


The s original function is 


=i 1 pes - 1 „Put 5 
q(t) =L tam} 223 : (15.40) 
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3. The Heaviside Expansion Theorem 
If the numerator H (p) is also a polynomial of p with a lower degree than G(p), then we can obtain the 
original function of F(p) with the help of the Heaviside formula 


f(t) = = ae et. (15.41) 


4. Complex Roots 
Even in cases when the denominator has simple complex roots, the Heaviside expansion theorem can 
be used in the same way. The terms belonging to complex conjugate roots can be collected into one 
quadratic expression, whose inverse transformation can be found in tables also in the case of roots of 
higher multiplicity. 

1 


(p o τω 
zeroes of G(p) py = —c, pa = iw, pa = —iw are all simple. According to the Heaviside theorem one gets 
f(t) _ 1 Ε et 1 - git 1 - e it 
w? ro 2w(w — ic) 2ω(ω + ic) 

1 1 c—p 1 c 
- - + = = hg t) 2 — ~le “+4 —sinwt — coswt |. These 
τος στ στ] f(t) w + 2 ω 

expressions for f(t) are identical. 


B F(p) = ie, H(p) = 1, G(p) = (p + e) (p? + w°), G'(p) = 3p? + 2pc + ω”. The 


or by using partial fraction decomposition and 


the table F(p) = 


15.2.2.3 Series Expansion 

In order to obtain f(t) from F(p) one can try to expand F(p) into a series F(p) = ` Fa(p), whose 
n=0 

terms F,,(p) are transforms of known functions, i.e., F,(p) = L{fn(t)}. 

1. F(p) isan Absolutely Convergent Series 

If F(p) has an absolutely convergent series 


áo 
n, 
F(9-»»--. (15.42) 
n=0 P 
for |p| > R, where the values A, form an arbitrary increasing sequences of numbers 0 < Ay < Ay < 
ree < Àn Xo xe oo, then a termwise inverse transformation is possible: 


oo p» ο 


f(t) = È On TO)" (15.43) 


I' denotes the gamma function (see 8.2.5, 6., p. 514). In particular, for A, = n + 1, i.e., for F(p) = 


oo a : 55 A, 
at the series f(t) = Y; == 
i n=0 n! 


t" is obtained, which is convergent for every real and complex t. 
n=0 p 


Furthermore, one can have an estimation in the form |f(t)| < C et! (C, c real constants). 


1 j iy? e FEY 4 
WB F(p)= =-|{1l+-—- = ——— . After a termwise transformation into the 
(p) Vip p ( =) > a pu 


n=0 


oo 1 pn oo (-1)" t\2n 
original space the result is f(t) = X | 9 YT (n1) () = Jo(t) (Bessel function of 
n=0 \ m * ‘ 
0 order). 
2. F(p) isa Meromorphic Function 
If F(p) is a meromorphic function, which can be represented as the quotient of two integer functions 
(of two functions having everywhere convergent power series expansions) which do not have common 
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roots, and so can be rewritten as the sum of an integer function and infinitely many partial fractions, 
then the equality 


ctiyn 
1 T n 1 , 
— f e F(p)dp= >be — — J W F(p) d 15.44 
E / e” F(p) dp p» exi] €F)dp (15.44) 
c—iyn (Kn) 
is obtained. Here p, (v = 1,2,...,n) are the first-order poles of the function F(p), b, are the corre- 


sponding residues (see 14.3.5.4, p. 753), y, are certain values and K, are certain curves, for example, 
half circles in the sense represented in Fig. 15.19. The solution f(t) has the form 


Πὺ- e, i = [ Fo) ar > 0 (15.45) 
v=1 πι ^ 
(Kn) 


as y — oo, what is often not easy to verify. 


γα ^ 
Yn -B Yn 


Figure 15.19 Figure 15.20 


In certain cases, e.g., when the rational part of the meromorphic function F(p) is identically zero, the 
above result is a formal application of the Heaviside expansion theorem to meromorphic functions. 


15.2.2.4 Inverse Integral 


The inverse formula 
1 οΓἶψη 
‘)= lim = | Fod 15.46 
f(t) = lim 55 (p) dp (15.46) 
cin 

represents a complex integral of a function analytic in a certain domain. The usual methods of inte- 
gration for complex functions can be used, e.g., the residue calculation or certain changes of the path 
of integration according to the Cauchy integral theorem. 


p TV PO ας » 3 η m re 3 ‘al E i 
Β F(p) = Saw νρα is double valued because of „P. Therefore, we chose the following path of inte- 
1 
gration (Fig. 15.20): TE —e ντα dp foot foot feet foot fot fen = 
2πὶ $ P +w? PA J E EA ub 
(K) a op ΡΕ DA BE FC 


SC Rese” F(p) = e^? VP cos(wt — av/u/2). According to the Jordan lemma (sce 14.4.3, p. 755), the 
integral part over AB and CD vanishes as y, — oo. The integrand remains bounded on the circular 


arc EF (radius £), and the length of the path of integration tends to zero for € — 0; so this term of the 
integral also vanishes. There are to investigate the integrals on the two horizontal segments BE and 
FC, where it is to consider the upper side (p = το") and the lower side (p = re^?) of the negative real 
axis: 
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oo 


0 oo ^ , —oo6 ^ : 
r F(p)e? dp = — [ e T εαν dr, f F(p)e'? dp = Í e t LL cele dr. 
—oo “Ὁ 


r? +w? 0 0 r? +w? 


Finally one gets: 
: [7] œ ,rSinav/r 
t)=e° =P cos ( t- A-S otr Np, 
f(t) =e wt— als πο © pau ot 


15.2.3 Solution of Differential Equations using Laplace 
Transformation 


It has been noticed already from the rules of calculation of the Laplace transformation (see 15.2.1.2, 
p. 771), that complicated operations, such as differentiation or integration in the original space, can be 
replaced by simple algebraic operations in the image space using the Laplace transform. Here, some 
additional conditions are considered, such as initial conditions in using the differentiation rule. These 
conditions are necessary for the solution of differential equations. 


15.2.3.1 Ordinary Linear Differential Equations with Constant Coefficients 


1. Principle 
The n-th order differential equation of the form 


y (t) + erry” Y(t) e ταν (ϐ) + coy) = f(t) (15.472) 
with the initial values y(+0) = yo, y/(-0) πα (4-0) = ("P can be transformed by Laplace 
transformation into the equation 

n n k-1 ω 

3 arY(p-,c»,» ty” -F(p (=l). (15.47b) 

k=0 k=l v=0 


n 
Here G(p) = Y; ckp" = 0 is the characteristic equation of the differential equation (see 4.6.2.1, p. 315). 
) 


2. First-Order Differential Equations 


The original and the transformed equations are: 


y (t) + coy(t) = f(t), y(+0) = yo, (15.48a) (p4-co)Y(p)— yo = F(p), (15.485) 
where co = const. The solution for Y (p) results in 
F(p) + yo 
Y(p) = 9 15.486 
®) P + co ( 
: bs A 
Special case: For f(t) = Ae" with F(p) = , (A, const) : (15.49a 
p-pu 
^ Yo 
Y(p) = H ; 15.49b 
(9) (p—W(p+eo) p+c 
λ λ 
DES et up ο ές 15.49 
(0 = et (n - L2). (15.49¢ 


3. Second-Order Differential Equations 


The original and transformed equations are: 
y (t) + 2ay (t) + bylt) = f(t), — v(*0) —vs  y'(+0) = yp. (15.50a) 


(p? + 2ap + b) Y (p) — 2ayo — (pyo + yh) = F(p). (15.50b) 
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The solution for Y (p) results in 


F(p) + Qa + p) yo + Yo 
Y(p) = : 9t 
(p) D1 2ap4b (15.50c 
Distinction of Cases: 
a) b<a’: G(p)=(p—a)(p—as) (αι, ag real; αι Æ o3), (15.51a 
1 1 
t £o poat j,Q2t 1 5.514 
a(t) tan} lent et) (15.51b 
b) b=a*®: σρ) --(ρ-- α)”, (15.524) q(t) =te™. (15.52b 
c) b»a*: G(p) has complex roots, (15.53a 
1 1 
=1 2 at ss LADO 
uiy =L tam} eae e sin vb — a?t. (15.53b 
The solution y(t) can be obtained as the convolution of the original function of the numerator of Y (p 


and q(t). The application of the convolution can be avoided if a direct transformation of the right-hand 

side can be found. 

Β The transformed equation for the differential equation y(t) + 2y'(t) + 10y(t) = 37 cos 3t + 9e 
pt+2 37p P 9 

pe ἠ- 9-10 ᾿ (p?+9)(p2+2p+10) (p+ 1)(p? + 2p +10) ` 

IE = ο. E follows fro 

τς 5 E E ws irom 
p+2p+10 (p-2p-10) (p-9) (p-9) (p-1) 

partial fraction decomposition of the second and third terms of the right-hand side but not separating 

the second-order terms into linear ones. The solution after termwise transformation is (see Table 

21.13, p. 1109) y(t) = (— cos 3t — 6sin 3t)e^* + cos 3t + 6sin 3t + e™. 

4. n-th Order Differential Equations 

The characteristic equation G(p) = 0 of this differential equation (see (15.47a)) has only simple roots 

04,05, ..., Qn, and none of them is equal to zero. Two cases are distinguished for the perturbation 

function f(t). 

1. If the perturbation function f(t) is the jump function u(t) which often occurs in practical problems, 

then the solution is: 


with yo = 1 and yj = 0 is Y (p) = The 


representation Y (p) 


1 for t>0, 1 - 1 t 
[ = : 54a t ] pur 15.541 

u(t) fo for t « 0, ολ ule) G(0) » ανά αν). ae) 
2. For a general perturbation function f(t), one gets the solution y(t) from (15.54b) in the form of the 
Duhamel formula which uses the convolution (see 15.2.1.2, 11., p. 773): 


we) = © fut- niar = Fy κ] (15.55) 


15.2.3.2 Ordinary Linear Differential Equations with Coefficients 
Depending on the Variable 

Differential equations whose coefficients are polynomials in t can also be solved by Laplace transfor- 

mation. Applying (15.16), in the image space yields a differential equation, whose order can be lower 

than the original one. 

If the coefficients are first-order polynomials, then the differential equation in the image space is a first- 

order differential equation and may be it can be solved more easily. 
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2 
- + - +tf = 0 (see (9.524, p. 562) for n = 0). The 


transformation into the image space results in 


Β Bessel differential equation of 0 order: t 


oon be kaha ἀπ) _9 ο Æ? 
p” F(p) — pf (0) — f'(0)] + pF (p) — f(0) dp 0 or d sop 
ati ο μίο and περα alis Tos - PdP _ MD r1 
Separation of the variables and integration yields log F(p) — 2: lee log /p? + 1 4- logC, 
F(p) = y (C is the integration constant), f(t) = CJo(t) (see MM in 15.2.2.3,1., p. 779 with 


the Bessel function of 0 order). 
15.2.3.3 Partial Differential Equations 


1. General Introduction 
The solution of a partial differential equation is a function of at least two variables: u = u(x,t). Since 
the Laplace transformation represents an integration with respect to only one variable, the other vari- 
able should be considered as a constant in the transformation: 

oo 


£(u(x,t)) = [emus dt = U (x, p). (15.56) 
0 
x also remains fixed in the transformation of derivatives: 


£ eS η Spi ila i= ued; 


ot 
Paulx, t) 
Le. 
fe 
The differentiation with respect to x is supposed to be interchangeable with the Laplace integral: 
Ou(x, t) ὃ . ὃ . 
e an } τα Ὁ} = 5 U (n. p). (15.58) 


In this way, an ordinary differential equation is obtained in the image space. Furthermore, the boundary 
and initial conditions are to be transformed into the image space. 


2. Solution of the One-Dimensional Heat Conduction Equation for a 


} = p’L{u(x,t)} — u(x, +0)p — u(x, +0). 


Homogeneous Medium 


1. Formulation of the Problem Suppose the one-dimensional heat conduction equation with van- 
ishing perturbation and for a homogeneous medium is given in the form 


Une — à ^u, = Une — uy —0 (15.594) 
in the original space 0 < t < oo, 0 < x < l and with the initial and boundary conditions 

u(x, +0) = uo(r), u(+0,t) = ao(t), u(l — 0,8) = a(t). (15.59b) 
The time coordinate is replaced by y = at. (15.59a) is also a parabolic type equation, just as the 
three-dimensional heat conduction equation (see 9.2.3.3, p. 591). 
2. Laplace Transformation The transformed equation is 


d? 
4 = pU — u(x), (15.60a) 
da? 
and the boundary conditions are 
U(+0, p) = Ao(p, U(l—0,p) = Ai(p). (15.60b) 


The solution of the transformed equation for zero starting temperature ug(x) = 0 is 


U(x, p) = cie* V? + ege? VP. (15.60c) 
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It is a good idea to produce two particular solutions U; and U2 with the properties 


U\(0,p) 21, Uy(l,p) =9, (15.61a) U,(0,p) 20, U»(Lp)-—1, ie,  (15.61b 
ell—2) VP = ο.) VP et VP — e-* p 
U(x, p) = Ipe πα (15.619) U(x, p) = re (15.61d 


The required solution of the transformed equation has the form 
U (x, p) = Ao(p) Ui (x, p) + A (p) U2(x, p). (15.62 


3. Inverse Transformation The inverse transformation is especially easy in the case of | — oo: 


ma 
U(z,p)- ac(p)e ^V?, (15.630) u(x,t) LI =] dr. (15.63b 


15.3 Fourier Transformation 
15.3.1 Properties of the Fourier Transformation 
15.3.1.1 Fourier Integral 


1. Fourier Integral in Complex Representation 

The basis of the Fourier transformation is the Fourier integral, also called the integral formula of 
Fourier: If a non-periodic function f(t) satisfies the Dirichlet conditions (see 7.4.1.2, 3., p. 475) in 
an arbitrary finite interval, and furthermore the integral 


+00 +00 +00 
1 
/ |f(t) dt (15.644) is convergent, then a. J fet eu-Df(r)dudr (15.640) 


at every point where the function f(t) is continuous, and 


f(t +0) + f(t 9) 
2 


oo +00 
= =| dw / f(T) cos w(t — τ) dr (15.64c) 


at the points of discontinuity. 


2. Equivalent Representations 
Other equivalent forms for the Fourier integral (15.64b) are: 


-Foo +00 

1 E BE: 
1. f(t) ο δι f(r) cos [w (t — τὴ] dw dr. (15.65a 
2. f(t)— fio) coswt + b(w)sinwt]dw with the coefficients (15.65b 

0 

1 ι 

=> [ FO coswtat (15.65c) b(w) = — f F()sinwt at. (15.65d 

π. T. 

3. f(t) = | Alw) cos [wt + y(w)] dw. (15.66 
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oo 


4. f(t) — Eo sin [wt + p(w) | dw. (15.67) 


0 


The following relations are valid here: 


A(w) = ψα”(ω) + (w), (15.684) φίω) = vw) + 3 (15.68b) 
cos ψ(ω) = no (15.68c) sin ψ(ω) = EI (15.68d) 
cos p(w) = bot (15.68e) sin y(w) = τ . (15.68f) 


15.3.1.2 Fourier Transformation and Inverse Transformation 


1. Definition of the Fourier Transformation 
The Fourier transformation is an integral transformation of the form (15.1a), which comes from the 
Fourier integral (15.64b) by substituting 
+00 
Ε(ω) = / e" f(r) dr. (15.69) 
—oo 
The following relation is valid between the real original function f(t) and the usually complex transform 
F(w): 
+00 


oes [ Fw) dw. (15.70) 
27 . 
—o6 
In the brief notation one uses F: 
+0 
-FU- ο: (15.71) 
—oo 
The original function f(t) is Fourier transformable if the integral (15.69), i.e., an improper integral with 
the parameter w, exists. If the Fourier integral does not exist as an ordinary improper integral, then it 
is considered as the Cauchy principal value (see 8.2.3.3, 1., p. 510). The transform F'(w) is also called 
the Fourier transform; it is bounded, continuous, and it tends to zero for |w| — oc: 
lim F(w) — 0. (15.72) 


le |o 


The existence and boundedness of F (w) follow directly from the obvious inequality 


Fas fiere ats f µίϑ|άν. (15.73) 


The existence of the Fourier transform is a sufficient condition for the continuity of F (w) and for the 
properties F(w) — 0 for |ω| — oo. This statement is often used in the following form: If the function 
f (t) in (—oo, oo) is absolutely integrable, then its Fourier transform is a continuous function of w, and 
(15.72) holds. 

The following functions are not Fourier transformable: Constant functions, arbitrary periodic func- 
tions (e.g., sin wt, cos wt), power functions, polynomials, exponential functions (e.g., ew , hyperbolic 
functions). 
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2. Fourier Cosine and Fourier Sine Transformation 

In the Fourier transformation (15.71), the integrand can be decomposed into a sine and a cosine part. 
So, one gets the sine and the cosine Fourier transformation. 

1. Fourier Sine Transformation 


F,(w) = Ff f()} = / f(t) sin (wt) dt. (15.74a) 
0 
2. Fourier Cosine Transformation 
Few) = FA FŒ} = f F(E) cos (wt) dt. (15.74b) 


3. Conversion Formulas Between the Fourier sine (15.74a) and the Fourier cosine transformation 
(15.74b) on one hand, and the Fourier transformation (15.71) on the other hand, the following relations 
are valid: 


= F{ f(t) } = FA f(t) + ft} - 3E ft) — F(70 (15.75a) 
Flo) = i fütDsigntl, (15.759) Fw) = jt ΛΗ} }. (15.759) 


For an even or for an odd function f(t) the following representations hold: 
f(t) even: F{ f(t) } = 2F ef f(t) }, g 
F(t) odd: Ff f(t) } = 22. ΚΘ}. 


3. Exponential Fourier Transformation 
Differently from the definition of F(w) in (15.71), the transform 


+00 
Fw) = FASO} = 5 | sat (15.76) 


is called the exponential Fourier transformation, so that 
F(w) = 2Εε(--ω). (15.77) 


4. Tables of the Fourier Transformation 

Based on formulas (15.75a,b,c) one either does not need special tables for the corresponding Fourier 
sine and Fourier cosine transformations, or one uses tables for Fourier sine and Fourier cosine transfor- 
mations and calculates F (w) with the help of (15.75a,b,c). In Table 21.14.1 (see p. 1114) and Table 
21.14.2 (see p. 1120) the Fourier sine transforms F,(w), the Fourier cosine transforms F,(w) respec- 
tively, in Table 21.14.3 (sce p. 1125) for some functions the Fourier transform F(w) and in Table 
21.14.4 (see p. 1127) the exponential transform Fe(w) are given. 


W The function of the unipolar rectangular impulse f(t) — 1 for [| < to, f(t) = 
(Fig. 15.21) satisfies the assumptions of the existence of the Fourier integral (15. 


1 [ο 2 1 
(15.65c,d) the coefficients are a(w) = — f cos wtdt = — sin w to and b(w) = — Lh sin wt dt = 0 
T J—to TU T J—to 


0 for |t| > to (A.1) 
64a ο According to 


œ sin w to cos wt 


2 
(A.2) and so from (15.65b) follows f(t) = i dw (A.3). 
T JO 


ω 

5. Spectral Interpretation of the Fourier Transformation 

Analogously to the Fourier series of a periodic function, the Fourier integral for a non-periodic function 
has a simple physical interpretation. A function f(t), for which the Fourier integral exists, can be rep- 
resented according to (15.66) and (15.67) as a sum of sinusoidal vibrations with continuously changing 
frequency w in the form 
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A(w) dw sin [wt + φ(ω)], (15.78a) A(w) dw cos[wt + ψ(ω) ]. (15.78b) 


The expression A(w) dw gives the amplitude of the wave components and y(w) and ψ(ω) are the phases. 
The same interpretation holds for the complex formulation: The function f(t) is a sum (or integral) of 
summands depending on w of the form 


A gus dwet, (15.79) 
2π 


1 
where the quantity ἡ (w) also determines the amplitude and the phase of all the parts. 
π 


This spectral interpretation of the Fourier integral and the Fourier transformation has a big advantage 
in applications in physics and engineering. The transform 


F(w) =|F(w)|e* or Ε(ω) =|F(w)| ei? (15.80a) 
is called the spectrum or frequency spectrum of the function f(t), the quantity 
|F(w)| = 7 A(w) (15.80b) 


is the amplitude spectrum and ρ(ω) and ψ(ω) are the phase spectra of the function f(t). The relation 
between the spectrum F(w) and the coefficients (15.65c,d) is 


F(w) = π[α(ω) — ib(w)]. (15.81) 
from which one gets the following statements: 
1. If f (t) is a real function, then the amplitude spectrum | F'(w)| is an even function of w, and the phase 
spectrum is an odd function of w. 
2. If f(t) is a real and even function, then its spectrum F(w) is real, and if f(t) is real and odd, then 
the spectrum F (w) is imaginary. 


fF) | 
2ty 
2 
w 
> > 
t -37 -27 -T 0 π 2π 3T Wty 
Figure 15.21 Figure 15.22 
W Substituting the result (A.2) for the unipolar rectangular impulse function on p. 786 into (15.81), 


then one gets for the transform F(w) and for the amplitude spectrum |F(w)| (Fig. 15.22) 


3 t Si [ 
F(w) = F(f(t)) = παω) = a (A.3), |F(w)| = 2 SES (A.4). The points of contact of 
w w 
2 
the amplitude spectrum |F (w)| with the hyperbola — are at wto = (2n + D3 (n.—:0,1,2,...).. 
ω 


15.3.1.3 Rules of Calculation with the Fourier Transformation 
As it has been already pointed out for the Laplace transformation, the rules of calculation with integral 
transformations mean the mappings of certain operations in the original space into operations in the 
image space. Supposing that both functions f(t) and g(t) are absolutely integrable in the interval 
(—oo, oo) and their Fourier transforms are 

F(w) = F{ f(t)} and G(w) = F{ g(t) } (15.82) 
then the following rules are valid. 
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1. Addition or Linearity Laws 


If a and f are two coefficients from (—oo, oc), then: 
F{af(t) + 8g(t) } = aF(w) + 8G(»). 

2. Similarity Law 

For real a Æ 0, 
F{ f (t/a) } = |a| F(aw). 

3. Shifting Theorem 

For real a Z 0 and real 3, 
F{ f(at + B)} = (A/Jal) e*/*F(w/a) or 
F{ f(t — to) } = e^ F(w). 

If to is replaced by —to in (15.85b), then 
F{ f(t to) } = ew). 

4. Frequency-Shift Theorem 

For real a > 0 and 8 € (—co, 00), 
Ζ{οὐ' f(at)} = (1/α)Ε{(ω -- β)/α) or 
F{ eit f(t) } = Ρ(ω — wo). 


5. Differentiation in the Image Space 


(15.86a 
5.86b 


— 


If the function t" f (t) is absolutely integrable in (--οο, oo), then the Fourier transform of the function 


f (t) has n continuous derivatives, which can be determined for k = 1,2,...,n as 

d* F(w) T oF iw : Pa —iwt,k 

= I ax lems] d = (0t / ett F(t) dt, (15.872. 
where 

ο a F(w) E 

um. "E T 0. (15.87b 

With the above assumptions these relations imply that 
D 
Fay- ο (15.87¢ 


dw” 


6. Differentiation in the Original Space 


1. First Derivative If a function f(t) is continuous and absolutely integrable in (--οο, 00) and i 
tends to zero for { --» coo, and the derivative f'(t) exists everywhere except, maybe, at certain points, 


and this derivative is absolutely integrable in (—oo, oo), then 


ΤΘ} = iw Ft ΚΘ}. 


2. n-th Derivative Ifthe requirements of the theorem for the first derivative are valid for all deriva- 


tives up to f”), then 


F{ f(t) } = (iw)"F{ f(t) }. 


(15.88a 


(15.88b 


These rules of differentiation will be used in the solution of differential equations (see 15.3.2, p. 791). 


7. Integration in the Image Space 


Γεω dw = i[G(as) — σ(αι)] with G(w) = F{g(t)} and g(t) 


nm 
t 
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8. Integration in the Original Space and the Parseval Formula 
1. Integration Theorem If the assumption 


+00 t 
/ J(t)dt =0 (15.90a) is fulfilled, then zf f f(t) dt | = Ἔρίω (15.907) 
à iw 


2. Parseval Formula If the function f(t) and its square are integrable in the interval (—oo, oc), 
then 

-Foo 1 +00 

/ IOP dt = — / |F(w)|2 dw. (15.91) 


oo 


9. Convolution 
The two-sided convolution 


Too 
AO «πὸ f AORE- 1) ar (15.92) 
—oo 
is considered in the interval (—oo, oo) and it exists under the assumptions that the functions f1(t) and 


fo(t) are absolutely integrable in the interval (--οο, οο). If fi(t) and fa(t) both vanish for t < 0, then 
one gets the one-sided convolution from (15.92) 


AQ) * f(t) = [ A080 - 0 de for t20, (15.93) 
0 for t «0. 


So, it is a special case of the two-sided convolution. While the Fourier transformation uses the two- 
sided convolution, the Laplace transformation uses the one-sided convolution. 


For the Fourier transformation of a two-sided convolution 
F{filt) * fs) 9 FLA) F fo) } (15.94) 


holds, if both integrals 


+00 
/ |f dt and / | fo(t)|? dt (15.95) 
exist, i.e., the functions and their squares are integrable in the interval (—oo, oc). 
+00 
W Calculation of the two-sided convolution y(t) = f(t) * f(t) = J f(T)f(t— τ) dr (A.1) for the 
function of the unipolar rectangular impulse function (A.1) in 15.3.1.2, 4., p. 786. 
t tt 
Since y(t) = [. f(t—-7)dr = / ° f(r) dr (A.2) one gets for t < —2to and t > 2to, w(t) = 0 and 
J—to Ji—to 
t+to 


for —2to < t <0, v) - f dr —t-2ty. (A.3) 


—to 


t 
Analogously, for 0 < t < 2to: Y(t) = [. dr = —t + 2tọ (A.4) holds. 
Jt—to 


Altogether, for this convolution (Fig. 15.23) 
t+2t) for —2t9 €t € 0, 
w(t) = f(t) « f(t) = | —t-F2ty for 0<t<2to, (A.5) 
0 for |t| > 2to 
follows. For the Fourier transform F(w) of the unipolar rectangular impulse (A.1) (see p. 786 and 
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Fig. 15.21) V(») = F{v(t)} = F{ f(t) * 


v(t) 


» 
-2t, οἱ 2 t 


Figure 15.23 


yin’ ω Em 


FO — Πω 


sin wt 
amplitude spectrum of the function f(t) |F(w)| = 2 I 


jj (A.6) follows and for the 


(A.7) holds. 


e(t) 


Figure 15.24 


10. Comparing the Fourier and Laplace Transformations 
There is a strong relation between the Fourier and Laplace transformation, since the Fourier trans- 


formation is a special case of the Laplace transformation with p — 


iv. Consequently, every Fourier 


transformable function is also Laplace transformable, while the reverse statement is not valid for every 
f(t). Table 15.2 contains comparisons of several properties of both integral transformations. 


'Table 15.2 Comparison of the properties of the Fourier and the Laplace transformation 


Fourier transformation 


Laplace transformation 


Foo 
=F{f(t)}= f e™'f(t)dt 
—co 
w is real, it has a physical meaning, e.g., 
frequency. 


F(p) = Lf f(t),p} = fem f(t) dt 


pis complex, p = r + ir. 


One shifting theorem. 


Two shifting theorems. 


interval: (—oo, +00) 

Solution of differential equations, problems de- 
scribed by two-sided domain, e.g., the wave 
equation. 


interval: [0, co) 

Solution of differential equations, problems de- 
scribed by one-sided domain, e.g., the heat con- 
duction equation. 


Differentiation law contains no initial values. 


Differentiation law contains initial values. 


Convergence of the Fourier integral depends only 


on f(t). 


Convergence of the Laplace integral can be im- 
proved by the factor e7?'. 


It satisfies the two-sided convolution law. 


It satisfies the one-sided convolution law. 


15.3.1.4 Transforms of Special Functions 


W A: Which image function belongs to the original function f(t) = e 


+4 
= t for t > 0 with (15.71) one gets: / e ietcaltlqg = 
J-A 


sidering that |t| = —t for t < 0 and |t| 


αι 


Rea > 0 (A.1)? Con- 


—(iw— 0 „—(iw+a)t |t4 3(iw—a)A ο (iw 
f. m tate fe -liwat qu — e (iw—a)t e (iw+ayt alas” )JA J= e-GerajA (A.2). 
J-A iw—a |a iw+a |, lw — a iw +a 
Since |e~*4| = e~4Re¢ and Rea > 0, the limit of (A2) exists for A > oo, so that F(w) = F{ e7 } = 
2a . 
gag 49 


E B: Which image function belongs to the original function f(t) = e^ 


αἱ. Rea > 0? The function is 
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A o 
not Fourier transformable, since the limit of / e?" dt does not exist for A — oo. 
J-A 


W C: Determination of the Fourier transform of the bipolar rectangular impulse function (Fig. 15.24) 


1 for —2to < t < D; 
—1 for 0< t< 2to, 
0 Οι |t| >2to, 


(C.1) 


where y(t) can be expressed by using equation (A.1) given for the unipolar rectangular impulse on 


p. 786. There is y(t) = f(t + to) — f(t — to) (0.2). | 
(15.850, 15.856) one gets Φ(ω) = F{ p(t) } = εἰ“ ο F(uw)—e "^ F(w), (C.3) 


2sin w to 


With the Fourier transformation according to 
from which, using (A.1), 


olw) = (eie to ew to) =4 


ω 
W D: Image function of a damped osci 


given by the function f(t) = { -- o 


To simplify the calculations, the Fourier 


„sin? w to 
i 


at cos wot 


(C.4) follows. 


lation: The damped oscillation represented in Fig. 15.25a is 
for t « 0, 
for t 20. 


ransformation is calculated with the complex function f * (t) = 


elatiwo)t with f (t) = Re (f * (t)). The Fourier transformation gives 


zLreer-[ 


90 


ati(wo — w . 
(wo ) The result is the Loren 


OO du ida oo 
e eletivo) dt = [ 
0 


pat pi(w—wo)t |99 

" ai ee 1 

εί a+(w—wo)it dt : 
α-- iwo — w) 


—a + i(wo — w) fo 


z or Breit-Wigner curve (see also 2.11.2, p. 95) 


a 


a? + (w — wo)? 
F(f(t)) = ES (Fig. 15.25b). A damped oscillation in the time domain corresponds to 
a? + (w — wo)? 


a unique peak in the frequency domain. 


Af(t) 
1 


b) 


Figure 15.25 


Figure 15.26 


15.3.2 Solution of Differential Equations using the Fourier 
Transformation 


Analogously to Laplace transformation, an important field of application of the Fourier transformation 
is the solution of differential equations, since these equations can be transformed by the integral trans- 
formation into a simple form. In the case of ordinary differential equations one gets algebraic equations, 
in the case of partial differential equations one gets ordinary differential equations. 
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15.3.2.1 Ordinary Linear Differential Equations 
The differential equation 
l for |t| < to, 


5 P 
0 for |t| > to. ἩΘΉΒΕ 


ψίθ +ay(t)= ΠῚ with f(t) = { 


i.e., with the function f(t) of Fig. 15.21, is transformed by the Fourier transformation 
F{y(t)} -- Υίω) (15.960 


into the algebraic equation 


ΜΝ án wt 
iwy pay = 297 9fo. (15.96c) giving Y(w) =2 VESTE (15.96d 
The inverse transformation gives 
coo ; 
sin wt 1 οἷωί gj t 
y(t) 2 FY()) = 2 sin wto | / μι wto y (15.96e 
w(a + iw) πλ ω(α + iw) 
and 0 for —oo < t < —to, 
1 
= — e alt+to) CENE 
y(t)=4 a LL | for —to St S +o, (15.968) 
1 


= | emet) — λα fo tg«t«oo. 
a 


Function (15.96f) is represented graphically in Fig. 15.26. 
15.3.2.2 Partial Differential Equations 


1. General Remarks 
The solution of a partial differential equation is a function of at least two variables: u = u(x,t). As the 
Fourier transformation is an integration with respect to only one variable, the other variable is consid- 
ered a constant during the transformation. Here the variable x is kept constant and the transformation 
is to be performed with respect to t: 
coo 
Fiu(z,t)) = I e™tu(x,t)dt = U(x,w). (15.97) 
—oo 
During the transformation of the derivatives the variable 1 is again kept constant: 
au(x, t) 
F ot” 


The differentiation with respect to x is supposed to be interchangeable with the Fourier integral: 


} = (iw)"F{ u(x,t) } = (iw)"U (2, c). (15.98) 


Ox ὃ 
In this way an ordinary differential equation is obtained in the image space. Furthermore the boundary 
and initial conditions are to be transformed into the image space. 


F{ το) = I u(r,t)) = aU (no). (15.99) 


2. Solution of the One-Dimensional Wave Equation for a Homogeneous Medium 


1. Formulation of the Problem The one-dimensional wave equation with vanishing perturbation 
term and for a homogeneous medium is: 


Ure — ug = 0. (15.100a) 
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Like the three-dimensional wave equation (see 9.2.3.2,p. 590), the equation (15.100a) is a partial dif- 


ferential equation of hyperbolic type. The Cauchy problem is correctly defined by the following initia. 


conditions 
u(x,0) = f(x) (—-oo«r«0oo), w(x,0)= g(x) (0! «οο). ( 


2. Fourier Transformation The Fourier transformation is to be performed with respect to x 


the time coordinate is kept constant: 


F{ u(x,t) } = U(w,t). (15. 
One gets: 
(iw)?U(w, t) — 0 with ( 
F(u(z,0)) = Ulw, 0) = F{ f(e)} = Fl), (1 
F{ u(x, 0) } = U'(w,0) = F{ g(x) } = Ge). ( 
wU +U" = 0. ( 


The result is an ordinary differential equation with respect to { with the parameter w of the transform. 


The general solution of this known differential equation with constant coefficients is 


U(w,t) = Cet + Coe t, (15.102a 
Determining the constants ΟἹ and C» from the initial values 
U(w,0) = C1 + C2 = F(w), U'(w,0) =iwC, — iw C» = Ω(ω), (15.102b 
gives 
1 1 1 1 
€i = -[F(w)+ =G(w)], C2 = =[F(w) -- —G(w)]. (15.102c 
2 iw 2 iw 
The solution is therefore 
1 1 1 i 
U(w,t) = =[F(w) + τσω )]e** + =[ Fw) — —G(w)]e ***. (15.102d 
2 2 iw 
3. Inverse Transformation Using the shifting theorem 
F{ f(ax -- b) ) = 1/α : e" Είω/α), (15.103a. 
for the inverse transformation of F(w), yields 
FH e&tF(w)} = fla +t), FU[e*'Fw)] = f(x—t). (15.103b 


Applying the integration rule 


τμ λον JE τω) gives (15.103c 


F {= Gwe uth - q^ 1{ G(w)e! dr = E (τ. 1-1} ἅτ = if (z) dz (15.103d 


after substituting T + t = z. Analogously to the previous integral 
a—t 
Ff- ewe} =- fao ( 
follows. Finally, the solution in the original space is 
att 


undc TEDES μα- κ | ot 


5.100b 
x where 
5.101a 
5.101b 
5.101c 
5.101d 
5.101e 


15.103e) 


(15.104) 
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15.4 Z-Transformation 


In natural sciences and also in engineering one often has to distinguish between continuous and discrete 
processes. While continuous processes can be described by differential equations, the discrete pro- 
cesses result mostly in difference equations. The solution of differential equations mostly uses Fourier 
and Laplace transformations, however, to solve difference equations other operator methods have been 
developed. The best known method is the z-transformation, which is closely related to the Laplace 
transformation. 


15.4.1 Properties of the Z-Transformation 
15.4.1.1 Discrete Functions 


Ifa function f(t) (0 € t < oo) is known only at discrete values 
tn = nT (n = 0,1,2,...; T > 018 a constant) of the argu- 
ment, then one writes f(nT) = f, and forms the sequence 
{fn}. Such a sequence is produced, e.g., in electrotechnics by 
“scanning” a function f(t) at discrete time periods t,,. Its rep- 
resentation results in a step function (Fig. 15.27). 


0 T 2TOT ... n The sequence { fn } and the function f (nT) defined only at dis- 
crete points of the argument, which is called a discrete func- 
Figure 15.27 tion, are equivalent. 


15.4.1.2 Definition of the Z- Transformation 


1. Original Sequence and Transform 
The infinite series 
oo 1 n 
F(z) - Y f. (2) (15.105) 
z 
n=0 
is assigned to the sequence {Γη}. If this series is convergent, then the sequence { fn} is called z-transfor- 
mable, and it is denoted by 
F(z) = Z{ fa}. (15.106) 
{fn} is called the original sequence, F(z) is the transform, z denotes a complex variable and F(z) isa 
complex-valued function. 


E /,—1 (n=0,1,2,...). The corresponding infinite series is 
oo T n 
F(2)-», (5) (15.107) 
n=0 ο 


It represents a geometric series with common ratio 1/z, which is convergent if |-| < 1 and its sum is 


> 1. Therefore, the sequence {1} is z-transformable for |-| < 1, 


2 1 
F(z)= S It is divergent for 


i.e., for every exterior point of the unit circle |z| = 1 in the z plane. 

2. Properties 

Since the transform F(z) according to (15.105) is a power series of the complex variable 1/2, the prop- 
erties of the complex power series (see 14.3.1.3, p. 750) imply the following results: 

a) For a z-transformable sequence {fn}, there exists a real number R such that the series (15.105) is 
absolutely convergent for |z| > 1/R and divergent for |z| < 1/R. The series is uniformly convergent 
for |z| > 1/Ho > 1/R. R is the radius of convergence of the power series (15.105) of 1/2. If the series 
is convergent for every |z| > 0, then R = oo. For non z-transformable sequences there is R = 0. 


b) If { fa} is z-transformable for |z| > 1/18, then the corresponding transform F(z) is an analytic func- 
tion for |z| > 1/R and it is the unique transform of ( fa}. Conversely, if F(z) is an analytic function 
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for |z| > 1/R and is regular also at z = oo, then there is a unique original sequence {fn} for F(z). 
Here, F(z) is called regular at z = oo, if F(z) has a power series expansion in the form (15.105) and 
F(oc) = 

3. Limit Theorems 

Analogously to the limit properties of the Laplace transformation ((15.7b), p. 770), the following limit 
theorems are valid for the z-transformation: 

a) If F(z) = Z{ fn} exists, then 


Jo = lim F(z). (15.108) 
Here z can tend to infinity along the real axis or along any other path. Since the series 
1 1 
AFG)- fh) = fit hyt fato (15.109) 
2 1 1 i " 
z [ro - fo- fiz} Sat fare Q4 (15.110) 


are obviously z transforms, analogously to (15.108) one gets 


fi = lim 2{F(2)— fo}, f= lim 2 {Fl) - h- Ai) — (15.111) 


The original sequence { fn } can be determined from its transform F(z) in this way. 
b) If lim f, exists, then 
noo 


]im fn = lim (2 — 1)F(z). (15.112) 


However the value lim fn from (15.112) can be determined only if its existence is guaranteed, since the 
noo 
above statement is not reversible. 
z 
E /,—(-1" (n=0,1,2,...). Then Z{f,} = l 


not exist. 


15.4.1.3 Rules of Calculations 


In applications of the z-transformation it is very important to know how certain operations defined on 
the original sequences affect the transforms, and conversely. For the sake of simplicity here the notation 
F(z) = Z(f,) for |z| > 1/R is used. 


1. Translation 


and lim (2 - DT = 0, but lim (—1)" does 


Forward and backward translations are distinguished. 
1. First Shifting Theorem: — Z(f, 4) = 2 ^F(z) μμ (15.119) 
here fn-p = 0 is defined for n — k < 0. 


v=0 a 


2. Second Shifting Theorem: 2{f,.,} = z^ [Fe 
2. Summation 


1 
For |z| > max (1.3) 
r |z| > max F 


2{Σ ZE = FO. (15.115) 


v=0 
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3. Differences 
For the differences 


Afa = fari — fan, A” fa = A(A Tt fa)  (m=1,2,...; Af. = fa) (15.116) 

the following equalities hold 
Z{A fn} = Ga 1)F(z) — z fo, 
Z{A? fr} (z — 1} F(z) — z(z — 1) fo — 2A fo, 
: = : (15.117) 
k-1 

ZIA" f.) (z—1)*F(z) — z Σ (z — 1*7"! A" fo. 
4. Damping IA 
For an arbitrary complex number λ # 0 and |z| > R 

ZU fa} = F (3) (15.118) 


5. Convolution 
The convolution of two sequences { fn} and {gn} is the operation 


δι * In = iy Sv Gn—v- (15.119) 


v=0 
If the z-transformed functions Z{ fn} = F(z) for |z| > 1/R; and Z{g,} = G(z) for |z| > 1/Rz exist, 
then 

ZI fa * ga) = F(z)G(z) (15.120) 


for |z| > max Gx . Relation (15.120) is called the convolution theorem of the z-transformation. 


m) 
Ri’ R 
It corresponds to the rules of multiplying two power series. 
6. Differentiation of the Transform 
LF (z 
Z(nf,) = -, 2.03, 
dz 
Higher-order derivatives of F(z) can be determined by the repeated application of (15.121). 
7. Integration of the Transform 
Under the assumption fo = 0, 


z {5 = [Fa (15.122) 


15.4.1.4 Relation to the Laplace Transformation 
Describing a discrete function f(t) (see 15.4.1.1, p. 794) as a step function, then 
f(t) = f(nT) = fr for nT <t<(n4+1)T (n=0,1,2,...; T >0, T const) (15.123) 


holds. Using the Laplace transformation (see 15.2.1.1, 1., p. 770) for this piecewise constant function, 
for T = 1 yields: 


(15.121) 


oo "tl ορ. e—(n+1)p 1— e? & 


Lif(t)}}= Ε(ῳ) - 5; /4 se" dt => fas = ΣΣ fre ™. — (15124) 


n=0 n=0 p D n= 


n 


The infinite series in (15.124) is called the discrete Laplace transformation and is denoted by D: 


D(f()) = D{ fn} = Σ fre”. (15.125) 


n=0 
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After the substitution of e? = z in (15.125) 14 fa} represents a series with powers of 1/z, which is 
a so-called Laurent series (see 14.3.4, p. 752). The substitution e? = z suggested the name of the z 
transformation. With this substitution from (15.124) one finally gets the following relations between 
the Laplace and z-transformation in the case of step functions: 


pF(p) = (1 - =) F(z) (15.126a) or pl{f(t)} = (1 - =) Z{fa}. (5.1360) 


In this way the relations of z-transforms of step functions (see Table 21.15, p. 1128) can be transformed 
into relations of Laplace transforms of step functions (see Table 21.13, p. 1109), and conversely. 


15.4.1.5 Inverse of the Z-Transformation 
The inverse of the z-transformation is to find the corresponding unique original sequence { fn } from its 
transform F(z): 


Z-{F(2)} = {fa} (15.127) 
There are different possibilities for the inverse transformation. 
1. Using Tables 


If the function F(z) is not given in tables, then one can try to transform it to a function which is given 
in Table 21.15. 


2. Laurent Series of F(z) 


Using the definition (15.105), p. 794 the inverse transform can be determined directly if a series expan- 
sion of F(z) with respect to 1/z is known or if it can be determined. 


1 
3. Taylor Series of F (i) 


1 
Since F (5) is a series of increasing powers of z, from (15.105) and using the Taylor formula follows 


1 g” T 
h=- r (2) 
n! dz” ž 


4. Application of Limit Theorems 


Using the limits (15.108) and (15.111), p. 795, the original sequence {fn} can be directly determined 
from its transform F(z). 

2 
= οτι, . Using the previous four methods: 
1. Partial fraction decomposition (see 1.1.7.3, p. 15) of F(z)/z yields functions which are contained in 
Table 21.15. 


(n — 0,1,2,...). (15.128) 


ᾱ--0 


E F(z) 


F(z) and therefore f, —2(2"—m-—1) forn> 0. 


22 
z—2 (z-1)? z-1 
2. By division F(z) gets a series with decreasing powers of z: 
2z 1 1 1 1 1 
F(z)= S—4318:—2 273 | 85 +225 | 92-5 114-5 Eee. (15.129) 
From this expression one gets fo = fı = 0, f2 = 2, fg — 8, fa = 22, fs — 52, [εξ 114, ..., but nota 
closed expression is obtained for the general term fn. 


1 
3. For formulating F Ε 


) and its required derivatives, (see (15.128)) it is advisable to consider the 
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partial fraction decomposition of F(z) 


r(2) 2 25 2 ε(} -ο -- 
1 1-22 (l-z)? 1-z' d z a κ... 
ul 9) Š id : ie = (5) =0 for z=0 
dz (1-2:2? (1-29 (l-2)2? " dz S 
ap(l ο}. 1 
. ie. = rz=0, 
dz? (1—2z)3 (1-2)! (1-2) a. ᾿ 
a 1 
3 3 
J ii uid τ rds Yla for z=0 
dz3 (l—2z)4 (l-z)® (l1-2z)4’ " dz τ 
from which fo, fi, fo, fs, ... are easily obtained considering (15.128). 
4. Application of the limit theorems (see 15.4.1.2, 3., p. 795) gives: 
. j 2z 
fo= jim F(z)= jim 312153 =i; (15.131a) 
227 
= lim 2(F = li ; 15.1311 
fi pm (F(z) — fo) 2309 3915 424522 0, (15.131b) 
: ‘ 22} . 
fy = lim ? (Fe fo fiz) = lim -aa (15.1319) 
fs = lim αὖ (Fe fo — f 1 Í 3 — lim ét a =) =8 (15.131d) 
$7 29^ dE EE 209^ VB 4452-2 2 petuum 


where the Bernoulli-l’Hospital rule is applied (see 2.1.4.8, 2., p. 56). The original sequence ( fn} can 
be determined successively. 


15.4.2 Applications ofthe Z-' Transformation 


15.4.2.1 General Solution of Linear Difference Equations 


A linear difference equation of order k with constant coefficients has the form 


AkYntk + Ak—1Yn+k-1 + 11+ + azYn+2 + UYn41 + doys — Gn (n — 0,1,2...). (15.132) 
Here k is a natural number. The coefficients a; (i = 0,1,..., k) are given real or complex numbers and 
they do not depend on n. Here αρ and a; are non-zero numbers. The sequence {gn} is given, and the 
sequence {yn} is to be determined. 
To determine a particular solution of (15.132) the values yo, yi. ..., Yx—1 have to be previously given. 
Then the next value y; can be determined for n = 0 from (15.132). Next one gets y,41 for n = 1 from 
Vis Uo. es Yk and from (15.132). In this way all values y, can be calculated recursively. However 
a general solution can be given for the values y, with the z-transformation, using the second shifting 
theorem (15.114) applied for (15.132): 


ay z^ [Y (2 —9p—4Wiz =e κα] +++++a12[¥(z) -- yo] + aoY (z) = G(z). (15.133) 


Here one denotes Y(z) = Z{y,} and G(z) = Z(g,). Substituting a,z* + ay 41271 +--+» + aiz + ao = 
p(z), the solution of the so-called transformed equation (15.133) is 


Gl) + A ο, Y ajz. (15.134) 


p(z) i=0 j=i+1 
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As in the case of solving linear differential equations with the Laplace transformation, there is the 
similar advantage of the z-transformation that initial values are included in the transformed equation, 
so the solution contains them automatically. The required solution {yn} = Z7 Y(z)} follows from 


(15.134) by the inverse transformation discussed in 15.4.1.5, p. 797. 


15.4.2.2 Second-Order Difference Equations (Initial Value Problem) 


The linear second-order difference equation has the form 


Ynt2 + G1Un41 + Gon = Gn, ( 5. 
where yo and y; are given as initial values. Using the second shifting theorem for (15.135) the trans- 


formed equation is 


: 1 
27 [Y (z) — yo — f| aiz[Y (z) — yo] + aoY (z) = G(z). (15. 
Substituting z? + a4z + ag = p(z), the transform is 
1 z(z-a 2 
¥(2) = O(2) wA Γαι 4 y (15. 


p(z) 


35 


36 


37 


If the roots of the polynomial p(z) are αι and ag, then αι Z 0 and ay ¥ 0, otherwise αρ is zero, and 
then the difference equation could be reduced to a first-order one. By partial fraction decomposition 


and applying Table 21.15 for the z-transformation one gets from 


1 2 2 
for a αν. 
z αι — Q2 (: τῶ =z) 1 F 02, 
p(z) Car for a, = a», 
at — az 
2 ------- for o4 Y a», 
Στ) = {pa} 4 αι -- a2 res (15.138a) 
p(z) no? | for o4 = as. 


Since po — 0, by the second shifting theorem there is 


2 > 
gigi t = } = {pn 15.138b 
ΓΕ ple) - Pav} l 
and by the first shifting theorem 
1 1 5 
zt =Z! E } Dua. 15.138c 
πο z paj j T un) ( 


Substituting here p_; = 0, based on the convolution theorem one gets the original sequence with 


n 
Un — 5 Pv-19n—v + YolPn+1 + apn) Tub. (15.138d 


v=0 


Since p. , = po = 0, this relation and (15.138a) imply that in the case of αι Z az it follows 


ie αἴ} abt oT oH ο αἰταθὶ, αἴταἳ 
Yn 5 Üu + Yo Fa Fu , (15.138e) 
= αι — Q3 αι — Q2 O1 — 0» O1 — 0» 
This form can be further simplified, since αι = --(αι + αὐ) and αρ = 0402 (see the root theorems of 
Vieta, 1.6.3.1, 3., p. 44), so 
n at= at tak . αἰ ταῦ - 
Yn = Y, dn» Yoao Hyr : (15.138f) 
er) αι — Q3 αι — Q3 O4 — Q2 
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In the case of αι = a» similarly 
n 
Yn = Y, da (v — Vat? — aoln — Yat? + yino. (15.138g) 
v=2 


In the case of second-order difference equations the inverse transformation of the transform Y (z) can 
be performed without partial fraction decomposition using correspondences such as, e.g., 


z sinh bn 
g^ { } Lupe 15.139 
22 — 2α2 cosh b + a? * ^ osnhn ( ) 
and the second shifting theorem. By substituting a; = —2a cosh b, and ag = a? the original sequence 
of (15.137) becomes: 
1 , 
Yn = nb x ds a^ ? sinh(v — 1)b — yoa” sinh(n — 1)b + ya"! sinh nbd} . (15.140) 
sinh b | 


This formula is useful in numerical computations especially if ay and αι are complex numbers. 
Remark: Notice that the hyperbolic functions are also defined for complex variables. 


15.4.2.3 Second-Order Difference Equations (Boundary Value Problem) 


It often happens in applications that the values y, of a difference equation are needed only for a finite 
number of indices 0 € n € N. In the case of a second-order difference equation (15.135) both boundary 
values yo and yy are usually given. To solve this boundary value problem one starts with the solution 
(15.138f) of the corresponding initial value problem, where instead of the unknown value y; it is to 
introduce yy. Substituting n = N into (15.138f), y; can be obtained which depends on yo and yy: 

1 N 


κ | yoto(ay — ad!) + yw (o1 — a2) - Σ (at | — oZ gu] - (15.141) 
αι — A v=2 


y= 


Substituting this value into (15.138f) 


1 n 1 a” — o? N 
ge ν-1 ν-1 1 2 ν-1 ν-1 
Yn Silay a3") In—v N N (αἵ αν )üN-v 
αι — 02 4 ay — ag ayy — ay 55 
1 P No n n N ; n n 15.142 
EN EN ax Unos ay — o103 ) + ynlaat — o5)]. (15.142) 
1 —05 


The solution (15.142) makes sense only if a] — αὖ 7 0 holds. Otherwise, the boundary value prob- 
lem has no general solution, but analogously to the boundary value problems of differential equations 
eigenvalues and eigenfunctions emerge. 


15.5 Wavelet Transformation 
15.5.1 Signals 


If a physical object emits an effect which spreads out and can be described mathematically, e.g., by a 
function or a number sequence, then it is called a signal. 

Signal analysis means to characterize a signal by a quantity that is typical for the signal. This means 
mathematically: The function or the number sequence, which describes the signal, will be mapped into 
another function or number sequence, from which the typical properties of the signal can be clearly 
seen. For such mappings, of course, some informations can also be lost. 

The reverse operation of signal analysis, i.e., the reconstruction of the original signal, is called signal 
synthesis. 

'The connection between signal analysis and signal synthesis can be well represented by an example 
of Fourier transformation: A signal f(t) (t denotes time) is characterized by the frequency w. Then, 
formula (15.143a) describes the signal analysis, and formula (15.143b) describes the signal synthesis: 
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F(w) = fewro dt (15.143a) and f(t) = = / et F(u) dw. (15.143b) 


15.5.2 Wavelets 


The Fourier transformation has no localization property, i.e., if a signal changes at one position, then 
the transform changes everywhere without the possibility that the position of the change could be 
recognized “at a glance”. The basis of this fact is that the Fourier transformation decomposes a signal 
into plane waves. These are described by trigonometric functions, which oscillate with the same period 
for arbitrary long time. However, for wavelet transformations there is an almost freely chosen function 
ib, the wavelet (small localized wave), that is shifted and compressed for analysing a signal. 

Examples are the Haar wavelet (Fig. 15.28a) and the Mexican hat (Fig. 15.28b). 


E A Haar wavelet: y(t) A y(t) 
1 if ULTA 1 1 
p= 4-1 if $< a<i, (15.14) | 4 


0 otherwise. πι uir 

E B Mexican hat: i i 
PO 

w(x) = ——.e t2 15.145 

να) dx? ( a) b) 

(1— α λε”, (15.146) 


Figure 15.28 


Generally, it holds that every function ~ comes into consideration as a wavelet if it is quadratically 
integrable and its Fourier transform Ψ(ω) according to (15.143a) results in a positive finite integral 


| HOP (15.147) 


ω] 


Concerning wavelets, the following properties and definitions are essential: 
1. For the mean value of the wavelet: 


/ w(t) dt = 0. (15.148) 
2. The following integral is called the k-th moment of a wavelet w: 
mm / tlt) dt. (15.149) 


The smallest positive integer n such that μη 4 0, is called the order of the wavelet 4. 

W For the Haar wavelet (15.144), n = 1, and for the Mexican hat (15.146), n = 2. 

3. When up = 0 for every k, w has infinite order. Wavelets with bounded support always have finite 
order. 

4. A wavelet of order n is orthogonal to every polynomial of degree € n — 1. 


15.5.3 Wavelet Transformation 


For a wavelet y(t) a family of curves can be formed with parameter a: 


dst) = (5) (a 0). (15.150) 


la| νά 
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In the case of |a| > 0 the initial function v(t) is compressed. In the case of a < 0 there is an additional 
reflection. The factor 1/4/|a| is a scaling factor. 


The functions ψα({) can also be shifted by a second parameter b. Then a two-parameter family of curves 
arises: 


hasy (‘ = J (a treal; 420): (15.151) 


la| a 
'The real shifting parameter b characterizes the first moment, while parameter a gives the deviation of 
the function ψαρ(ί). The function Ya p(t) is called a basis function in connection to the wavelet trans- 
formation. 
The wavelet transformation of a function f(t) is defined as: 


Loflad) =e f Πθνω(θά- a ] row (‘ - J d. (15.152) 
o al oo 
For the inverse transformation: 
f(t) 2c / NE da db. (15.152b) 
Herecisa men dependent on the special wavelet w. 
W Using the Haar wavelets (15.146) gives 


= i if berzcbkaE 
«( )- -1 if b+a/2<t<b+a, 


a 0 otherwise 


and therefore 


1 b+a/2 bra 
£e) e T (P roa- (ora) 


E L E Lr" roa =f F(t) a). (15.153) 


The value £y f (a, b) given in (15.153) represents the difference of the mean values of a function f(t) 


, " s a Š 
over two neighboring intervals of length ΕΣ connected at the point b. 


Remarks: 


1. The dyadic wavelet transformation has an important role in applications. As basis functions are 
used the functions 


1 -- 21} 
vi;(t) = 7” (‘ zi η, (15.154) 


i.e., different basis functions can be generated from one wavelet (t) by doubling or halving the width 
and shifting by an integer multiple of the width. 


2. A wavelet v(t) is called an orthogonal wavelet, if the basis functions given in (15.154) form an 
orthogonal system. 

3. The Daubechies wavelets have especially good numerical properties. They are orthogonal wavelets 
with compact support, i.e., they are different from zero only on a bounded subset of the time scale. 
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They do not have a closed form representation (see [15.9]). 


15.5.4 Discrete Wavelet Transformation 
15.5.4.1 Fast Wavelet Transformation 


The integral representation (15.152b) is very redundant, and so the double integral can be replaced 
by a double sum without loss of information. Considering this idea at the concrete application of the 
wavelet transformation one needs 

1. an efficient algorithm of the transformation, which leads to the concept of multi-scale analysis, and 
2. an efficient algorithm of the inverse transformation, i.e., an efficient way to reconstruct signals from 
their wavelet transformations, which leads to the concept of frames. 

For more details about these concepts see [15.9], [15.1]. 

Remark: The great success of wavelets in many different applications, such as 

e calculation of physical quantities from measured sequences 

e pattern and voice recognition 

e data compression in news transmission 

is based on “fast algorithms”. Analogously to the FFT (Fast Fourier Transformation, see 19.6.4.2, 
p. 993) one talks here about FWT (Fast Wavelet Transformation). 


15.5.4.2 Discrete Haar Wavelet Transformation 


An example of a discrete wavelet transformation is the Haar wavelet transformation: The values f; (i = 


1,2,..., N) are given from a signal. The detailed values d; (i = 1,2,..., N/2) are calculated as: 
1 1 
Si —Va- T 2i]: d == 2i—1 — J2i)- 15.155 
yg e ic fai), yg 1— foi) ( ) 


The values d; are to be stored while the rule (15.155) is applied to the values s;, i.e., in (15.155) the 
values f; are replaced by the values s;. This procedure is continued, sequentially so that finally from 


N- 1 n n n 1 n n 
οἱ T Vi (s, +4 ο) f dl μπε Vi Cu = si) (15.156) 


a sequence of detailed vectors is formed with components df”. Every detailed vector contains infor- 
mation about the properties of the signals. 

Remark: For large values of N the discrete wavelet transformation converges to the integral wavelet 
transformation (15.1522). 


15.5.5 Gabor Transformation 

Time-frequency analysis is the characterization of a signal with respect to the contained frequencies 

and time periods when these frequencies appear. Therefore, the signal is divided into time segments 

(windows) and a Fourier transform is used. It is called a Windowed Fourier Transformation (WFT). 

The window function should be chosen so that a signal is con- 

sidered only in the window. Gabor applied the window function 
g 


g(t) = > e 20? (15.157) 
πσ 


(Fig. 15.29). This choice can be explained as g(t), with the 
“total unit mass”, is concentrated at the point t = 0 and the 


. width of the window can be considered as a constant (about 
Figure 15.29 20). 
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The Gabor transformation of a function f(t) then has the form 
)= f ro Ff (g(t — s)e“™* dt. (15.158) 


This determines, with which complex amplitude the dominant wave (fundamental harmonic) εἰσ! oc- 
curs during the time interval [s — c, s + o] in f, i.e., if the frequency w occurs in this interval, then it 
has the amplitude |G f (w, s)|. 


15.6 Walsh Functions 
15.6.1 Step Functions 


Orthogonal systems of functions have an important role in the approximation theory of functions. For 
instance, special polynomials or trigonometric functions are used since they are smooth, i.e., they are 
differentiable sufficiently many times in the considered interval. However, there are problems, e.g., the 
transition of points of a rough picture, when smooth functions are not suitable for the mathematical 
description, but step functions, piecewise constant functions are more appropriate. Walsh functions 
are very simple step functions. They take only two function values +1 and —1. These two function 
values correspond to two states, so the Walsh functions can be implemented by computers very easily. 


15.6.2 Walsh Systems 


Analogously to trigonometric functions also periodic step functions can be considered. The interval 
I = [0, 1) is used as a period interval and it is divided into 2” equally long subintervals. Suppose Sn 
is the set of periodic step functions with period 1 over such an interval. The different step functions 
belonging to S, can be considered as vectors of a finite dimensional vector space, since every function 
g € S, is defined by its values go, g1, φ9,..., gn 1 in the subintervals and it can be considered as a 
vector: 


g = (g0; 91; 92,- - 92 3). (15.159) 
The Walsh functions belonging to 5, form an orthogonal basis with respect to a suitable scalar product 
inthis space. The basis vectors can be enumerated in many different ways, so one can get many different 
Walsh systems, which actually contain the same functions. There are three of them which should be 
mentioned: Walsh-Kronecker functions, Walsh-Kaczmarz functions and Walsh-Paley functions. 


'The Walsh transformation is constructed analogously to the Fourier transformation, where the role 
of the trigonometric functions is taken by the Walsh functions. One gets, e.g., Walsh series, Walsh 
polynomials, Walsh sine and Walsh cosine transformations, Walsh integral, and analogously to the fast 
Fourier transformation there is a Fast Walsh Transformation. For an introduction in the theory and 
applications of Walsh functions see [15.6]. 


16 Probability Theory and 
Mathematical Statistics 


When experiments or observations are made, various outcomes are possible even under the same con- 
ditions. Probability theory and statistics deal with regularity of random outcomes of certain results 
with respect to given experiments or observations. (In probability theory and statistics, observations 
are also called experiments, since they have certain outcomes.) It is supposed, at least theoretically, 
that these experiments can be repeated arbitrarily many times under the same circumstances. Its ap- 
plication have these disciplines of mathematics in the statistic assessment of mass phenomena. The 
mathematical handling of random phenomena is also summarized in the notion stochastics. 


E è 
16.1 Combinatorics 

From the elements of a given set often new sets, systems or sequences are composed. Depending on 
the way how to do it, one gets the notions permutation (ordering), combination (selection), and partial 
permutation or arrangement. The notion arrangement combines ordering and selection. The basic 
problem of combinatorics is to determine how many different choices or arrangements are possible with 
the given elements. 


16.1.1 Permutations 


1. Definition 
A permutation of n elements is an ordering of the n elements. 
2. Number of Permutations without Repetition 
The number of different permutations of n different elements is 
Pa=n!. (16.1) 
W ina classroom 16 students are seated on 16 places. There are 16! different possibilities for seating. 
3. Number of Permutations with Repetitions 
The number P, ? of different permutations of n elements containing k identical elements (k < n) is 
P,® = is (16.2) 
W In a classroom 16 school-bags of 16 students are placed on 16 chairs. Four of them are identical. 
There are 16!/4! different placements of the school-bags. 
4. Generalization 


The number P, 1^2») of different permutations of n elements containing m different types of ele- 


ments with multiplicities k1, k»,..., km respectively (kı + k2 +... + km =n) is 


n! 
p, ome) — : 16.3 
a kk! ιν km! ( ) 
E Suppose one composes five-digit numbers from the digits 4, 4, 5, 5, 5. One can compose P? = 
5! p 
2 7 10 different numbers. 


16.1.2 Combinations 
1. Definition 


A combination is a choice of k elements from n different elements not considering the order of them. 
This is called a combination of k-th order and one distinguishes between combinations with and without 
repetition. 
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2. Number of Combinations without Repetition 


The number C,,“) of different possibilities to choose k elements from n different elements not consid- 
ering the order is 


t n ; ] , -- 
Οἱ = (;) with 0 k Xn (see binomial coefficient in 1.1.6.4, 3., p. 13), (16.4) 
if any element is chosen at most once. This is called a combination without repetition. 
W There are ( i) = 27405 possibilities to choose an electoral board of four persons from 30 partici- 


pants. 


3. Number of Combinations with Repetition 


The number of possibilities to choose k elements from n different ones, repeating each element arbi- 
trarily times and not considering the order is 


f n+k—-1 

Ge = : (16.5) 
k 

In other words, the number of different selections of k elements chosen from n different elements is 

considered, where each of n can be chosen more then once. 

k+6-1 


E Rolling k dice, Cs) = ( s 


) different results are possible. Consequently, for two dice there 


are Ορ) = J = 21 different results. 


16.1.3 Arrangements 


1. Definition 
An arrangement is an ordering of k elements selected from n different ones, i.e., arrangements are 
combinations considering the order. 


2. Number of Arrangements without Repetition 


The number V,™® of different orderings of k different elements selected from n different ones is 


Va") = a(t) —n(n—-1)n—2)...(n—k--1) (O<k <n). (16.6) 


W How many different ways are there to choose a chairman, his deputy, and a first and a second assistant 


T : "E . [30 
for them from 30 participants at an election meeting? The answer is ( 4 ) 4! = 657720. 


3. Number of Arrangements with Repetition 
An ordering of k elements selected from n different ones, where any of the elements can be selected 
arbitrarily many times, is called an arrangement with repetition. Their number is 

y, 9 = mf. (16.7) 
Wl A: In a soccer-toto with 12 games there are 3! different outcomes. 


B DB: With the digital unit called a byte which contains 8 bits can be represented 2° = 256 different 
symbols (see for example the well-known ASCII table). 
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16.1.4 Collection of the Formulas of Combinatorics (see Table 16.1) 


Table 16.1 Collection of the formulas of combinatorics 


Type of choice or Number of possibilities 
selection of kfrom | without repetition with repetition 
n elements (k € n) (k € n) 

n! 
Permutations P, = ni(n=k)| PM = W 

; 1 : n+k—-1 

Combinations c," = (;) c," = (" τ k ) 
Arrangements γι = k! (;) WO = nk 


16.2 Probability Theory 


16.2.1 Event, Frequency and Probability 
16.2.1.1 Events 
1. Different Types of Events 


All the possible outcomes of an experiment are called events in probability theory, and they form the 
fundamental probability set A. 

There are to be distinguished the certain event, the impossible event and random events. 

The certain event occurs every time when the experiment is performed, the impossible event never oc- 
curs; a random event sometimes occurs, sometimes does not. All possible outcomes of the experiment 
excluding each other are called elementary events (see also Table 16.2). Here the events of the funda- 
mental probability set A are denoted by A, B,C,..., the certain event by J, the impossible event by 
O. Some operations and relations between the events are denoted as given in Table 16.2. 


2. Properties of the Operations 
The fundamental probability set forms a Boolean algebra with complement, addition, and multiplica- 
tion defined in Table 16.2, and it is called the field of events. The following rules are valid: 


1.4) ΑΒ- Β.Α, (16.8) 1. b) ΑΒ = BA. (16.9 
2.a) A+A=A, (16.10 2. b) ΑΑ-- A. (16.11 
3.a) A+(B+C)=(A+B)+C, (1612 3. b) A(BC) = (AB)C. (16.13 
4.a) A+A=I, (16.14 4. b) AA=O. (16.15 
5.a) A(B+C)=AB+AC, (16.16 5.b) A+BC=(A+B)(A+C). (16.17 
6.a) A+B=AB (16.18 6.b) AB=A+B (16.19 
7.a) Β- Α- BÀ (16.20 7.b) A=I-A (16.21 
8.a) A(B—C)- AB — AC, (16.22 8.5) AB-C-(A-C)B-C) (16.23 
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9.a) OCA, (16.24) 9.b) ACI. (16.25 


10. From Ας B follows a) A= AB (16.26) and b) B=A+BA_ and conversely. (16.27 


11. Complete System of Events: A system of events A, (a € 0, 0 is a finite or infinite set o 
indices) is called a complete system of events if the following is valid: 


11.2) AAs — O fora #8 (16.28) and 1l. b) A=]. (16.29 


ac 


Table 16.2 Relations between events 


Name Nota- Definition 
tion 
1.| Complementary event of A: A A occurs exactly if A does not. 
2. | Sum of events A and B: A+B | A+Bisthe event which occurs if A or B or both occur. 
3.) Product of the events A AB AB is the event which occurs exactly if both A and B 
and B: occur. 
4. | Difference of the events A| A— B | A— B occurs exactly if A occurs and B does not. 
and B: 
5. | Event as a consequence of Cc A C B means that from the occurrence of A follows 
the other: the occurrence of B. 
6. Elementary or simple E From E = A+ Bit follows that E = Aor E — B. 
event: 
7. | Compound event: Event, which is not elementary. 
8. | Disjoint or exclusive events | AB — O | The events A and B cannot occur at the same time. 
A and B: 
ΒΑ: Tossing two coins: Elementary events for the separate toss- 
ing: See the table on the right. l l l Head | Tall | 
1. Elementary event for tossing both coins, e.g.: First coin shows T. Com) Ay | Ap 
head, second shows tail: A11 Ago. 2. Coin | An | Az 


Compound event for tossing both coins: First coin shows head: 
Ay = AnA + AA» 
2. Compound event for tossing one coin, e.g., the first one: First coin shows head or tail: Ay, + A15 = 1. 
Head and tail on the same coin are disjoint events: Ay, 415 = O. 
W B: Lifetime of light-bulbs. 
Defining the elementary events Απ: the lifetime t satisfies the inequalities (n — 1)At < t < nAt (n = 
1,2,..., and At > 0, arbitrary unit of time). " 
Compound event A: The lifetime is at most nAt, i.e., A = Σ Αν. 


νι 


16.2.1.2 Frequencies and Probabilities 


1. Frequencies 

Let A be an event belonging to the field of events A of an experiment. If event A occurred n4 times 
repeating the experiment n times, then n4 is called the frequency, and n4/n = ha is called the relative 
frequency of the event A. The relative frequency satisfies certain properties which can be used to built 
up an axiomatic definition of the notion of the probability P(A) of event A in the field of events A. 

2. Definition of the Probability 

A real function P defined on the field of events is called a probability if it satisfies the following proper- 
ties: 
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1. For every event A € A 


0x P(A) <1, and 0<hg <1. (16.30) 
2. For the impossible event O and the certain event [ 
P(O)=0, P(I) 21, and ho=0, hy; =1. (16.31) 


3. Ifthe events A; € A (i =1,2,...) are finite or countably many mutually exclusive events 
(A; Aj = O for i Æ k), then 
P(A, + Ag+...) = P(A1) + P(Ag) -..., and hau A. = ha, tha, Ἔ .... (16.32) 


Remark: The axiomatization of the probability theory, which is based on the assumption of three 
conditions of the preceding kind, succeeded in 1933 A.N. KOLMOGOROFF (see [16.11]). 


3. Rules for Probabilities 


1. BCA yields P(B) x P(A). (16.33) 
2. P(A)+P(A)=1. (16.34) 
3.a) For n mutually exclusive events A; (i = 1,...,n; A; A, = O, i # k), holds 

P(A, + Ao Ε... An) = P(A1) + Ρ(Α9) +... + P(An). (16.35a) 
3.b) In particular for n = 2 holds 

P(A + B) = P(A) + P(B). (16.35b) 
4. a) For arbitrary events A; (i = 1,...,n), holds 

P(A, +--+ + Απ) = P(Ai) +--+ + P(An) - P(A145) — +++ — Ρ(Αι An) 

—P(A2A3) Hid P(A2An) Ey P(As Αν) 


ΕΡ(ΑΙΑΟ ΑΛ) + +- + Ρ(ΑιΙΑΟΑ/) +: + P(An—-2An-1An) — 


4 (—1) 1 P(A, Aa... An). (16.36a 


4.9) In particular for n = 2 holds P(A; + A2) = P(A1) + P(A2) — P(A, Ao). (16.36b 
5. Equally likely events: If every event A; (i = 1,2,...,n) ofa finite complete system of events occurs 
with the same probability, then 
1 
P(Ai) 2 -. (16.37 
n 
If Aisasum of m. (m < n) events A; (i = 1,2,...,n) with the same probability of a complete system, 
then 
P(A) = τ (16.38 


4. Examples of Probabilities 

1 
W A: The probability P(A) to get a 2 rolling a fair die is: P(A) = 6 
W B: What is the probability of guessing four numbers for the lotto “6 from 49”, i.e., 6 numbers are 
to be chosen from the numbers 1,2,...,49. 
6 


If 6 numbers are drawn, then there are 6 


) possibilities to choose 4. On the other hand there are 
DX) = (3) possibilities for the false numbers. Altogether, there are a different possibilities to 


draw 6 numbers. Therefore, the probability P(A4) is: 


6 43 , 
P(Aa) a ) SEU 0.0968 %. 


6 
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Similarly, the probability P(A¢) to get a direct hit is: 


1 7 T 
P(Ac) = zu. = 0.715 - 1077 = 7.15 - 10 5 96. 
(9) 
B C: What is the probability P(A) that at least two persons have birthdays on the same day among 
k persons? (The years of birth must not be identical, and one supposes that every day has the same 
probability of being a birthday.) 
It is easier to consider the complementary event A: All the k persons have different birthdays. One 
gets: 
=, 365 365—1 365-2 365 —k +1 
P(A . . edgy 
(4) 365 365 365 365 

From this it follows that 


365 - 364 - 363-...- (365 — k +1) 
365* ` 
k | 10 20 23 30 60 
P(A) | 0.117 0.411 0.507 0.706 0.994 
It is to see that the probability that among 23 and more persons at least two have the same birthday is 
greater than 50 %. 


16.2.1.3 Conditional Probability, Bayes Theorem 
1. Conditional Probability 


The probability of an event B, when it is known that some event A has already occurred, is called a 
conditional probability and it is denoted by P(B|A) or PA(B) (read: The probability that B occurs 

given that Α has occurred). It is defined by 

P(AB) 
P(A) ' 

The conditional probability satisfies the following properties: 

a) If P(A) Z 0 and P(B) 4 0 holds, then 


P(B|A) ΡΙΑ|8) 


P(A) 21- Ρ(Α) --1 


Some numerical results: 


P(B|A) = P(A) #0. (16.39) 


P(B) P(A) (16.40a 
b) If P(A; 4141... An) Z 0 holds, then 
P(AiAa ... Αν) = P(Ai)P(Ao| Ai)... P(As| A Aa... Ana). (16.40b 


2. Independent Events 

The events A and B are independent events if 
P(A|B) = P(A) and P(B|A) = P(B) (16.41a. 

holds. In this case, it is 


P(AB) — P(A)P(B). 


6.41b 


~ 


3. Events in a Complete System of Events 

If A is a field of events and the events B; € A with P(B;) > 0 (i = 1,2,...) form a complete system 
of events, then for an arbitrary event A € A the following formulas are valid: 

a) Total Probability Theorem 


P(A) = 3; P(A|Bi) P(Bi). (16.42) 
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b) Bayes Theorem with P(A) > 0 


Pema ος οκ (16.43) 

> P(A|Bi) P(Bi) 
WB Three machines produce the same type of product in a factory. The first one gives 20 % of the total 
production, the second one gives 30 96 and the third one 50 96. It is known from past experience that 
5 96, 4 96, and 2 96 of the product made by each machine, respectively, are defective. Two types of 
questions often arise: 
a) What is the probability that an article selected randomly from the total production is defective? 
b) If the randomly selected article is defective, what is the probability that it was made, e.g., by the 
first machine? 
Using the following notation gives: 
ο A; denotes the event that the randomly selected article is made by the i-th machine (i = 1, 2,3) with 
P(A1) = 0.2, P(A3) = 0.3, P(A3) = 0.5. The events A; form a complete system of events: 
ο AjA; — 0, Αι + Ag+ As = I. 
e 4 denotes the event that the chosen article is defected. 
ο P(A|A,) = 0.05 gives the probability that an article produced by the first machine is defective; 
analogously P(A|A5) = 0.04 and P(A|A3) = 0.02 hold. 
Now, the answers to the questions are: 
a) P(A) = P(A:))P(A|A1) + P(Az) P(A[A;) + P(As) PCA| Aa) 

= 0.2- 0.05 + 0.3 - 0.04 + 0.5 - 0.02 = 0.032. 

P(A|A1) 0.05 


—02 = 031. 
P(A) 0.032 


16.2.2 Random Variables, Distribution Functions 


Το apply the methods of analysis in probability theory, the notions of variable and function are neces- 
sary. 


16.2.2.1 Random Variable 

A set of elementary events is to be described by a random variable X. The random variable X can be 
considered as a quantity which takes its values x randomly from a subset R of the real numbers. 

If R contains finitely or countably many different values, then X is called a discrete random variable. In 
the case of a continuous random variable, R can be the whole real axis or it may contain subintervals. 
For the precise definition see 16.2.2.2, 2., p. 812. There are also mixed random variables. 

W A: Assigning the values 1, 2, 3, 4 to the elementary events A11, A12, A21, A22, respectively, in example 
A, p. 808, then a discrete random variable X is defined. 


b) P(A1|A) = P(A1) 


W B: Thelifetime T ofarandomly selected light-bulb is a continuous random variable. The elementary 
event T' — t occurs if the lifetime T' is equal to t. 


16.2.2.2 Distribution Function 


1. Distribution Function and its Properties 
The distribution of a random variable X can be given by its distribution function 

F(r)—- P(X €x) for —oo€ x € oo. (16.44) 
It determines the probability that the random variable X takes a value between —oo and x. Its domain 
is the whole real axis. The distribution function has the following properties: 
1. F(-o0) 20, Ε(ή-οο) — 1. 
2. F(x) is a non-decreasing function of x. 
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3. F(x) is continuous on the right. 

Remarks: 

1. From the definition it follows that P(X = a) = F(a) — lim F(z). 
1—a— 


2. In the literature, also the definition F(x) = P(X < x) is often used. In this case 
P(X =a)= lim, F(z) — F(a). 
ra 


2. Distribution Function of Discrete and Continuous Random Variables 
a) Discrete Random Variable: A discrete random variable X, which takes the values x; (i = 
1,2,...) with probabilities P(X = x;) = p; (i = 1,2,...), has the distribution function 

F(x) = Mp. (16.45) 

πισω 

b) Continuous Random Variable: A random variable is called continuous if there exists a non- 
negative function f(x) such that the probability P(X € 5) can be expressed as P(X € S) = fg f(x)dx 
for any domain S such that it is possible to consider an integral over it. This function is the so-called 
density function. A continuous random variable takes any given value x; with 0 probability, so one 
rather considers the probability that X takes its value from a finite interval (a, b]: 


b 
P(ac X «5)— EO dt. (16.46) 
A continuous random variable has an everywhere continuous distribution function: 
F(z)- P(X «α)-- / f(t) dt. (16.47) 


F'(x) = f(x) holds at the points where f(x) is continuous. 

Remark: When there is no confusion about the upper integration limit, often the integration variable 
is denoted by «x instead of t. 

3. Area Interpretation of the Probability, Quantile 

By introducing the distribution function and density function in (16.47), the probability P(X € x) = 
F(x) can be represented as an area between the density function f(t) and the x-axis on the interval 
—oo < t € x (Fig. 16.1a). 


Ελ f(t) 
F(x) 


ev 


> 
0 x t 0 | Χ 
8) b) 
Figure 16.1 
Often there is given (frequently in %) a probability value a. If 
P(X >x)=a (16.48) 


holds, the corresponding value of the abscissa x = za is called the quantile or the fractile of order a 
(Fig. 16.1b). This means the area under the density function f(t) to the right of £a is equal to a. 


Remark: In the literature, the area to the left of x, is also used for the definition of quantile. 
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In mathematical statistics, for small values of a, e.g., a = 5% or a = 1%, is also used the notion 
significance level of first type or type 1 error rate. The most often used quantile for the most important 
distributions in practice are given in tables (Table 21.16, p. 1131, to Table 21.20, p. 1138). 


16.2.2.3 Expected Value and Variance, Chebyshev Inequality 

For a coarse characterization of the distribution of a random variable X , mostly the parameters expected 
value, denoted by u, and variance, denoted by a? are used. The expected value can be interpreted with 
the terminology of mechanics as the abscissa of the center of gravity of a surface bounded by the curve of 
the density function f(x) and the x-axis. The variance represents a measure of deviation of the random 
variable X from its expected value ju. 


1. Expected Value 
If g(X) is a function of the random variable X, then g(X) is also a random variable. Its expected value 
or expectation is defined as: 


a) Discrete Case: E(g(X)) = 5 g(xx)pe, if the series » lg(xi)|py. exists. (16.492) 
k ki 
b) Continuous Case: E(g(X)) = 1 glx) f (x) dz, it [7 lg x)| f(x) da exists. (16.49b) 


The expected value of the random variable with g(.X) = X is defined as 


+00 


ux = E(X) =F tep or / af(x) dz (16.50a) 


—oo 
if the corresponding sum or integral with the absolute values exists. Because of (16.49a,b) 

E(aX +b) — aux +b (a,b const) (16.50b) 
is also valid. Of course, it is possible that a random variable do not have any expected value. 


2. Moments of Order n 


Furthermore one introduces: 


a) Moment of Order n: E(X”), (16.51a) 
b) Central Moment of Order n: E((X — μι)”. (16.51b) 


3. Variance and Standard Deviation 
In particular, for n = 2, the central moment is called the variance or dispersion: 


Σα. nd Ax) pr or 
E((X — ux)) = D(X) = oX = 9 + (16.52) 


if the expected values occurring in the formula exist. The quantity σχ is called the standard deviation. 
The following relations are valid: 


D?(X) = 0% = E(X?) — μι, D(aX--b)-—a?D*(X). (16.53) 
4. Weighted and Arithmetical Mean 
In the discrete case, the expected value is obviously the weighted mean 

E(X) = pıtı +... + Pr&n (16.54) 
of the values z4,..., ἴα “with the μόν pr as weights (k = 1,..., n). The probabilities for the 
uniform distribution are Pi = P2 =... = Pn = 1/n, and E(X) is the arithmetical mean of the values 
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ad 

πι $2... $n 
n ` 

In the continuous case, the density function of the continuous uniform distribution on the finite interval 


(a, b] is 


Ε(Χ) = (16.55) 


l fora<zx<b 
f(x) = baa Jor dE (16.56) 
0 otherwise, 
and it follows that 
b η 
1 ath 2 (-α) 
E(X) = r3 τς Tt^, dm το (16.57) 


5. Chebyshev Inequality 
If the random variable X has the expected value μ and standard deviation c, then for arbitrary A > 0 
the Chebyshev inequality (see 1.4.2.10, p. 31) is valid: 


A 1 
P(|X — u| > ào) < Xx (16.58) 
That is, it is very unlikely that the values of the random variable X are farther from the expected value 
u than a multiple of the standard deviation (A large). 


16.2.2.4 Multidimensional Random Variable 


If the elementary events mean that n random variables X4,..., X, take n real values z1,. .., r4, then 
a random vector X = (X1, Xa, ..., Xn) is defined (see also random vector, 16.3.1.1, 4., p. 830). The 
corresponding distribution function is defined by 


F(z1,...,2,) = P(Xy σπιν... Xu < 2s). (16.59) 
The random vector is called continuous if there is a function f(t1,...,£,) such that 
Tl Xn 
F(zi,...,24) = / n I f(ti,..-,tn) dtr... dt, (16.60) 
—oo —oo 
holds. The function f(t,...,t,) is called the density function. It is non-negative. If some of the 
variables z1,...,x, tend to infinity, then one gets the so-called marginal distributions. Further inves- 
tigations and examples can be found in the literature. 
The random variables X,..., X, are independent random variables if 
F(zi...,2,) = Fi(xi)Fo(z23)...Fu(ua), Jeeta) = falt) a fu (ts). (16.61) 


16.2.3 Discrete Distributions 


1. Two-Stage Population and Urn Model 

Suppose one has a two-stage population with elements, i.e., the population to be considered has two 
classes of elements. One class has M elements with a property A, the other one has N — M elements 
which does not have the property A. If investigating the probabilities P(A) = p and P(A) = 1 — p for 
randomly chosen elements, then one has to distinguish between two cases: When selecting n elements 
one after the other, either the previously selected element is replaced before selecting the next one, or 
it is not. The selected n elements, which contain k elements with the property A, is called the sample, 
n being the size of the sample. 'This can be illustrated by the urn model. 

2. Urn Model 

Suppose there are a lot of black balls and white balls in a container. The question is: What is the 
probability that among n randomly selected balls there are k black ones. If putting every chosen ball 
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back into the container after the determination of its color, then the number k of black ones among the 
chosen n balls has a binomial distribution. If one does not put back the chosen balls and n « M and 
n € N — M,then the number of black ones has a hypergeometric distribution. 


16.2.3.1 Binomial Distribution 

Suppose in an experiment only the two events A and A are possible and the experiment is repeated 
n times and the accompanying probabilities are P(A) = p and P(A) = 1 — p every time, then the 
probability that A takes place exactly k times is 


μη) = (ο να —pyr* (k=0,1,2,...,n). (16.62) 
For every choice of an independent element from the population, the probabilities are 
M _, N-M 
P(A)= 37 =P, PA=- loa (16.63) 
Pi i 


'The probability of getting an element with property A for the first k choices, then an element with 
the remaining property A for the n — k choices is p*(1 — p)"~*, because the results of choices are 
independent of each other. The sequence of the choices plays no role because the combinations have 
the same probability independently of the order of the choices, and these events are mutually exclusive, 
so one adds 


n n! 
(;) ~ Kn =k)! (16.64) 


equal numbers to get the required probability. 

A random variable X,, for which P(X, = k) = W} (k) holds, is called binomially distributed with 
parameters n and p. 

1. Expected Value and Variance 


E(X,) ==n-p, (16.65a) D?(X,) = σ᾽ =n- p(1— p). (16.65b) 


2. Approximation of the Binomial Distribution by the Normal Distribution 
If Xn has a binomial distribution, then 
x 5 


: X, = E(X,) 1 =t" 
eae πω 65ο 
πι ο y mel exp ( 5 ) dt (16.65c) 


This means that, if n is large, the binomial distribution can be well approximated by a normal distri- 
bution (see 16.2.4.1, p. 818) with parameters μχ = E(X,) and o? = D?(X,), if p or 1 — p are not too 
small. The approximation is the more accurate the closer p is to 0.5 and the larger n is, but acceptable if 
np 4and n(1— p) > 4 hold. For very small p or 1 — p, the approximation by the Poisson distribution 
(see (16.68) in 16.2.3.3) is uscful. 

3. Recursion Formula 


'The following recursion formula is recommended for practical calculations with the binomial distribu- 
tion: 


n—k Dom 
k+1 q P 
4. Sum of Binomially Distributed Random Variables 


If Xn and X,, are both binomially distributed random variables with parameters n, p and m, p, then 
the random variable X = X, + X,, is also binomially distributed with parameters n + m, p. 


Fig. 16.2a,b,c represents the distributions of three binomially distributed random variables with pa- 
rameters n — 5; p — 0.5, 0.25, and 0.1. Since the binomial coefficients are symmetric, the distribution 
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is symmetric for p = q = 0.5, and the farther p is from 0.5 the less symmetric the distribution is. 
16.2.3.2 Hypergeometric Distribution 


Just as with the binomial distribution, a two-stage population with N elements is considered, i.e., the 
population has two classes of elements. One class has M elements with a property A, the other one has 
N — M elements which does not have the property A. In contrast to the case of binomial distribution, 
the chosen ball of the urn model is not replaced before choosing the next one. 

The probability that among the n chosen balls there are k black ones is 


ο B E 
P(X =k) = Wf, y (k) = HUE with (16.662) 


0<k<n, kx M, n—kzN M. (16.66b) 


If also n € M and n € N — M hold, then the random variable X with the distribution (16.66a) is 
called hypergeometrically distributed. 


A A 
0.7 0.7 | 07 + 
0.6 0.6 4 0.6 4 
0.5 _ 05- 7 0.5 E 
0.4 peo 04 -| p=0.20 0.4 p=0.10 
0.3 0.3 - 0.3 
0.2 0.2 02 
0.1 0.1 0.1 
0.0 0.0 0.0 
012345 k 012345 k 012345 k 


Figure 16.2 


1. Expected Value and Variance of the Hypergeometric Distribution 


fd $ E 3 
PES FIRM PASE NE d (16.672) 


=n 


k=0 Ν᾽ 
> 


c? = D'(X) = E(X?) - [E(X)? -Σ k2 (i) pus (ως) 


-. G) 


Mf, MYN-n u 
EB ( x uer (16.67b) 


2. Recursion Formula 


Wy w(k--1)— (n — k)(M — k) 


(k--1(N—-M-n-ck-1) 


Wn, x(k). (16.679) 


In Fig. 16.3a,b,c three hypergeometric distributions are represented for the cases Ν = 100, M = 50, 
25 and 10, for n = 5. These cases correspond to the cases p = 0.5; 0.25, and 0.1 of Fig. 16.2a,b,c. 
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There is no significant difference between the binomial and hypergeometric distributions in these ex- 


amples. If also M and N — M are much larger than n, then the hypergeometric distribution can be 
well approximated by a binomial one with parameters as in (16.63). 
A 
0.7 4 0.7 0.7 
0.6 η 0.6 0.6 
0.5 η 0.5 0.5 
0.3 0.3 0.3 
0.2 η 0.2 0.2 
0:1 - 0.1 0.1 
0.0 0.0 j 
012345 k 012345 k Ti? eas k 


a) b) c) 
Figure 16.3 


16.2.3.3 Poisson Distribution 
If the possible values of a random variable X are the non-negative integers with probabilities 
X 
P(X =k)= ae (k =0,1,2,...; A » 0), (16.68) 
then it has a Poisson distribution with parameter A. 


1. Expected Value and Variance of the Poisson Distribution 


E(X) =), (10.094) D(X) =). (16.698) 


2. Sum of Independent Poisson Distributed Random Variables 


If X, and X» are independent Poisson distributed random variables with parameters λι and As, then 
the random variable X = X, + X» also has a Poisson distribution with parameter A = A, + A». 


3. Recursion Formula 


P(X =k+1)= E P(X =k)). (16.69c) 


+1 


4. Connection between Poisson and Binomial Distribution 


The Poisson distribution can be obtained as a limit of binomial distributions with parameters n and p 
if n — oo, and p (p — 0) changes with n so that np = A = const, i.e., the Poisson distribution is a 
good approximation for a binomial distribution for large n and small p with A = np. In practice, one 
uses it if p < 0.08 and n > 1500p hold, because the calculations are easier with a Poisson distribution. 
Table 21.16, p. 1131, contains numerical values for the Poisson distribution. Fig. 16.4a,b,c repre- 
sents three Poisson distributions with A = np = 2.5, 1.25 and 0.5, i.e., with parameters corresponding 
to Figs. 16.2 and 16.3. 


5. Application 


'The number of independently occurring point-like discontinuities in a continuous medium can usually 
be described by a Poisson distribution, e.g., number of clients arriving in a store during a certain time 
interval; number of misprints in a book, the rate of radioactive decay, etc. 
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A 

0.74 0.7 0.7 

0.64 0.6 0.6 

0.54 0.5 0.5 

0.4- 2 0.4 1=1.25 0.4 

λΞ2.5 = 

0.3 4 0.3 0.3 ἘΠ 

0.2 0.2 0.2 

0.14 0.1 0.1 

0 0.0 0.0 
012345 k 012345 k 01234 k 
a) b) c) 
Figure 16.4 

16.2.4 Continuous Distributions 
16.2.4.1 Normal Distribution 
1. Distribution Function and Density Function 
A random variable X has a normal distribution if its distribution function is 

PU «gj- Fg. / SP a (16.70a) 

z)-— r)= e 20? dt. (Ua 
a ov2x J. 


Then it is also called a normal variable, and the distribution is called a (1, o?) normal distribution. 'The 
function 
κος LL Esel (16.70b) 
t) e 2c ).70b 
σν2π 
is the density function of the normal distribution. It takes its maximum at t = p and it has inflection 
points at p + ø (see (2.59), p. 73, and Fig. 16.5a). 


p(t) 
(x) 


=y 
=Y 


u 


0|u-c nmm 
a) b) 
Figure 16.5 

2. Expected Value and Variance 


The parameters jz and σ᾽ of the normal distribution are its expected value and variance, respectively, 
ie., 


1 MCZDM 
E(X) = / ve 36? dx — u, (16.71a) 


σνπ . 
--οο 


+00 2 
3 (α-μ)᾽ 


5 /α ue 39 dx = 0°. (16.71b) 
σνρπ. 


D'(X) -εα-μ]- 
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If the normal random variables X; and X» are independent with parameters μι, σι and μα, σα, resp., 
then the random variable X = kj X1 + k2Xə (ky, ko real constants) also has a normal distribution with 


parameters u = kı μη 4 zu o =V ko + ko? 


i— 
By the substitution 7 = in (16.70a), the calculation of the values of the distribution function 


of any normal distribution is reduced to the calculation of the values of the distribution function of 
the (0,1) normal distribution, which is called the standard normal distribution. Consequently, the 
probability P(a < X < b) of a normal variable can be expressed by the distribution function @(2) of 
the standard normal distribution: 


P(ax X <b) ο 8) ο(---). (16.72) 


σ 


16.2.4.2 Standard Normal Distribution, Gaussian Error Function 


1. Distribution Function and Density Function 
From (16.70a) with u = 0 and c? = 1 follows the distribution function 


P(X <2) = Φ(α) = zm. je oie fo dt (16.73a) 


of the so-called standard normal distribution. Its density function is 


| 
"|n 


1 
(t) --κ--ε 16.73b 
ol) = -Ῥ- (16.736) 
it is called the Gaussian error curve (Fig. 16.5b). 
The values of the distribution function $(x) of the (0, 1) normal distribution are given in Table 21.17, 
p. 1133. Only the values for the positive arguments x are given, while the values for the negative 
arguments can be got from the relation 


Φ(--α) = 1 — Φ(α). (16.74) 
2. Probability Integral 
The integral (x) is also called the probability integral or Gaussian error integral. In the literature the 
functions Φρ(α) and erf (x) are sometimes denoted as error integral with the following definitions: 


1 
p(x — 5+ (16.75a) αξία EL oP dt = 2. Φον σα). (16.75b) 


== fe -5 dt = Φ(α) 
With erf the error function is denoted. 

16.2.4.3 Logarithmic Normal Distribution 
1. Density Function and Distribution Function 


The continuous random variable X has a logarithmic normal distribution, or lognormal distribution 
with parameters jz, and c2 if it can take all positive values, and if the random variable Y, defined by 
= log X, (16.76) 
has à normal distribution with expected value juz, and variance of (see also remark b) on page 820). 
Consequently, the random variable X has the density function 
0 for t <0, 


E oe TN 2 
f) = loge “ο (logt e) for 130, 


m (16.772) 
tory 2π 202 
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and the distribution function 


0 for a <0, 
log x ð 
F(x) = 1 I f (t — με)’ M" (16.77b) 
Es d. exp 203 dt for x90. 


In practical applications either the natural or the decimal logarithm is used. 

2. Expected Value and Variance 

Using the natural logarithm one gets the expected value and the variance of the lognormal distribution 
as: 

92 

u = exp (m + i) g^c (exp o? — 1) exp (2u; + ej). (16.78) 
3. Remarks 
a) The density function of the lognormal distribution is continuous everywhere and it has positive 
values only for positive arguments. Fig. 16.6 shows the density functions of lognormal distributions 
for different jjj and σι. Here has been used the natural logarithm. 

b) Here the values uz and o? are not the expected value and variance of the lognormal random variable 
itself, but of the variable Y = log X, while p and σ᾽ are in conformity with (16.78) expected value and 
variance of the random variable X. 


c) The values of the distribution function F(x) of the lognormal distribution can be calculated by the 
distribution function d (x) of the standard normal distribution (see (16.73a)), in the following way: 


loga — u 

F(x) =@ (sm) (16.79) 
σι, 

d) The lognormal distribution is often applied in lifetime analysis of economical, technical, and biolog- 

ical processes. 

e) The normal distribution can be used in additive superposition of a large number of independent ran- 


dom variables, and the lognormal distribution is used for multiplicative superposition of a large number 
of independent random variables. 


(01 


1.5 


w=ln 0.5 ; σι-0.5 
10 


0.5 Uu -0;o;-1 
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Figure 16.6 Figure 16.7 


16.2.4.4 Exponential Distribution 
1. Density Function and Distribution Function 


A continuous random variable X has an exponential distribution with parameter A (À > 0) if its density 
function is (Fig. 16.7) 


16.2 Probability Theory 821 


0 fo t«0 
145 { Ae for t20, (16.80a) 


consequently, the distribution function is 


f 0 for x «0, 3 
F(x) = / f(t)dt = { l-e% fo 250, (16.80b) 


2. Expected Value and Variance 
1 > 1 


=S ge (16.81) 


Usually, the following quantities are described by an exponential distribution: Length of phone calls, 
lifetime of radioactive particles, working time of a machine between two stops in certain processes, 
lifetime of light-bulbs or certain building elements. 


16.2.4.5 Weibull Distribution 


1. Density Function and Distribution Function 


The continuous random variable X has a Weibull distribution with parameters a and 6 (a > 0, 8 > 0), 
if its density function is 


0 for t«0, 
ft-12a(tV^ a (16.82a) 
ic ο oxp |— [| — r > 
δι exp B for 120 
and so its distribution function is 
0 for «<0, 
Ρα) -- 4 1 _ exp |- (5) ] ας ο (16.828) 
B 
2. Expected Value and Variance 
1 r ; 2 , 1 
ΜΕ (112). σῦ -- ϱ Ir (1 ! ) r? (1 Jl (16.83) 
Q α α 


Here I(x) denotes the gamma function (see 8.2.5, 6., p. 514): 
Γ(α) = / £-lecdt for α»0. (16.84) 
o 


In (16.82a), α is the shape parameter and £ is the scale parameter (Fig. 16.8, Fig. 16.9). 
Remarks: 


a) The Weibull distribution becomes an exponential distribution for a = 1 with A = 7 


b) The Weibull distribution also has a three-parameter form by introducing a position parameter y. 
Then the distribution function is: 


F(x) 21— exp |- (z 7 20 (16.85) 
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c) The Weibull distribution is especially useful in life expectancy theory, because, e.g., it describes the 
functional lifetime of building elements with great flexibility. 


16.2.4.6 x? (Chi-Square) Distribution 
1. Density Function and Distribution Function 


Let Xi, Xo,..., X, be n independent (0, 1) normal random variables. Then the distribution of the 
random variable 
χ᾽ -- Χι) - Χο’ t Xn (16.86) 


is called the X? distribution with n degrees of freedom. Its distribution function is denoted by Fj»(x), 
and the corresponding density function by f,2(t). 


no] 5 
12^ "e 3 for (t > 0) 


felt) 2 ὁ mer (5) (16.87a) 
0 fort <0. 
ri n t 
Fa(r) = P(x? E α : t27 leidt (x0) (16.87b) 


2. Expected Value and Variance 
E(x’) =n, (16.88a) D?(x?) = 2n. (16.88b) 


3. Sum of Independent Random Variables 
If X, and X» are independent random variables both having a X? distribution with n and m degrees of 
freedom, then the random variable X = X, + Χο has a x? distribution with n + m degrees of freedom. 


4. Sum of Independent Normal Random Variables 
If X4, Xo...., Xn are independent, (0, σ) normal random variables, then 
LE 1 t 
X = yx? has the density function f(t) = — fie (3) ; (16.89) 
i—l σ σ 
Du eos : : n nt 
X —--MX has the density function f(t) = fy (5) , (16.90) 
n £4 c? σ 


pe ei 2t P 
X= -5 Xj? παρ the density function f(t) = Shy (5) 9 (16.91) 
σα’; 
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5. Quantile 


For the quantile (see 16.2.2.2, 3., p. 812) y2.,,, of the x? distribution with m degrees of freedom (Fig. 
16.10), 


P(X > x24) = a. (16.92) 


Quantiles of the x? distribution can be found in Table 21.18, p. 1135. 


felt) f,(t) 
a a 
2 X > 
0 one t 0 Luo t 
Figure 16.10 Figure 16.11 


16.2.4.7 Fisher F Distribution 


1. Density Function and Distribution Function 


If X, and X» are independent random variables both having X? distribution with mı and ma degrees 
of freedom, then the distribution of the random variable 


Xx 
Pie — "n = (16.93) 
mı! m» 


is a Fisher distribution or F distribution with m1, mz degrees of freedom. The density function is 


m m x 
du ME NÉ I e + 9 ) tx -1 - Sô 
m m or : 
fr(t) = 2 2 r(@)r() (a (16.94a) 
2 2 2 2 
0 for t <0. 


Fora € 0 holds Εε(α) = P(F,,,,, € £) — 0, for x > 0: 


Fp(x) = P( Fiia < w) 


mil] 
(: 2 ) dt 
My m2 (16.94b) 


m ms SAT. ma 
JOHOLCN E 


2. Expected Value and Variance 


mj | m» a 
= Snow Γη κ) j 


2 2 


mao 2m2? (mı + ma — 2) 


ma(ms — 2)? (m — 4) 


EU ms) στο (16.95a) De as) = (16.95b) 


πιο — 2 
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3. Quantile 
The quantiles (see 16.2.2.2, 3., p. 812) tammy, of the Fisher distribution (Fig. 16.11) can be found in 
Table 21.19, p. 1136. 


16.2.4.8 Student { Distribution 


1. Density Function and Distribution Function 
If X is a (0,1) normal random variable and Y is a random variable independent from X and it has a 
x? distribution with m = n — 1 degrees of freedom, then the distribution of the random variable 
X 
T -——— (16.96) 
y Y/m 

is called a Student t distribution or t distribution with m degrees of freedom. The distribution function 
is denoted by Fs(x), and the corresponding density function by fs(t). 


(= + 5) 
1 9 1 
y= 16.97a 
fs( VR ο. g me (16.97a) 
2 (: EN 
ΤΊ, 
. (5 + 3 . 
f 1 2 f dt : 
Fs(z) = P(T < 2) =l f(t) dt = ης : ση 1 mE (16.97b) 
f) 
α 
> > 
οἱ to, m t t 
a) ᾽ 
Figure 16.12 
sl 2. Expected Value and Variance 
E(T)-0 (m1) (16.984) D?(T) = -- (m > 2). (16.98b) 


3. Quantile 
The quantiles ta,m and ta/2,m of the t distribution (Fig. 16.12a,b), for which 


P(T > tam) =a (16.99a) or P(|T| > ta/2m) = a (16.99b) 


holds, are given in Table 21.20, p. 1138. 

The Student t distribution, introduced by Gosset under the name Student, is used in the case of sam- 
ples with small sample size n, when only estimations can be given for the mean and for the standard 
deviation. The standard deviation (16.98b) no longer depends on the deviation of the population from 
where the sample is taken. 
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16.2.5 Law of Large Numbers, Limit Theorems 

The law of large numbers gives a relation between the probability P(A) of a random event A and its 
relative frequency n4/n with a large number of repeated experiments. 

1. Law of Large Numbers of Bernoulli 

The following inequality holds for arbitrary given numbers € > 0 and 7 > 0 


1 
m P(A) < e) >1-n, (16.100a) if n> Ten (16.100b) 


For other similar theorems see [16.5]. 

E How many times should a not necessarily fair die be rolled if the relative frequency of the 6 should 
be closer to its probability than 0.1 with a probability of at least 95 % ? 

Now, € = 0.01 and η = 0.05, so 4ε7η = 2- 1077, and according to the law of large numbers of Bernoulli 
n > 5-10* must hold. This is an extremely large number, which can be reduced, if the distribution 
function is known (see [16.12]). 


2. Central Limit Theorem of Lindeberg-Levy 


If the independent random variables X;,..., Xn all have the same distribution with an expected value 
μ and a variance c?, then the distribution of the random variable 


AEN i=l 
Ya =a (16.101) 


tends to the (0, 1) normal distribution for n — oo, i.e., for its distribution function F, (y) follows 


y 2 
. 1 .& 
jim Pay) = = / eT dt. (16.102) 
—oo 

If n > 30 holds, then F;(y) can be replaced by the (0, 1) normal distribution. Further limit theorems 
can be found in [16.5], [16.7]. 

W Givenasample of 100 items from a production of resistors. It is supposed that their actual resistance 
values are independent and they have the same distribution with deviation c? = 150. The mean value 
for these 100 resistors is T = 10500. In which domain is the true expected value μ with a probability 
of 99 96 ? 
Looking for an € such that P(|X — u| € £) = 0.99 holds. Supposing (see (16.101)) that the random 


ERE has a (0.1) normal distribution. From P(|Y| < A) = P(-A < Y < å) = P(Y < 
A) — P(Y < —2), and from P(Y € —3) = 1 — P(Y € X) it follows that P(Y| € A) = 2P(Y < 
à) --1-- 0.99. 
So, P(Y € A) = Φ(λ) = 0.995 and from Table 21.17, p. 1133, one gets λ = 2.58. Since σ/ν 100 = 
1.225 there is with a 99 % probability: |1050 — u| < 2.58 - 1.225, i.e., 1046.89 < u < 1053.20. 


variable Y — 


16.2.6 Stochastic Processes and Stochastic Chains 


Many processes occurring in nature and those being studied in engineering and economics can be real- 
istically described only by time-dependent random variables. 

W The electric consumption of a city at a certain time { has a random fluctuation that is dependent 
on the actual demand of the households and industry. The electric consumption can be considered 
as a continuous random variable X. When the observation time t changes, electric consumption is a 
continuous random variable at every moment, so it is a function of time. 


The stochastic analysis of time-dependent random variables leads to the concept of stochastic pro- 
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cesses, which has a huge literature of its own (see, e.g., [16.7], [16.9]). Some introductory notions will 
be given next. 


16.2.6.1 Basic Notions, Markov Chains 


1. Stochastic Processes 
A set of random variables depending on one parameter is called a stochastic process. The parameter, 
in general, can be considered as time f, so the random variable can be denoted by X, and the stochastic 
process is given by the set 
{X,|t e T). (16.103) 
The set of parameter values is called the parameter space T', the set of values of the random variables 
is the state space Z. 
2. Stochastic Chains 
If both the parameter space and the state space are discrete, i.e., the state variable X, and the param- 
eter { can have only finite or countably infinite different values, then the stochastic process is called a 
stochastic chain. In this case the different states and different parameter values can be numbered: 


Z-—113,..., tO 14: (16.104) 
PS isee m πο Wi OR tp <i X nS ορ πα (16.105) 
The times to, t1,... are not necessary equally spaced. 


3. Markov Chains, Transition Probabilities 
If the probability of the different values of X;,,,, in a stochastic process depends only on the state at 
time tm, then the process is called a Markov chain. The Markov property is defined precisely by the 
requirement that 

PG a = maa | Xto = io, Xt, = ih, ttt Ain = im) = Pl Kiwa = ΠΡΌ = im) 


for all m € {0,1,2,...} and for all ig, ii, ... ii € Z. (16.106) 
Consider a Markov chain and times tm and tm+1. The conditional probabilities 
Ρ(ΧΙ μι = j|X,, =i) = Pig (tm, tm+1) (16.107) 


are called the transition probabilities of the chain. The transition probability determines the probability 
by which the system changes from the state X;,, = i at tm into the state X;,,,, = j at £i. 


tm 
If the state space of a Markov chain is finite, i.e., Z = (1,2,..., N}, then the transition probabilities 
Dij (ti, t2) between the states at times tı and tz can be represented by a quadratic matrix P(t), t2), by 
the so-called transition matrix: 


Ῥη(ἑι»19)  pio(ti,t2) ... mx (tuta) 
pai(t ta) poo(ti,te) .-. pou (ti, t2) 
P(t, t2) = . (16.108) 
Pyi(ti,te) puo(ti ο). --.. pwn(th, t2) 
The times tı and {0 are not necessarily consecutive. 
4. Time-Homogeneous (Stationary) Markov Chains 
If the transition probabilities of a Markov chain (16.107) do not depend on time, i.e., 
Dij (tm, ἔπη) = Pij, (16.109) 
then the Markov chain is called time-homogeneous or stationary. A stationary Markov chain with a 
finite state space Z = {1,2,..., N} has the transition matrix 
Pu Pi --- DN 
P21 po -... P2N 
P=| παν (16.110a) 


DNi DN2 --- PNN 
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where 
a) pij 2 0 for alli, j and (16.110b) 


N 

b) X` py =1 forall i. (16.110c) 

151 

Being independent of time p;; gives the transition probability from the state i into the state j during 
time unit. 
W The number of busy lines in a telephone exchange can be modeled by a stationary Markov chain. 
For the sake of simplicity it is supposed that there are only two lines. Hence, the states are i = 0,1,2. 
Let the time unit be, e.g., 1 minute. Suppose the transition matrix pi; is: 


0.7 0.3 0.0 

(pj) = | 0.2 0.5 0.3 (à, j = 0,1,2). 
0.1 0.4 0.5 

In the matrix (pij) the first row corresponds to the state i = 0. The matrix element μι» = 0,3 (second 

row, third column) shows the probability that two lines are busy at time tm given that one was busy at 

tm—1- 

Remark: Every quadratic matrix P = (pij) of size N x N satisfying the properties (16.110b) and 

(16.110c) is called a stochastic matrix. Their row vectors are called stochastic vectors. 

Although the transition probabilities of a stationary Markov chain do not depend on time, the distri- 

bution of the random variable X, is given at a given time by the probabilities 


N 
P(X, =i) =pi(t) (@=1,2,...,N) (16.111a) with S~p,(t) 21 (16.111b) 
i=1 
since the process is in one of the states with probability one at any time t. 
5. Probability Vector and Transition Matrix 
Probabilities (16.111a) can be written in the form of a probability vector 


P = (pi(t) po(t),---, p (0). (16.112) 
The probability vector p is a stochastic vector. It determines the distribution of the states of a station- 
ary Markov chain at time period t. Let the transition matrix P of a stationary Markov chain be given 
(according to (16.110a,b,c)). Starting with the probability distribution at time period t determine the 
probability distribution at t + 1, that is, calculate p(t + 1) from P and p(t): 


p(t+1)=p(t)-P (16.113) 
and furthermore 

p(t +k) = p(t): P*. (16.114) 
Remarks: 
1. For t = 0 it follows from (16.114) that 

p(k) = p(0)P*, (16.115) 


that is, a stationary Markov chain is uniquely determined by the initial distribution p(0) and the tran- 
sition matrix P. 

2. If matrices A and B are stochastic matrices, then C = AB is a stochastic matrix, as well. Conse- 
quently, if P is a stochastic matrix, then the powers P" are also stochastic matrices. 

W A particle changes its position (state) X,(1 € x < 5) along a line in time periods t = 1,2,3,... 
according to the following rules: 

a) If the particle is at x — 2,3,4, then it moves to the right by a unit during the next time unit with 
probability p — 0.6 and to the left with probability 1 — p — 0.4. 
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b) At points z = 1 and x = 5 the particle is absorbed, i.e., it stays there with probability 1. 

c) At time t = 0 the position of the particle is x = 2. 

Determine the probability distribution p(3) at time period t — 3. 

By (16.115) the probability distribution p(3) = p(0) P? holds with p(0) = (0, 1,0, 0, 0) and with the 


transition matrix 


1 0 0 0 0 1 0 0 0 0 
04 0 06 0 0 0.496 0 02388 0 0216 
P-|0 04 0 06 0 |. Hence P= | 0.160 0.192 0 0.288 0.360 
0 0 04 0 06 0064 0 0192 0 0.744 
0 0 0 ο 1 0 0 0 0 1 


and finally p(3) = (0.496; 0; 0.288; 0; 0.216) . 
16.2.6.2 Poisson Process 


1. The Poisson Process 

In the case of a stochastic chain both the state space Z and the parameter space T are discrete, that 
is, the stochastic process is observed only at discrete time periods to, t1, t2,... . Now, there is studied a 
process with continuous parameter space T, and it is called a Poisson process. 

1. Mathematical Formulation of the Poisson Process The following assumptions are made for 
the mathematical formulation of the Poisson process: 

a) Let the random variable X, be the number of signals in the time interval [0,t); 

b) Let the probability px (t) = P(X, = x) be the probability of x signals during the time interval [0, t). 
Additionally, the following assumptions are required, which hold in the process of radioactive decay 
and many other random processes (at least approximately): 

c) The probability P(X; = x) of x signals in a time interval of length t depends only on x and t, and 
does not depend on the position of the time interval on the time axis. 

d) The numbers of signals in disjoint time intervals are independent random variables. 

e) The probability to get at least one signal in a very short interval of length At is approximately 
proportional to this length. The proportionality factor is denoted by A (A > 0). 

2. Distribution Function By properties a)-e) the distribution of the random variable X, is deter- 
mined. One gets: 


(ty g^ 


Ρ(Χι-τ) = a ; 


(16.116) 


where u = At is the expected value and σ᾽ = At the variance. 

3. Remarks 

1. From (16.116) the Poisson distribution follows as a special case for t — 1 (see 16.2.3.3, p.817). 

2. To interpret the parameter A or to estimate its value from observed data the following properties 
are useful: 

e Aisthe average number of signals during a time unit, 


ezis the average distance (in time) between two signals in a Poisson process. 


3. The Poisson process can be interpreted as the random motion of a particle in the state space Z = 
{0,1,2,...}. The particle starts in the state 0, and at every sign it jumps from state i into the next 
state i + 1. Furthermore, for a small interval At the transition probability p;;,; from state i into the 
state i + 1 should be: 

Digi © AAt. (16.117) 
À is called the transition rate. 
4. Examples of Poisson Processes 


16.2 Probability Theory 829 


E Radioactive decay is a typical example of a Poisson process: The number of decays (signals) are 


registered with a counter and 


marked on the time axis. The observation interval should be relatively 


small with respect to the half-period of the radiating matter. 


W Consider the number of calls registered in a telephone exchange until time t and calculate, e.g., the 


probability that at most x cal 


of calls during a time unit is A. 


s are registered until time t with the assumption that the average number 


W In reliability testing, the number of failures of a reparable system is counted during a period of duty. 


W Queuing theory considers 
a booking office or to a gasoli 


che number of customers arriving at the counter of a department store, to 
ne station. 


2. Birth and Death Processes 


One of the generalizations of t 


he Poisson process is the assumption that the transition rate A; in (16.117) 


depends on the state 7. Anot 


jer generalization is the transition from state 7 into state i — 1 is allowed. 


The corresponding transition rate is denoted by ju. The state i can be considered, e.g., as the number of 
individuals in a population. It increases by one at transition from state i into state i+ 1, and decreases 
by one at transition from i into i — 1. These stochastic processes are called birth and death processes. 
Let p(X; = i) = p;(t) be the probability that the process is in state i at time t. Analogously to the 
Poisson process for the transition probability holds: 


from i— 1 into 4: pii; ££ A; AM, 
from i+] into i: pia 5 paa AM, (16.118) 
from i into i: pi; 55 l— (Ai μι) Δί. 


Remark: The Poisson process is a pure birth process with a constant transition rate. 

3. Queuing 

The simplest queuing system is considered as a counter where customers are served one by one in the 
order of their arrival time. The waiting room is sufficiently large, so no one needs to leave because 
it becomes full. The customers arrive according to a Poisson process, that is, the inter-arrival time 
between two clients is exponentially distributed with parameter A, and these inter-arrival times are 
independent. In many cases also the serving time has an exponential distribution with parameter μ. 
The parameters A and u have the following meanings: 

9 


À: average number of arrivals per time unit, 


P X average inter-arrival time, 


e u: average number of servec 


1 ; : 
—: average serving time. 
H 


Remarks: 


clients per time unit, 


1. Ifthe number of clients standing in the queue is considered as the state of this stochastic process, then 
the above simple queuing model is a birth and death process with constant birth rate \ and constant 
death rate μ. 

2. The above queuing model can be modified and generalized in many different ways, e.g., there can be 
several counters where the clients are served and/or the arrival times and serving times follow different 
distributions (see [16.9], [16.19]). 
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16.3 Mathematical Statistics 

Mathematical statistics provides an application of probability theory for given mass phenomena. Its 
theorems allow to make statements with certain probability about properties of given sets, which state- 
ments are based on the results of experiments whose number should be kept low for economical reasons. 


16.3.1 Statistic Function or Sample Function 
16.3.1.1 Population, Sample, Random Vector 
1. Population 


The Population is the set of all elements of interest in a particular study. Any set of things having the 
same property in a certain sense can be considered, e.g., every article of a certain production process or 
all the values of a measuring sequence occurring in a permanent repetition of an experiment. The num- 
ber N of the elements of a population can be very large, even practically infinite. The word population 
is often used to denote also the set of numerical values assigned to the elements. 

2. Sample 

In order not to check the total population about the considered property, data are collected only from 
a subset, from a so-called sample of size n (n < N). Talking about a random choice means that every 
element of the population has the same chance of being chosen. A random sample of size n from a 
finite population of size N is a sample selected such that each possible sample of size n has the same 
probability of being selected. A random sample from an infinite population is a sample selected such 
that each element is selected independently. The random choice can be made by mixing, blind taking 
out or by so-called random numbers. The word sample is used for the set of values assigned to the 
selected elements. 

3. Random Choice with Random Numbers 

It often happens that a random selection is physically impossible on the spot, e.g., in the case of piled 
material, like concrete stabs. Then random numbers are applied for a random selection (see Table 
21.21, p. 1139). 

Most calculators can generate uniformly distributed random numbers from the interval [0, 1]. Pressing 
the key RAN yields a number between 0.00...0 and 0.99...9. The digits after the decimal point form 
a sequence of random numbers. 


Random numbers are often taken from tables. Two-digit random numbers are given in Table 21.21, 
p. 1139. If larger ones are necessary, then one can compose several-digit numbers by writing them after 
each other. 

W A random sample is to be examined from a transport of 70 piled pipes. The sample size is supposed 
to be 10. First the pipes are numbered from 00 to 69. A two-digit table of random numbers is applied 
to select the numbers. Then the way is fixed how to choose the numbers, e.g., horizontally, vertically 
or diagonally. If during this process random numbers occur repeatedly, or they are larger than 69, then 
they are simply omitted. The pipes corresponding to the chosen random numbers are the elements of 
the sample. If there is a several-digit table of random numbers, they can be decomposed into two-digit 
numbers. 


4. Random Vector 

A random variable X can be characterized by its distribution function, by its parameters, where the 
distribution function itself is determined completely by the properties of the population. These are 
unknown at the beginning of a statistical investigation, so one want to collect as much information as 
possible with the help of samples. Usually the investigation is not restricted to one sample but more 
samples are applied (with same size n if it is possible, for practical reasons). The elements of a sample 
are chosen randomly, so the realizations take their values randomly, i.e., the first value of the first 
sample is usually different from the first value of the second sample. Consequently, the first value of 
a sample is a random variable itself denoted by Χι. Analogously, the random variables Χο, X3,...Xn 
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can be introduced for the second, third,..., n-th sample values, and they are called sample variables. 
Together, they form the random vector 

X= (Xip ος Χα). (16.119a) 
Every sample of size n with elements x; can be considered as a vector 

X = (21,20,...,Zn), (16.119b) 


as a realization of the random vector. 


16.3.1.2 Statistic Function or Sample Function 


Since the samples are different from each other, their arithmetic means x are also different. They can 
be considered as realizations of a new random variable denoted by X which depends on the sample 
variables X1, Xo, ..., Xn- 


l.sample: rjj, 12, ... Zın with mean ΤΙ. 
2. sample: x91, 200, ... ro, with mean Τη. 
Μον ΚΕ Ὁ w (16.120) 
m-th sample: m1, Um2, ... Emn With mean Fp. 
The realization of the j-th sample variable in the i-th sample is denoted by σι (i = 1,2,..., m; j = 
1:2, 5: :m). 
A function of the random vector X = (Xi, X5,..., Xn) is again a random variable, and it is called a 


statistic or sample function. 'The most important sample functions are the mean, variance, median and 
range. 


1. Mean 
'The mean X of the random variables X; is: 
e j n 
X--Y Xx. (16.121a) 
TL ic 
The mean z of the sample (x1, 22,..., En) is 
1 n 
ες r (16.121b) 
TL 


It is often useful to introduce an estimate value xo in the calculations of the mean. It can be chosen 
arbitrarily but possibly close to the mean 7T. If, e.g., αι, (i = 1,2,...) are several-digit numbers in a 
long measuring sequence, and they differ only in the last few digits, it is simpler to do the calculations 
only with the smaller numbers 

Zi = Ti — TQ. (16.121c) 
Then follows 


S—god4— M z = to +Z. (16.1214) 


2. Variance 
The variance S? of the random variables X; with mean X is defined by: 


ν 1 i — 
S? = c ΣΝ: - xy. (16.1222) 
i=1 


The realization of the variance with the help of the sample (£1, £2, ... x5) is 


: p 7 9 
= Σα: - my. (16.122b) 


n—lii 
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It is proven that the estimation of the variance of the original population gives a more accurate estima- 
tion by dividing n — 1 than by dividing n. With the estimated value xo follows 


y 2° zy, Zi Σ z? — n(z — zy)? 
gd isl del . (16.122c) 
π-- 1 n—1 


n 
For xo = T the correction is zy, zi = 0 because Z = 0 holds. 
i=1 


3. Median 
Let the n elements of the sample be arranged in ascending (or descending) order. If n is odd, then the 
n+1 


median X is the value of the -th item; if n is even, then the median is the average value of the 


Sth and G + 1) -th items, the two items on the middle. 


The median £ in a particular sample (2, 2,...,%,), whose elements are arranged in ascending (or 
descending) order, is 
Sols if n — 2m 4 1, 
T= | Ὁ πα ip) og (16.123) 
2 
4. Range 
R = max X; — min X; (i8 ρα ο, (16.124a) 
The range R of a particular sample (21, £2,- . . , 2p) is 
R= Vox mu (16.124b) 
Every particular realization of a sample function is denoted by a lowercase letter, except the range R, 
i.e., for a particular sample (11, x5, ... , &n) the particular values 7, s?, Z, and R are calculated. 
i πι i τι T T; W Choosing a sample of 15 loudspeakers from a running 
production the interesting quantity X is the air gap induc- 
1 | 1.01 | 6 | 1.00} 11 1.00 tion B, measured in Tesla. From the measured data in the 
2) 102) 7 | 0.99 | 12 | 1.00 table to the left follows: 
3 | 1007 8 | L01| 13 | 1.02 T = 1.0027 or T = 1.0027 with αρ = 1.00; 
4 [098] 9 | 101 | 14 | 1.00 s? = 1.2095 - 10-4 or s? = 1.2076 - 1074 with αρ = 1.00; 
5 |0.99 | 10 | 1.00 | 15 | 1.01 č = 1.00; R = 0.04. 


16.3.2 Descriptive Statistics 


16.3.2.1 Statistical Summarization and Analysis of Given Data 

In order to describe statistically a property of a certain element this property must be characterized 
by a random variable X. Usually, the n measured or observed values x; of the property X form the 
starting point of a statistical investigation, which is made to find some parameters of the distribution 
or the distribution itself of X. 

Every measured sequence of size n can be considered as a random sample from an infinite population, if 
the experiment or the measurement could be repeated infinitely many times under the same conditions. 
Since the size n of a measuring sequence can be very large, the statistical investigation proceeds as 
follows: 

1. Protocol, Prime Notation The measured or observed values x; are recorded in a protocol list. 

2. Intervals or Classes Grouping the measured n data x; (à = 1,2,...,n) of the sample into k 
subintervals, so-called classes or class intervals of equal length or width A ; usually 10-20 classes. 
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3. Frequencies and Frequency Distribution The absolute frequencies hj (j = 1,2,..., k) are the 
numbers h; of data (occupancy number) belonging to a given interval Az;. The ratios hj /n (in %) are 
called relative frequencies. If the values h;/n are represented over the classes as rectangles, then one 
gets a graphical representation of the given frequency distribution, and this representation is called a 
histogram (Fig. 16.184). The values h;/n can be considered as the empirical values of the probabilities 
or the density function f(x). 


Table 16.3 Frequency table 4. Cumulative Frequency Adding the absolute or rel- 
= 5 ative frequencies, gives the cumulative absolute or relative 
Class hi | hi/n(%) | F; (96 frequency 
50 — 70 1 0.8 0.8 Ti hig dee Es i f . 
τι 90| 1] 08 16] Aja i% (j=1,2,...,k). (16125) 
i E ea : z d 7 : A graphical representation of the empirical distribution 
131 —150 15| 120 22.4 function, which can be considered as an approximation 
151 — {πῃ 22 17 6 40. of the unknown underlying distribution function F(x) is 
171 190 30 | 240 64. shown in Fig. 16.13b. 
191 — 210 27 21.6 85.6 W Suppose during a study n = 125 measurements have 
211 — 230 9 7.2 92.8 been performed. The results spread in the interval from 
231 — 250 6 48 97.6 50 to 270, so it is reasonable to divide this interval into 
251 — 270 3 24 100. k = 11 classes with a length h = 20. The frequency table 
sce Table 16.3. 
Ih. 1E /% 
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Figure 16.13 
16.3.2.2 Statistical Parameters 


After summarizing and analyzing the data of the sample as given in 16.3.2.1, p. 832, it follows the 
approximation of the parameters of the distribution belonging to the random variable by the following 
parameters: 


1. Mean 
Using all measured data from the sample directly, the sample mean is 
1 n 
πο αι. (16.126a) 
TL ic 


Using the means T; and frequencies h, of the classes 


14 
T=- X hj. (16.126b) 
nil 
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2. Variance 
Using all measured data directly the sample variance is 


: ij S 5 
= P αι -- T}. (16.127a) 
m— lj 
Using the means Τ; and frequencies h; of the classes 
Ι΄. 
s M hv; - zy. (16.127b) 
T= 
j-l 


The class midpoint u; (the midpoint of the corresponding interval) is also often used instead of z;. 
3. Median 


The median Ζ of a distribution is defined by 


1 

P(X «τ) - "E (16.128a. 

The median may not be a uniquely determined point. The median of a sample is 
PETS ifn — 2m 4 1, 
=< Όπιιι + Em ifn = 2m. (16.128b 
2 

4. Range 

R= Tirar — Tmin (16.129 


5. Mode or Modal Value 


is the data value that occurs with greatest frequency. It is denoted by D. 


16.3.3 Important Tests 


One of the fundamental problems of mathematical statistics is to draw conclusions about the population 
from the sample. There are two types of the most important questions: 

1. The type of the distribution is known, and one wants to get some estimate for its parameters. A 
distribution can be characterized mostly quite well by the parameters jj and c? (here p is the exact value 
of the expected value, and c? is the exact variance), consequently one of the most important questions 
is how good an estimation can be given for them, based on the samples. 

2. Some hypotheses are known about these parameters, and it is desirable to check if they are true. 
The most often occurring questions are: 

a) Is the expected value equal to a given number or not? 

b) Are the expected values for two populations equal or not? 

c) Does the distribution of the random variable with jz and c? fit a given distribution or not? etc. 
Because in observations and measurements, the normal distribution has a very important role, it is to 
be discussed the goodness of a fit test for a normal distribution. The basic idea can be used for other 
distributions, too. 


16.3.3.1 Goodness of Fit Test for a Normal Distribution 

There are different tests in mathematical statistics to decide if the data of a sample come from a normal 
distribution. Here a graphical one is discussed based on normal probability paper , and a numerical 
one based on the use of the chi-square distribution (“x° test"). 

1. Goodness of Fit Test with Probability Paper 

a) Principle of Probability Paper The z-axis in a right-angled coordinate system is scaled equidis- 
tantly, while the y-axis is scaled on the following way: It is divided equidistantly with respect to Z, but 
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scaled by 
y = (z) = — | e 2 dt. (16.130) 


If a random variable X has a normal distribution with expected value jz and variance σ᾽, then for its 
distribution function (see 16.2.4.2, p. 819) 


F(a) =6 ς - £) = &(2) (16.131a) 
holds, i.e., (16.131c) 
zat (16.131b) 
σ 


must be valid, and so there is a linear relation between x and z and (16.131c). 


b) Application of Probability Paper 


Considering the data of the sample, calculat- 

Me r ἑ d : . A A Ay 

ing the cumulative relative frequencies accord- z (0 

ing to (16.125), and sketch these onto the proba- 

bility paper as the ordinates of the points with 3 99.86 L-—-i—-—— t 1-334 7 
abscissae the upper class boundaries. If these 2 97.72, == 
points are approximately on a straight line, 1 8413 ++ + >» Lr 
(with small deviations) then the random variable 0 50.00 AK -.. 
can be considered as a normal random variable ` H a 

(Fig. 16.14). 1 1587 το aa aca a 
As it can be seen from Fig. 16.14, the distribu- -2 2.28 7-1-4} pp 
tion to which the data of Table 16.3 belong, can -3 014» oe eee ee ae emer ac 

be considered as a normal distribution. Further- ! ! 

more one can see that μ zz 176, σ 56 37.5 (from | | 
the x values belonging to the 0 and +1 values of 70 110 150 190 230 270 x 
2). μ-σ u peo 

Remark: The values F; of the relative cumula- 

tive frequencies can be plotted more easily on the Figure 16.14 


probability paper, if its scaling is equidistant with respect to y, which means a non-equidistant scaling 
for the ordinates. 

2. x? test for Goodness of Fit 

It is to check if a random variable X can be considered as normal in the sense of 16.2.4.2, p. 819. The 
range of X is divided into k classes and the upper limit of the j-th (j = 1,2,..., k) class is denoted by 
£j . Let pj be the “theoretical” probability that X is in the j-th class, i.e., 


p; = F(E) - F(-1), (16.1322) 
where F(x) is the distribution function of X (j = 1,2,..., k ; & is the lower limit of the first class with 
F'(&) = 0). Because X is supposed to be normal, then 

F(&) — (=) (16.132b) 

o 


must hold, where (x) is the distribution function of the standard normal distribution (see 16.2.4.2, 
p. 819). The parameters µ and σ᾽ of the population are usually not known. Therefore z and s? are to 
be used as an approximation of them. The decomposition of the range of X should be made so that the 
expected frequencies for every class should exceed 5, i.e., if the size of the sample is n, then np; > 5. 

Now, considering the sample (σι, £2, . . . , &n) of size n and calculating the corresponding frequencies hj 
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(for the classes given above), then the random variable 
3 E (h; — np;)? 
xg = (h fis) (16.132c) 
ja np; 

has approximately a y? distribution with m = k—1 degrees of freedom, if jj and o? are known, m = k—2 
if one of them is estimated from the sample, and m = k — 3 if both are estimated by Τ and s?. 
The test, if a random variable X has a normal distribution (x?-approximation test) consists in the com- 
parison of the experimental x2 of the sample with the corresponding Χάλι from Table 21.18, p. 1135 
for a given statistical significance level a and m degrees of freedom. 
Now one decides a significant level a and one takes out of the Table 21.18, p. 1135 the quantile X2... ; 
of the corresponding x? distribution (i depends on the number of unknown parameters). This means 
P(x? > x2,,)- holds. Comparing the value x% from (16.132c) and this quantile, and if 


XS < ΟΦ (16.132d) 

holds, one can accept the assumption that the sample came from a normal distribution. This test is 
also called the X? test for goodness of fit. 
E The following x? test is based on the example on p. 833. The sample size is n = 125, with the 
mean 7 = 176.32 and variance s? = 36.70. These values are used as approximations of the unknown 
parameters ji and c? of the population. Determining the test statistic y% according to (16.132c) after 
performing the calculations according to (16.132a) and (16.132b), yields the data in Table 16.4. 


Table 16.4 x? test 


De "- h;—mnp: 
Éj hj &-n Φ (=+) Dj np; (hy πρ)” 
σ σ npj 
TI 1 —2.90 0019 0.0019 | 0.2375 
9 1 —2.35 0094 0.0075 | 0.9375 
110| 2{5| -181 0351 | 0.0257 | 3.2125 f 129750 |  0-00005 
13 9 —1.26 .1038 0.0687 | 8.5857 
150 | 15 —0.72 .2358 0.1320 | 16.5000 0.1635 
170 | 22 —0.17 .4325 0.1967 | 24.5875 0.2723 
190 | 30 0.37 .6443 0.2118 | 26.4750 0.4693 
210 | 27 0.92 .8212 0.1769 | 22.1125 1.0803 
23 9 1.46 .9279 0.1067 | 13.3375 1.4106 
25 6 2.01 9778 0.0499 | 6.2375 
270 | 3 j 9 | 255 9946 | 0.0168 | 2.1000 j Barm 0:0526 
X3 = 3.4486 
It follows from the last column that x2 = 3.4486. Because of the requirement np; > 5, the number of 
classes is reduced from k = 11 to k* = k —4 = 7. Since for the calculation of the theoretical frequencies 
np; the estimated values z and s? of the sample are used instead of jj and o? of the population, so the 


number of degrees of freedom of the corresponding X? distribution is reduced by 2. The critical value is 
the quantile Xe ee 1-2: For a = 0.05 one gets x5 os. = 9.5 from Table 21.18, p. 1135, so because of the 
inequality X2 < X205, there is no contradiction to the assumption that the sample is from a population 
with a normal distribution. 


16.3.3.2 Distribution of the Sample Mean 


Let X be a continuous random variable. Suppose arbitrarily many samples of size n can be taken 
from the corresponding population. Then the sample mean is also a random variable X, and it is also 
continuous. 
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1. Confidence Probability of the Sample Mean 

If X has a normal distribution with parameters jj and σ᾽, then X is also a normal random variable with 
parameters ju and c? /n, i.e., the density function f(x) of X is concentrated more around p than the 
density function f(x) of the population. For any value € > 0 


€ -— eyn 
P(|X — u| < £) = 28 ( ) 1, P(X -u| < £) = 28 1 (16.133) 
σ σ 
Table 16.5 Confidence level holds. Consequently with increasing sample size n the probability 
for the sample mean that the sample mean is a good approximation of ji is also increasing. 


1 = 1 1 
n|P ( IX — pl] < 7) W Fore = 50 from (16.133) P ( [X -al< σσ) = 20 (3v7) =, 
2 and for different values of n one gets the values listed in Table 16.5. 


1 38.29 % As can be seen from Table 16.5, e.g., that with a sample size n = 49, 
4 68.27 % the probability that the sample mean 7 differs from ju by less than 
16 95.45 % 1. " 
25 98.76 % +57 is 99.95 %. 
49 99.96 % 


2. Sample Mean Distribution for Arbitrary Distribution of the Population 

The random variable X has an approximately normal distribution with parameters jz and c? /n for any 
distribution of the population with expected value u and variance o?. This fact is based on the central 
limit theorem. 


16.3.3.3 Confidence Limits for the Mean 


1. Confidence Interval for the Mean with a Known Variance c? 
If X is a random variable with parameters μ and σ᾽, then according to 16.3.3.2, p. 836, X is approxi- 
mately a normal random variable with parameters u and o?/n. Then substitution of 


2 X- 
Επ (16.134 


Z= 
yields a random variable Z which has approximately a standard normal distribution, therefore 


P(|Z| < ε) = {ΠΟ dx = 2Φ(ε) — 1. (16.135 
If the given significance level is a, namely, 
P(|Z| ε) 21- a, (16.136 
is required, then € = e(a) can be determined from (16.135), e.g., from Table 21.17, p. 1133, for the 
standard normal distribution. From |Z| < ε(α) and from (16.134) the relation 


σ 
= Tr Ea 6.137 
u e ( ( 

c 
follows. The values © + —~=e 
yn 
interval between them is called a confidence interval for the expected value ji with a known c? and given 
significance level a. In other words: The expected value ju is between the confidence limits (16.137) 

with a probability 1 — o. 


(a) in (16.137) are called confidence limits for the expected value and the 


Remark: If the sample size is large enough, then s? can be used instead of o? in (16.137). The sample 
size is considered to be large, if n > 100, but in practice, depending on the actual problem, it is consid- 
ered to be sufficiently large if n > 30. If n is not large enough, then in the case of a normally distributed 
population the { distribution is applied to determine the confidence limits as in (16.140). 
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2. Confidence Interval for the Expected Value with an Unknown Variance σ2 


If the population is approximately normally distributed and its variance c? is unknown, then it is re- 

placed by the sample variance s? and instead of (16.134) one gets the random variable 

X-u 
E 


m vn, (16.138) 
which has a t distribution (see 16.2.4.8, p. 824) with m = n — 1 degrees of freedom. Here n is the size 
of the sample. If n is large, e.g., n > 100 holds, then T can be considered as a normal random variable 
as Z in (16.134). For a given significance level α holds 


P(|T| € £) - [ sto r= P(E vase] -1-ao. (16.139) 


From (16.139) follows that £ = e(a,n) = ta/an—1, Where ta/2m~1 is the quantile of the t distribution 
(with n — 1 degrees of freedom) for the significance level a (Table 21.20, p. 1138). From |T] = μι 


E 
=T} relma (16.140) 
follows. The values 7 + ae ee are called the confidence limits for the expected value u of the 


νη 
distribution of the population with an unknown variance o? and with a given significance level a. The 
interval between these limits is the confidence interval. 

W A sample contains the following 6 measured values: 0.842; 0.846; 0.835; 0.839; 0.843; 0.838. Then 
T = 0.8405 and s = 0.00394 is obtained. 

What is the maximum deviation of the sample mean z from the expected value u of the population 
distribution, if the significance level a is given as 5 96 or 1 96 ? 

1. a = 0.05: one reads from Table 21.20, p. 1138, that ta/2:5 = 2.57, consequently 

[X — u| € 2.57 - 0.00394/./6 = 0.0042. Thus, the sample mean 7 = 0.8405 differs from the expected 
value x by less than 40.0042 with a probability 95 96. 

2. a = 0.01: έαρι = 4.03; [X — u| € 4.03 - 0. 0394/ /6 = 0.0065, i.e., the sample mean z differs from 
u by less than +0.0065 with a probability 99 96. 


16.3.3.4 Confidence Interval for the Variance 


If the random variable X has a normal distribution with parameters u and σ᾽, then the random variable 


" 
x! (n- 155 (16.141) fe(x) 
has a x? distribution with m = n — 1 degrees of 
freedom, where n is the sample size and s? is the 
sample deviation f(x) denotes the density func- 0/2 i α/2 
tion of the y? distribution in Fig. 16.15, and one o 
sees that x 
Bos a 0! χὶ x x 
POO < xu) = POC > XG) τσ. (16-142) u ο 
Thus, using the quantiles of the x? enn Figure 16.15 
(see Table 21.18, p. 1135) gives 
Xa = Xi —a/2;n—11 x% = χάρη 1. (16.143) 


Considering (16.141) one gets the following estimation for the unknown variance σ of the population 
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distribution with a significance level a: 
— 1)s? — 1)s 
ud )s ο. τ )s? 
Xa/2;n—1 XL. a/2;n—1 


(16.144) 


The confidence interval given in (16.144) for c? is fairly large for small sample sizes. 

Β For the numerical example (p. 838) with 6 measured values and for a = 5% one gets from Table 
21.18, p. 1135, X2 095.5 = 0.831 and 6975.5 = 12.8, so it follows from (16.144) that 0.625 -s < ø < 
2.453. s with s = 0.00394. 


16.3.3.5 Structure of Hypothesis Testing 

A statistical hypothesis testing has the following structure: 

1. First a hypothesis H is to be developed that the sample belongs to a population with some given 
properties, e.g., 

H: The population distribution has a normal distribution with parameters µ and σ᾽ (or another given 
distribution), or 

H: For the non-known ji an approximative value (estimate value) µο is inserted which can be got ,e.g., 
by rounding off the sample mean Z, or 
H: Two populations have the same expected value, j44 — H2 = 0, etc. 

2. Aconfidence interval B is to be defined, based on the hypothesis H (in general with tables). The 
value of the sample function should be in this interval with the given probability, e.g., with probability 
99% for a = 0.01. 
3. Calculating the value of the sample function and accepting the hypothesis if this value is in the 
given interval B, otherwise rejecting it. 


E Test of the hypothesis H: jj = jio with a significance level a. 


X - mw 


The random variable T = Vn has at distribution with m = n — 1 degrees of freedom according 


to 16.3.3.3, p. 837. It follows do this that this hypothesis is to reject, if © is not in the confidence 
interval given by (16.140), i.e., if 


S 
[τ m Hol 2 Vn bo /2n-1 (16.145) 


holds. One says that there is a significant difference. For further problems about tests see [16.16]. 


16.3.4 Correlation and Regression 

Correlation analysis is used to determine some kind of dependence between two or more properties of 
the population from the experimental data. The form of this dependence between these properties is 
determined with the help of regression analysis. 


16.3.4.1 Linear Correlation of two Measurable Characters 


1. Two-Dimensional Random Variable 
In the following the formulas are mostly used for continuous random variables, but it is easy to replace 
them by the corresponding formulas for discrete variables. Suppose that X and Y, as a two-dimensiona 
random variable (X,Y), have the joint distribution function 


zy 
F(x,y) = P(X <2,Y <y)= / [ flay) dx dy, (16.146a 
F(x) = P(X <2,Y «oo, Fly) = P(X «ooY < y). (16.146b 


The random variables X and Y are called independent of each other if 
F(x,y) = F(x) Fo(y) (16.147 
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holds. The fundamental quantities assigned to X and Y determined from their joint density function 
f(x,y) are: 
1. Expected Values 


ux = E(X) -/ fe f(x,y) dxdy, (16.148a) py = E(Y) =) IE f(x,y)dvrdy, | (16.148b 
2. Variances 

ox = E((X — με)”, (16.1492) oy = E((Y — py)’). (16.149b 
3. Covariance 

oxy = E(X — ux)(Y — ny). (16.150 
4. Correlation Coefficient 

Oxy = -a (16.151 

σχογ 

It is assumed that every expected value above exists. The covariance can also be calculated by the 
formula MET 

oxy = E(XY) — uxuy where E(XY) = / [vy f(x,y) dx dy. (16.152 


The correlation coefficient is a measure of the linear dependence of X and Y, because of the following 
facts: 

All points (X,Y) are on one line with probability 1 if o$,. = 1 holds. If X and Y are independent 
random variables, then their covariance is equal to zero, oxy = 0. From oxy = 0, it does not follow 
that X and Y are independent, but it does if they have a two-dimensional normal distribution which is 
defined by the density function 


Hey) 1 | 1 [2 μα)” 
T.U exp : : 
2πσχσγψ!-- ory 2(1— oxy) ox 
oxv(x — nx)(y — nv) , (u— | 
OXOy | σῇ ` 


2 


(16.153) 


2. Test for Independence of two Variable 

In practical problems often the question arises of whether the variables X and Y can be considered inde- 
pendent with oxy = 0, if the sample with size n and with the measured values (α;, y;) (i = 1,2,...,7) 
comes from a two-dimensional normal distributed population. The test is performed in the following 
way: 
1. Setting up the hypothesis H: oxy = 0. 


2. Determining a significance level a and determining the quantile ta,m of the t distribution from Table 
21.20, p. 1138, for m — n — 2. 

3. Calculation of the empirical correlation coefficients Τεν and calculation of the test statistics (sample 
function) n 


Σία; — v)(yi — V) 


fas (16.1542) with Των = 
Jli-r72 τ ΤΝ y) 
vy {55 — τ᾽ Mi = y) 


4. Rejection of the hypothesis if |t| > tam holds. 


t= 


(16.154b) 
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16.3.4.2 Linear Regression for two Measurable Characters 


1. Determination of the Regression Line 
If a certain dependence is detected between the variables X and Y by the correlation coefficient, then 
the next problem is to find the functional dependence Y = f(X). Here mostly linear dependence is 
considered. 
In the simplest case of linear regression it is supposed that for any fixed value of x the random variable 
Y in the population has a normal distribution with the expected value 

E(Y) —- a bx (16.155) 
and a variance c? independent of x. The relation (16.155) means that the mean value of the random 
variable Y depends linearly on the fixed value of x. The values of the parameters a, b and c? of the 
population are usually unknown. To estimate them approximately from a sample with values (z;, yi) 
(i = 1,2,...,n) the least squares method can be used. The least squares method requires that 


Yl — (a 4- ba;)]? = min! (16.156) 


is valid so one gets the estimates 


n 


b= = — ἄ--τ-- bT σα r2) with (16.1572) 
Σα - T) 
i=l 
E 1 n 1 n " 1 n E 
ow E no. AP OR. 16.1571 
77m ath ding" δ. E 5-124 9 PM 


The empirical correlation coefficient ray is given in (16.154b). The coefficients à and b are called regres- 
sion coefficients. The line y(x) = à + bx is called the regression line. 
2. Confidence Intervals for the Regression Coefficients 


The next question is, after the determination of the regression coefficients à and b, how well do the 
estimates approximate the theoretical values a and b. Therefore the test variables are to be formed as 


$,V/n — 2 yn 
yf l -r2 
d» 
Li 
i=1 
These are realizations of random variables having a t distribution with m = n — 2 degrees of freedom. 


For a given significance level a the quantile ta/2;m is taken from Table 21.20, p. 1138, and because 
P( |t| € ta/2m) = 1 — a holds for t = t, and t = ty : 


Μαξ a IS oe 
|b — b| < toj; 2 , (16.1592) |à — αἱ € ta/2m-2 


s,v/n — a s,vn —2- n 


A confidence region for the regression line y = a + bx can be determined with the confidence interval 
given in (16.159a,b) for a and b (see Lit. [16.4], [16.11]). 


(16.158a) and t,-— (à 


(16.158b) 


(16.159b) 
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16.3.4.3 Multidimensional Regression 


1. Functional Dependence 
Suppose that there is a functional dependence between the characters X4, Xo,..., Xn, and Y, which is 
described by the theoretical regression function 


y= ft onin) = 5 259j(*1, 32, . τον tà). (16.160) 
j=0 
The functions gj(z1,2,...,v,) are known functions of n independent variables. The coefficients a; 
in (16.160) are constant multipliers in this linear combination. The expression (16.160) is also called 
linear regression, although the functions g; can be arbitrary. 


E The function f (zi, £2) = ao + a121 + Gory + αι’ + a4v5? ο which is a complete quadratic 
polynomial of two variables with gy = 1, gi = £1, ga = £2, 93 = 242, g4 = 207, and gg = πιο, is an 
example for a theoretical linear regression function. 


2. Writing in Vector Form 
It is useful to write formulas in vector form in the multidimensional case 

x= (21,22, ..., 24), (16.161) 
so, (16.160) now has the form: 


y= f) = Y agla). (16.162) 


j=0 


3. Solution and Normal Equation System 
The theoretical dependence (16.160) cannot be determined by the measured values 


(x, fi),  (i—21,2,...,N) (16.163a) 
because of random measuring errors. Looking for the solution in the form 
y = f(x) = > ἄν) (16.163b) 
150 


and using the least squares method (see 16.3.4.2, 1., p. 841) the coefficients @; as the estimations of the 
theoretical coefficients a; can be determined, from the equation 


Σ [i -- F (x)]° = mint. (16.163c) 


i=1 


Introducing the notation 


(1) (1) a) 
aio fi Jo (x ) gı (x ) e Os (x ) 
ü ν (2) (2) (2) 
a-la], £- h ERE (x?) g(x) ... (κ) (16.1634) 
as ἦν Jo (x) g (x) e Qs (x) 
one gets from (16.163c) the so-called normal system of equations 
G7TGà = GTf (16.1630) 


to determine à. The matrix GTG is symmoetric, so the Cholesky method (see 19.2.1.2, p. 958) is espe- 
cially good to solve (16.163e). 

Β Consider the sample whose result is given in the next table. Determine the coefficients of the re- 
gression function (16.164): 
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σι | 5. 3 5 3 
T2 0.5 0.5 0.3 0.3 (as, £2) = ao + a121 + ager. (16.164) 
flansa) |15 35 62 32 


From (16.1634) it follows that 


ὃς 1.5 1 5 0.5 
x. iss [93.5 11 3 05 
a= (s). f= | gale f—[3 5 03 (16.165) 
s 3.2 1 3 03 
and (16.1636) is 
430 + 164, + 1.6 à = 14.4, a = 7.0, 
1620 + 684, + 6.4 ἆ; = 58.6, i.e., à4 = 0.25, (16.166) 
1.6αρ + 6.4a4 + 0.684» = 5.32, ñ= —11. 
4. Remarks ] 
1. To determine the regression coefficients one starts with the interpolation f (κ) =f (i= 
1,2,..., N), i.e., with 
Gà — f. (16.167) 


In the case s < N, (16.167) is an over determined system of equations which can be solved by the 
Householder method (see 19.6.2.2, p. 985). The multiplication of (16.167) by GT to get (16.163e) is 
also called Gauss transformation. If the columns of the matrix G are linearly independent, i.e., rank 
G = s + 1 holds, then the normal system of equations (16.163e) has a unique solution, which coincides 
with the result of (16.167) got by the Householder method. 

2. Also in the multidimensional case, it is possible to determine confidence intervals for the regression 
coefficients with the t distribution, analogously to (16.159a,b). 

3. Bythehelp ofthe F distribution (see 16.2.4.7, p. 823), it is possible to use a so-called equivalence test 
to analyze the assumption (16.163b). This test shows if a solution in the form (16.163b) but with less 
terms yields a sufficient approximation to the theoretic regression function (16.160). (see Lit. [16.10]). 


16.3.5 Monte Carlo Methods 
16.3.5.1 Simulation 


Simulation methods are based on constructing equivalent mathematical models. These models are then 
easily analyzed by computer. In such cases, one talks about digital simulation. A special case is given 
by Monte Carlo methods when certain quantities of the model are randomly selected. These random 
elements are selected by using random numbers. 


16.3.5.2 Random Numbers 

Random numbers are realizations of certain random quantities (see 16.2.2, p. 811) satisfying given 
distributions. In this way it is possible to distinguish different kinds of random numbers. 

1. Uniformly Distributed Random Numbers 

These numbers are uniformly distributed in the interval [0, 1], they are realizations of the random vari- 
able X with the following density function fo(x) and distribution function F(x): 


0 for 0<a 
1 for <n < 1, pr 
fo(x) =f or Οςα a) Γο(α) -f 


0 otherwise: 2 fo θ«σς1, (16.168) 
| ME 1 fo r1. 

1. Method of the Inner Digits of Squares A simple method to generate random numbers is 

suggested by J. v. Neumann. It is also called the method of the inner digits of squares, and it starts 

from a decimal number z € (0,1) which has 2n digits. First z? is formed, getting a decimal number 
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which has 4n digits. Erasing its first and its last n digits, so one again has a number with 2n digits. To 
get further numbers, this procedure is repeated. In this way 2n digit decimal numbers are produced 
from the interval [0, 1] which can be considered as random numbers with a uniform distribution. The 
value of 2n is selected according to the largest number representable in the computer. For example, 
choosing 2n = 10. This procedure is seldom recommended, since it produces more smaller numbers 
than it should. Several other different methods have been developed. 

HE 2n-—4: 
z = zo = 0,1234, 20 = 0,01|5227 56, 

z= zi = 0,5227, 27 = 0,273215 29, 

z = 2% = 0,3215 etc. 

The first three random numbers are zo, 21 and 2ο. 


2. Congruence Method The so-called congruence method is widely used: A sequence of integers 
zi (i = 0,1,2,...) is formed by the recursion formula 


Ziy 9 c:z; modm. (16.169) 


Here zo is an arbitrary positive number and c and m denote positive integers, which must be suitably 
chosen. For z;,, one takes the smallest non-negative integer satisfying the congruence (16.169). The 
numbers z;/rn are between 0 and 1 and can be used for uniformly distributed random numbers. 


3. Remarks 

a) Choosing m — 2", where r is the number of bits in a computer word, e.g., r — 40. Then the number 
cis to be chosen in the order of ym. 

b) Random number generators using certain algorithms produce so-called pseudo-random numbers. 


[1 


c) On calculators and also in computers, “ran” or “rand” is used for generating random numbers. 


2. Random Numbers with other Distributions 


To get random numbers with an arbitrary distribution function F(x) the following procedure is in use: 
Consider a sequence of uniformly distributed random numbers £, £,... from [0,1]. With these num- 
bers the numbers n; = F'~'(€;) are formed for i = 1,2,... . Here F^! (x) is the inverse function of the 
distribution function F(x). Then one gets: 

P(n, € x) = P(F !(&) € x) = P(& € F(x)) = / fo(t) dt = F(x), (16.170) 


0 
i.e., the random numbers m, rj, . . . satisfy a distribution with the distribution function F(x). 


3. Tables and Application of Random Numbers 

1. Construction Random number tables can be constructed in the following way. Ten identical 
chips are indexed by the numbers 0, 1,2,...,9. Placing them into a box and shuffle them. One of them 
is then selected, and its index is written into the table. Then the chip is to be replaced into the box, 
again shuffling, and choosing the next one. In this way a sequence of random numbers is produced, 
which is written in groups (for easier usage) into the table. In Table 21.21, p. 1139, four random 
digits form a group. 

In the procedure, it must be guaranteed that the digits 0, 1,2, ..., 9 always have equal probability. 


2. Application of Random Numbers The use of a table of random numbers is demonstrated with 
an example. 

W Suppose one chooses randomly n = 20 items from a population of N = 250 items. The objects 
are renumbered from 000 to 249. Then a number is chosen in an arbitrary column or row in Table 
21.21, p. 1139, and a rule is determined of how the remaining 19 random numbers should be chosen, 
e.g., vertically, horizontally or diagonally. Only the first three digits are considered from these random 
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numbers, and they are used only if they form a number smaller than 250. 


16.3.5.3 Example of a Monte Carlo Simulation 


'The approximate evaluation of the integral 
1 
I= EO dz (16.171) 
0 


is considered as an example of the use of uniformly distributed random numbers in a simulation. Two 
solution methods are discussed here. 
1. Applying the Relative Frequency 
Supposing 0 < g(a) < 1 holds; this condition can always be guaranteed by 
atransformation (see (16.175), p. 845). Then the integral J is an area inside 
the unit square E (Fig. 16.16). Considering the numbers of a sequence 
of uniformly distributed random numbers from the interval [0. 1] in pairs 
as the coordinates of points of the unit square E, one gets n points P; (i = 
1,2,...,n). Denoting by m the number of points inside the area A, then 
considering the notion of the relative frequency (see 16.2.1.2, p. 808) for 
the integral follows: 

1 


m 

Figure 16.16 "E do S (16.172) 
0 

To achieve relatively good accuracy with the ratio in (16.172), a very large number of random numbers 

is necessary. This is the reason why one is looking for possibilities to improve the accuracy. One of 

these methods is the following Monte Carlo method. Some others can be found in the literature (see 

Lit. [16.20]). 

2. Approximation by the Mean Value 

To determine (16.171), one starts with n uniformly distributed random numbers £1, £,...,€, as the 

realization of the uniformly distributed random variable X. Then the values g; = g(&) (i = 1,2,...,n) 

are realizations of the random variable g(.X ), whose expectation according to formula (16.49a,b), p. 813, 


1s: 
oo 


EUX) = f Κο e ΤΣ αν (16.173) 
0 i=1 


This method, which uses a sample to obtain the mean value, is also called the usual Monte Carlo 
method. 


16.3.5.4 Application of the Monte Carlo Method in Numerical 
Mathematics 
1. Evaluation of Multiple Integrals 


First, it is to show how to transform a definite integral of one variable (16.174a) into an expression 
which contains the integral (16.174b). 


b 1 
r= nio dx (16174. I= [ 99 dx with O<g(x)<1.  (16.74b) 
a 0 
Then the Monte Carlo method given in 16.3.5.3 can be applied introducing the following notation: 
xz —a-4 (b-— a)u, m= min h(x), M = max h(a). (16.175) 
x€[a,b] € [a,b] 


Then (16.174a) becomes 
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f h(a + (b — a)u) - m 
M-m 


du + (b — a)m, (16.176) 


h(a + (b — aju) — m 
M-m 

W The approximate evaluation of multiple integrals with Monte Carlo methods is demonstrated by an 

example of a double integral 


V= / Παιν) ασαν with h(x,y) 7 0. (16.177) 
A 


where the integrand = g(u) satisfies the relation 0 < g(u) < 1. 


S denotes a plane surface domain given by the inequalities a < x € b and yi(x) € y < p2(x), where 
Φι(α) and p2(x) denote given functions. Then V can be considered as the volume of a cylindrical solid 
K, which stands perpendicular to the x, y plane and its upper surface is given by h(a, y). Ifh(x,y) € e 
holds, then this solid is in a block Q given by the inequalities a < x < b; e<y<d,0<2z< 
e (a,b,c,d,e const). After a transformation similar to (16.175), one gets from (16.177) an expression 
containing the integral 


Vr= ΠΟ dudv with 0< g(u,v) € 1, (16.178 


where V* can be considered as the volume of a solid K* in the three-dimensional unit cube. The integra 
(16.178) is approximated by the Monte Carlo method in the following way: 
Triplets of the numbers of a sequence of uniformly distributed random numbers from the interval [0, 1 


are considered as the coordinates of points P; (i = 1,2,...,n) of the unit cube, and count how many 
points among P; belong to the solid K*. If m point belong to K*, then analogously to (16.172) 
m 
V'z—. (16.179 
n 


Remark: In definite integrals with one integration variable one should use the methods given in 19.3, 
p. 963. For the evaluation of multiple integrals, the Monte Carlo method is still often recommended. 
2. Solution of Partial Differential Equations with the Random Walk Process 

'The Monte Carlo method can be used for the approximate solution of partial differential equations with 
the random walk process. 


y^ a) Example of a Boundary Value Problem: Con- 
| 
sider the following boundary value problem as an ex- 
||] r ample: 

ΥΠ bee Pu Pu 
Au--—-c—-—0 for (rzy)eG, (16.180a 
y-r- * * i Ox? 2 Oy? or (zy) ( a) 
στις S u(z,y)— f(x,y) for (x,y) eT. (16.180b) 
Here G is a simply connected domain in the x, y plane; 
Eod d I' denotes the boundary of G (Fig.16.17). Similarly 
eh > to the difference method in paragraph 19.5.1, p. 976 
0 x-1 x ΧΕΙ x G is covered by a quadratic lattice, where it can be 
assumed, without loss of generality, that the step size 

Figure 16.17 can be chosen as h = 1. 


In this way interior lattice points P(x, y) and boundary points R; are obtained. The boundary points 
R;, which are at the same time also lattice points, are considered in the following discussion as points 
of the boundary I of G, i.e.: 

u(R;) = f(Ri) (6—1,2,...,N) (16.181) 


16.3 Mathematical Statistics 847 


b) Solution Principle: One imagines that a particle starts a random walk from an interior point 
P(x,y). That is: 
1. The particle moves randomly from P(x, y) to one of the four neighboring points. To each of these 
four grid points 1/4 is assigned as the probability to move into them. 
2. If the particle reaches a boundary point R;, then the random walk terminates there with probability 
one. 
It can be proven that a particle starting at any interior point P reaches a boundary point R; after a 
finite number of steps with probability one. Denoting by 

p(P. Ri) = p((z, y), Ri) (16.182) 
the probability that a random walk starting at P(x,y) will terminate at the boundary point R;, then 
one gets 


p(R; R;)-—1, p(R,R;)—0 for ὑπ} and (16.183) 


p((x, y), Ri) G 1,y), Ri) +p((£+1, y), Ri) +p((x, y—1), Ri)+p((x,y+1), ΠΕ]. (16.184) 


The equation (16.184) is a difference equation for p((x, y), Ri). If starting n random walks from the 
point P(x,y), from which m; terminates at R; (m; € n), then 
m; 


p((z,y), R))  —- (16.185) 


The equation (16.185) gives an approximate solution of the differential equation (16.180a) with the 
boundary condition (16.181). The boundary condition (16.180b) will be fulfilled if substituting 


v(P) = v(z,y) = Σ, F(R), y), Ri), (16.186) 


because of (16.184), v(R;) = ο Ri) = f(R;). 


To calculate v(x, y) (16.184) is multiplied by f (R;). Summation yields the following difference equation 
for u(x, y): 


v(r,y) = lve ly) + v(x 1-1, υ) 1-υ(α,υ — 1) + v(z, y -- 1) . (16.187) 


Starting n random walks from an interior point P(x, y), and among them m; terminate at the boundary 
point R; (i = 1,2,..., N), then an approximate value is obtained at the point P(x, y) of the boundary 
value problem (16.180a,b) by 


υ(α, y) 56 - Ymf(R). (16.188) 


16.3.5.5 Further Applications of the Monte Carlo Method 


Monte Carlo methods as stochastic simulation, sometimes called methods of statistical experiments, are 
used in many different areas. For examples: 

e Nuclear techniques: Neutrons passing through material layers. 

e Communication: Separating signals and noise. 

e Operations research: Queueing systems, process design, inventory control, service. 

For further details of these problem areas see for example [16.15]. 
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16.4 Calculus of Errors 


Every scientific measurement, giving certain numerical quantities — regardless of the care with which 
the measurements are made -- is always subject to errors and uncertainties. There are observational 
errors, errors of the measuring method, instrumental errors and often errors arising from the inherent 
random nature of the phenomena being measured. Together they compose the measurement error. 
All measurement errors arising during the measuring process are called deviations. As a consequence 
a measured quantity represented by a number of significant digits can be given only with a rounding 
error, i.e., with a certain statistical error, which is called the uncertainty of the result. 

1. The deviations of the measuring process should be kept as small as possible. On this basis it is to 
be evaluated the possible best approximation, what can be done with the help of smoothing methods 
which have their origin in the Gaussian least squares method. 


2. The uncertainties have to be estimated as well as possible, what can be done with the help of meth- 
ods of mathematical statistics. 


Because of the random character of the measuring results they can be considered as statistical samples 
(see 16.2.3, 1., p. 814) with its probability distribution, whose parameters contain the desired informa- 
tion. In this sense, measurement errors can be seen as sampling errors. 


16.4.1 Measurement Error and its Distribution 


16.4.1.1 Qualitative Characterization of Measurement Errors 

Qualifying the measurement errors by their causes, the following three types of errors are distinguished: 
1. Rough errors are caused by inaccurate readings or confusion; they are excludable. 

2. Systematic measurement errors are caused by inaccurately scaled measuring devices and by the 
method of measuring, where the method of reading the data and also the measured error of the mea- 
surement system can play a role. They are not always avoidable. 


3. Statistical or random measurement errors can arise from random changes of the measuring con- 
ditions that are difficult or impossible to control and also by certain random properties of the events 
observed. 

In the theory of measurement errors usually it is assumed that the rough errors and the systematic 
measurement errors are excludable, so that only the statistical properties and the random measure- 
ment errors are included into the calculation of the errors. 


16.4.1.2 Density Function of the Measurement Error 


1. Measurement Protocol 

To calculate the characterization of the uncertainties it must be supposed that the measured results 
are listed in a measurement record as a prime notation and that the relative frequencies or the density 
function f(a) or the cumulative frequencies or the distribution function F(x) (see 16.3.2.1, p. 832) of 
the uncertain values are available. The realization of the random variable X under consideration is 
denoted by x. 

2. Error Density Function 
Special assumptions about the properties of the measurement error result in certain special properties 
of the density function of the error distribution: 

1. Continuous Density Function Since the random measurement errors can take any value in a 
certain interval, they are described by a continuous density function f(x). 

2. Even Density Function If measurement errors with the same absolute value but with different 
signs are equally likely, then the density function is an even function: f(—x) = f(x). 

3. Monotonically Decreasing Density Function If a measuring error with larger absolute value 
is less likely than an error with smaller absolute value, then the density function f(x) is monotonically 
decreasing for x > 0. 
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4. Finite Expected Value The expected value of the absolute value of the error must be finite: 
E(|X|) = / [ε]{(α) dz < oo. (16.189) 


Different properties of the errors result in different types of density functions. 
3. Normal Distribution of the Error 


1. Density Function and Distribution Function In most practical cases it can be supposed that 
the distribution of the measurement error is a normal distribution with expected value = 0 and 
variance c?, i.e., the density function f(x) and the distribution function F(x) of the measurement 
error are: 


1 -24) 1 2 LP üt 
α) = > 20 16.1902 " i= » 2? dt = (5) 
f(x) τπτ ( a) and F(x) ml e 207 qt - (=). (16.1900) 


Here (x) is the distribution function of the standard normal distribution (see (16.73a), p. 819, and 
Table 21.17, p. 1133). In the case of (16.190a,b) one speaks about normal errors. 

2. Geometrical Representation The density function (16.1902) is represented in Fig. 16.184 with 
inflection points and points at the center of gravity, and its behavior is shown in Fig. 16.18b when the 
variance changes. The inflection points are at the abscissa values +ø; the centers of gravity of the half- 


areas are at +7. The maximum of the function is at x = 0 and it is 1/(o 2x). The curve widens as c? 
increases, the area under the curve always equals one. This distribution shows that small errors occur 
often, large errors only seldom. 


f(x) 


Figure 16.18 


4. Parameters to Characterize the Normally Distributed Error 


Beside the variance c? or the standard deviation σ which is also called the mean square error or standard 
error, there are other parameters to characterize the normally distributed error, such as the measure 
of accuracy h, the average error or mean error η, and the probable error y. 


1. Measure of Accuracy Beside the variance c?, the measure of accuracy 
h = —= (16.191) 
is used to characterize the width of the normal distribution. A narrower Gauss curve results in better 


accuracy (Fig. 16.18b). Replacing σ by the experimental value 6 or õ, obtained from the measured 
values, the measure of accuracy characterizes the accuracy of the measurement method. 
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2. Average or Mean Error The expected value η of the absolute value of the error is defined as 
T 2 
n= E(|X|) = 2 f sf) dx = "E (16.192 
T 
0 
3. Probable Error The bound y of the absolute value of the error with the property 
> 1 
Ρ(ΙΧΙ«γ)Ξς (16.193a 


2 
is called the probable error. It implies that 


+7 9 yo 1 

EO dz = 29 (2) eie I PP E (16.193b 
» o 3T κ 

=y 0 


where (x) is the distribution function of the standard normal distribution. The condition (16.193b 
is a non-linear equation which can be solved approximately by the help of a computer algebra system 
to get y/o. The result is 


2 & 0.6745. (16.193c 


4. Given Error Bounds If an upper bound a > 0 of an error is given, then with (16.193b) can be 
calculated the probability that the error is in the interval [—a, a]: 
P(|X| < a) = 28 (5) xc (16.194) 
σ 
5. Relations between Standard Deviation, Average Error, Probable Error, 


and Accuracy If the error has a normal distribution, then the following relations hold using 
(16.193c): 


1 
n= 2c. (16.195a) y = 0.67450, (16.1955) h= BUG. (16.195c) 


16.4.1.3 Quantitative Characterization of the Measurement Error 

1. True Value and its Approximations 

The true value z,, of a measurable quantity is usually unknown. Therefore the expected value of the 
random variables, whose realizations are the measured values x; (i = 1,2,...,n), is chosen as an esti- 
mated value of £u. Consequently, the following means can be considered as an approximation of £w. 
1. Arithmetical Mean 


k 
T=- r (16.196a) οἳ T= X hj, (16.196b) 
j=l 


if the measured values are distributed into k classes with absolute frequencies h; and class means 
Ἐν (9 S15 234525): 
2. Weighted Mean 


z9) -. δ gi | »» s (16.197) 
i=1 i=1 


Here the single measured values are weighted by the weighting factors g; (g; > 0) (see 16.4.1.6, 1., 
p. 854). 

2. Error of a Single Measurement in a Measurement Sequence 

1. The True Error of a Single Measurement in a Measurement Sequence is the difference 
between the true value z,, and the measuring result. Because this is usually unknown, the true error 
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ει of the i-th measurement with the result v; is also unknown: 
Ei = By = αι. (16.198) 
2. The Mean Error or Apparent Error of a Single Measurement in a Measurement Se- 
quence is the difference of the arithmetical mean and the measurement result z;: 
vj =T- ti. (16.199) 
3. Mean Square Error of a Single Measurement or Standard Error of a Single 
Measurement Since the expected value of the sum of the true errors e; and the expected value 
of the sum of the apparent errors v; of n measurements are zero (independently of how large they are), 
they are characterized by the sum of the error squares: 
n n 
2 2 2 2 : 
E —) 8, (16.2002) v? u. (16.200b) 
ixl i=1 
From a practical point of view only the value of (16.200b) is interesting, since only the values of v; can 
be determined from the measuring process. Therefore, the mean square error of a single measurement 
of a measurement sequence is defined by 


ὅ- P - 1). (16.200c) 


The value 6 is an approximation of the standard deviation σ of the error distribution. 
One gets for & = c in the case of normally distributed error: 


P(|e| € &) = 24 (1) — 1 = 0.6826. (16.200d) 
That is: The probability that the absolute value of the true value does not exceed g, is about 68%. 


€ 


4. The Probable Error of a single measurement is the number 7, for which 


€ 


P( 


E375: (162014) 


That is: The probability that the absolute value of the error does not exceed y, is 50%. The abscis- 
sae +y divide the area of the left and the right parts under the density function into two equal parts 
(Fig. 16.18a). 
In the case of a normally distributed error in accordance to (16.193c) holds 

y ~ 0.67450 ~ 0.67450 =7. (16.201b) 


5. Average Error of a single measurement is the number η, which is the expected value of the 
absolute value of the error: 


n= Ε(Ιε}} = I [αἱ f(x) dz. (16.202a) 


/2 
In the case of a normally distributed error follows η = ϱ/ --σ and 
π 


<n) -- 26 (2) 1=20 (\2) 120.5751: (16.2025) 
σ π 


The probability that the error does not exceed the value η is about 57.6 %. The centers of gravity of 
the left and right areas under the density function (Fig. 16.18a) are at abscissae +7. In the case of 
normally distributed errors: 


2 2 
n= \26 RI τὸ =). (16.202c) 
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3. Error of the Arithmetical Mean of a Measurement Sequence 

The error of the arithmetical mean 7 of a measurement sequence is given by the errors of the single 
measurement: 

1. Mean Square Error or Standard Deviation 


OAM = (nn ni vi VA n — 1)] κ”. (16.203) 


2. Probable Error 


Jam ~ 0.6745 x vi VA n — 1)] = 0.6745 (16.204) 


δ 


3. Average Error 


; δ 
flame 55 0.7979 2J inf n(n — 1)] = 0.7979— . 16.205 
ΑΜ D ) yn ( ) 


4. Accessible Level of Error Since the three types of errors defined above (16.203)-(16.205) are 
directly proportional to the corresponding error of the single measurement (16.200c), (16.201b) and 
(16.202c) and they are proportional to the reciprocal of the square root of n, it is not reasonable to 
increase the number of the measurements after a certain value. It is more efficient to improve the 
accuracy h of the measuring method (16.191). 

4. Absolute and Relative Errors 

1. Absolute Uncertainty, Absolute Error The uncertainty of the results of measurement is char- 
acterized by errors €;, vj, σι, Yi; Ni, OF €, V, σ, y, N, Which measure the reliability of the measurements. 
The notion of the absolute uncertainty, given as the absolute error, is meaningful for all these types of 
errors and for the calculation of error propagation (see 16.4.2, p. 854). They have the same dimension 
as the measured quantity. 

The word “absolute” error is introduced to avoid confusion with the notion of relative error. Often the 
notation Az; or Ax is used. The word “absolute” has a different meaning from the notion of absolute 
value: It refers to the numerical value of the measured quantity (e.g., length, weight, energy), without 
restriction of its sign. 

2. Relative Uncertainty, Relative Error The relative uncertainty, given by the relative error, 
is a measure of the quality of the method of measurement with respect to the numerical value of the 
measured quantity. In contrast to the absolute error, the relative error has no dimension, because it is 
the quotient of the absolute error and the numerical value of the measured quantity. If this value is not 
known, one replaces it by the mean value of the quantity x: 


Aw; Δα; 
be; =e ο. (16.206a) 
d T 
The relative error is given mostly as a percentage and it is also called the percentage error: 
δαι/ 6 = δα; - 100 96. (16.206b) 


5. Absolute and Relative Maximum Error 

1. Absolute Maximum Error If the quantity z is to be determined and is z a function of the mea- 
sured quantities £1, £2, .. . , En, Le. Z = f(r1,22,..., En), then the resulting error must be calculated 
taking also the function f into consideration. ‘Phere are two different ways to examine errors. The 
first approach is that statistical error analysis is applied by smoothing the data values using the least 
squares method (min (z; — z)?). In the second approach, an upper bound AZmax is determined for 
the absolute error of the quantities. With n independent variables α;, holds: 


n 


ὃ 
ος Σ Jz, f (21,22, ..., 24)| Avi, (16.207) 
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where the mean value z; should be substituted for zi. 
2. Relative Maximum Error The relative maximum error is the absolute maximum error divided 
by the numerical value of the measured value (mostly by the mean of z): 


bz A Zmax pe Azmax 
“πχ Ἢ ". C 


(16.208) 


2 2 


16.4.1.4 Determining the Result of a Measurement with Bounds on the 
Error 

A realistic interpretation of a measurement result is possible only if the expected error is also given; 

error estimations and bounds are components of measurement results. It must be clear from the data 

what is the type of the error, what is the confidence interval and what is the significance level. 

1. Defining the Error The result of a single measurement is required to be given in the form 


t= Agy ng, (16.209a) 
and the result of the mean has the form 
x ο. ΑΥΤ πα (16.209b) 
Here for Ax the most often used standard deviation σ is applied. ^ and ἢ could also be used. 
X-—m 
2. Prescription of Arbitrary Confidence Limits The quantity T = τ. has a t distribution 
CAM 
(16.97b) with f = n — 1 degrees of freedom in the case of a population with a distribution N (ji, o?) 


according to (16.96). For a required significance level a or for an acceptance probability S = 1 — a the 
confidence limits for the unknown quantity £u = ji with the t quantile ta/2;f are 


H =T E ta/z;f -ΖΑΜ- (16.210) 
That is, the true value z,, is in the interval given by these limits with a probability S = 1 — a. 
Mostly it is of interest keeping the size n of the measurement sequence at its lowest possible level. The 
length 2ta/2;fõam of the confidence interval decreases for a smaller value of 1 — a and also for a larger 
number n of measurements. Since &4y decreases proportionally to 1/,/n and the quantile ta yo.f With 
f = n — 1 degrees of freedom also decreases proportionally to 1/,/n (for values of n between 5 and 10, 
see Table 21.20, p. 1138) the length of the confidence interval decreases proportionally to 1/n for such 
values of n. 


16.4.1.5 Error Estimation for Direct Measurements with the Same 


Accuracy 
If there is the same variance c; for all n measurements, one talks about measurements with the same 
accuracy h = const. In this case, the least squares method results in the error quantities given in 


(16.200c), (16.2015), and (16.202b). 


W Determine the final result for the measurement sequence given in the following table which contains 
n — 10 direct measurements with the same accuracy. 


x; | 1.592 | 1.581 | 1.574 | 1.566 | 1.603 | 1.580 | 1.591 | 1.583 | 1.571 | 1.559 
v:10 | -12| —1| +6] +14] -23 0 11 3 +9} +21 
υ «1061 144 1 36| 196] 529 0] 121 9 8 441 


T = 1.580,6 = | Mov? /(n — 1) = 0.0131, Gay = δ/νη = 0.004. 
i=1 
Final result: x = T + õm = 1.580 + 0.004. 
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16.4.1.6 Error Estimation for Direct Measurements with Different 
Accuracy 

1. Weighted Measurements 

If the direct measurement results x; are obtained from different measuring methods or they represen 


means of single measurements, which belong to the same mean z with different variances σι”. then a 
weighted mean 


zo -Ysu/Ys (16.211 
i=1 i—l 


is calculated, where g; is defined as 


o2 
gi =. (16.212 


σι 


Here 6 is an arbitrary positive value, mostly the smallest σι. It serves as a weight unit of the deviations, 
i.e., for c; = σ it is g; = 1. It follows from (16.210) that a larger weight of a measurement results in a 
smaller deviation σι. 

2. Standard Deviations 

The standard deviation of the weight unit is estimated as 


Bh = Yaw? / n Edi (16.213) 
i=1 


It must be sure that δ) < δ. In the opposite case, if 6/? > δ, then there are x; values which have 
systematic deviations. 


The standard deviation of the single measurement is 


o) σί9) 
το ον (16.214) 
Ji σ 


where 6; «δι can be expected. 
The standard deviation of the weighted mean is: 


ο, -οὐ/ᾖξια-{Σοο/{--υξοὴ (16215) 
i=1 i=1 


i=l 


3. Error Description 

The error can be described as it is represented in 16.4.1.4, p. 853, either by the definition of the error 
or by the t quantile with f degrees of freedom. 

W The final results of measurement sequences (n = 5) with different means z; (i = 1,2,...,5) and 
with different standard deviations 9645; are given in Table 16.6. 

Calculating (7;),, = 1.5830 and choosing xp = 1.585 and ὅ = 0.009 with z; = T; — zo, gi = 6?/6;? one 


πι 
gets Z = —0.0036 and T = £o +Z = 1.582. The standard deviation is ¢ = |» give f in -1) -- 
i=1 


0.0088 < & and σι = σαι = 0.0027. The final result is x = F + σι = 1.585 + 0.0027. 


16.4.2 Error Propagation and Error Analysis 


Measured quantities appear in final results often in the form of functional dependencies. One talks 
about error propagation. If the error is small, then a Taylor expansion with respect to the error can be 
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Table 16.6 Error description of a measurement sequence 


Ti ΖΑΜ. AM; gi Zi giZi zi? gizi? 
1.573 0.010 1.0.1074 0.81 —1.2.10 2 | —9.7-10-3 | 1.44-10 4 | 1.16104 
1.580 0.004 1.6107? 5.06 | —5.0-107? | —2.5-10-? | 92.50.1075 | 126.104 
1.582 0.005 2.5.107? 3.24 | —3.0.107? | —9.7-107 | 9.01079 | 2.91-107? 
1.589 0.009 8.1.1075 1.00 | +4.0:107-% | 40.107 1.6.1075 1.6.1075 
1.591 0.011 191.104 | 0.66 | 46.0.10? | 3.9.10? 3.6.107? | 2.37.1075 


n n n 
(Zi)m σ Σο, Y gizi Σοφια 
πι. i=1 i=1 


= 1.583 | = 0.009 — 10.7 = 3.6.10? —3.1107 


used neglecting the terms of second and higher order. 
16.4.2.1 Gauss Error Propagation Law 


1. Problem Formulation 

Suppose it is to determine the numerical value and the error of a quantity z given by the function 
z = f(ri,22,..., αρ) of the independent variables x; (j = 1,2,..., k). The mean value z; obtained 
from nj measured values is considered as realizations of the random variable z;, with variance o;?. The 
question is how the errors of the variables affect the function value f (x1, 9, .. . , 2p). It is assumed that 
the function f (x1, £2, . . . , ορ) is differentiable and its variables are stochastically independent. However 
they may follow any type of distribution with different variances c;?. 

2. Taylor Expansion 

Since the error represents relatively small changes of the independent variables, the function f (21, x», . . . 
£p) can be approximated in the neighborhood of the mean z; by the linear part of its Taylor expansion 
with the coefficients aj, so its error A f is: 


Af = f(xi,22,..., vk) — f (Ti 2,..., Xx), (16.2162) 
of of of τ ὃ} i ; 
Af xd, dx, + Ll—dzra-ee --σι = —dx; = 21, 10.216] 
f if Lm 21 + Bis T2 F dn. Tk Σ ar Σω Tj (16.216b) 
where the partial derivatives O f /Ox; are taken at (T1, οι... , Ep). 


The variance of the function is 


k 
σι a αι σε,” + aj! Ong” Tee aj On, = a3 Oa (16.217) 
j=l 


3. Approximation of the Variance c} 


Since the variances of the independent variables x; are unknown, they can be approximated by the 


variance of their mean, which is determined from the measured values x; (l = 1, 2, . . . , nj) of the single 
variables as follows: 
nj 
> (xj — Tj) 
=9 ii 
OL EU ———— —u (16.218) 


Using these values one can form an approximation of o j?: 


k 
δι = Y ajo. (16.219) 
2Ξ1 


856 16. Probability Theory and Mathematical Statistics 


The formula (16.219) is called the Gauss error propagation law. 
4. Special Cases 


1. Linear Case An often occurring case is the summation of the values of the errors of linearly oc- 
curring error quantities with a; = 1: 


8; = J82 +52 82. (16.220) 


W The pulse length is to be measured at the output of a pulse amplifier of a detector channel for spec- 
trometry of radiation, whose error can be deduced for three components: 

1. statistical energy distribution of the radiation of the part passing through the spectometer with an 
energy Eo, which is characterized by 6g, 

2. statistical interference processes in the detector with σῃοι, 

3. electronic noice of the amplifier of the detector impulse õe. 

The total pulse length has the error 


δ} = Jl, + Pa +. (16.221 


2. Power Rule The variables Ὁ; often occur in the following form: 


z= f(z122,...25) = ax «m ... Lp”. (16.222 
By logarithmic differentiation the relative error is 

d dx dx: dtp 

Ca y n (16.223 

f Tı T2 Tk 


from which by the error propagation law follows for the mean relative error: 


(16.224 


E Suppose the function f (x1, £2, τα) has the form f(z1, £2, £3) = ψπιπο πα”, and the standard devia- 
tions are o,,, σι, and σι. The relative error is then 


ise ha Y ry + (stm). 
f 27, T2 T3 
5. Difference to the Maximum Error 
Declaring the absolute or relative maximal error (16.207), (16.208) means that no smoothing for the 
values of the measurement is been made. For the determination of the relative or absolute error with 
the error propagation laws (16.219) or (16.222), smoothing between the measurement values ; means 


that a confidence interval for a previously given level is to be determined. This procedure is given in 


16.4.1.4, p. 853. 
16.4.2.2 Error Analysis 


The general analysis of error propagation in the calculations of a function y(x;), when quantities of 
higher order are neglected, is called error analysis. In the framework of the theory of error analysis one 
investigates using an algorithm, how an input error Az; affects the value of y(x;). In this context one 
also talks about differential error analysis. 


In numerical mathematics, error analysis means the investigation of the affect of errors of methods, of 
rounding, and of input errors to the final result (see [19.24]). 


17 Dynamical Systems and Chaos 


17.1 Ordinary Differential Equations and Mappings 


17.1.1 Dynamical Systems 
17.1.1.1 Basic Notions 


1. The Notion of Dynamical Systems and Orbits 

A dynamical system is a mathematical object to describe the development of a physical, biological 
or another system from real life depending on time. It is defined by a phase space M, and by a one- 
parameter family of mappings «οί: M — M, where t is the parameter (the time). In the following 
discussion the phase space is often R”, a subset of it, or a metric space. The time parameter t is from 
R (time continuous system) or from Z or from Z, (time discrete system). Furthermore, it is required 
for arbitrary x € M that 

a) e" (x) = x and 

b) e'(q*(x)) = qe **(x) for all t, s. The mapping o! is denoted briefly by v. 

Hereafter, the time set is denoted by I’, hence, T = R, l = R}, I = Z or I = Z}. If I = R, then the 
dynamical system is also called a flow; if I = Z or I 2 Z,, then the dynamical system is discrete. In 
case I'= Rand TI = Z, the properties a) and b) are satisfied for every t € I’, so the inverse mapping 
(pt)! = ο also exists, and these systems are called invertible dynamical systems. 
I 


the dynamical system is not invertible, then ^ '(A) means the pre-image of A with respect to v, 
for an arbitrary set A C M and arbitrary t > 0, i.e., o (4) = {x € M: t(x) € A). If the mapping 
y': M — M is continuous or k times continuously differentiable for every t € I (here M C R”), then 
the dynamical system is called continuous or C*-smooth, respectively. 

For an arbitrary fixed x € M, the mapping t +> v! (x), t € T, defines a motion of the dynamical 
system starting from x at time t = 0. The image y(x) of a motion starting at x is called the orbit (or 
the trajectory) through x, namely y(x) = {y'(x)}ier. Analogously, the positive semiorbit through 2 is 
defined by 4*(r) = {φ'(υ)}ιρο and, if © A Ry, or P z Z,, then the negative semiorbit through x is 
defined by Υ (x) = {φ'(α)}ιςο. 

The orbit y(x) is a steady state (also equilibrium point or stationary point) if y(x) = {x}, and it is 
T-periodic if there exists a T € I’, T > 0, such that eT (x) = q'(x) for all t € P, and T € T is the 
smallest positive number with this property. The number T is called the period. 


2. Flow of a Differential Equation 
Consider the ordinary linear planar differential equation 

i= f(x), (17.1) 
where f: M — R” (vector field) is an r-times continuously differentiable mapping and M = R” or 
M is an open subset of R”. From now on, the Euclidean norm || : || is used in R”, i.e., for arbitrary 

n 

zr € R”, x = (zi,...,2,), its norm is ||x|| = 5 2 If the mapping f is written componentwise 
i= 
f = (fi,---, fn), then (17.1) is a system of n scalar differential equations t; = fi(zi1,..., £n) i = 
1,254. Th: 
The Picard-Lindelóf theorem on the local existence and uniqueness of solutions of differential equations 
and the theorem on the r-times differentiability of solutions with respect to the initial values (see (17.11]) 
guarantee that for every zo € M, there exist a number ¢ > 0, a sphere Bs(xo) = (x: ||x — xo|| < ὃ} in 
M and a mapping v: (--ε,ε) x Bs(ao) — M such that: 
1. y(-,-) is (r + 1)-times continuously differentiable with respect to its first argument (time) and r- 
times continuously differentiable with respect to its second argument (phase variable); 
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2. For every fixed x € Bs(xo), y (^, Ὁ) is the locally unique solution of (17.1) in the time interval (—e, £) 
0 
ot 
Φ(0, α) = x, and every other solution with initial point x at time t = 0 coincides with y(t, x) for all 
small |t]. 


which starts from x at time t = 0, i.e., (t, x) = p(t,x) = f(y(t,x)) holds for every t € (--ε,ε). 


Suppose that every local solution of (17.1) can be extended uniquely to the whole of R. Then there 
exists a mapping y: R x M — M with the following properties: 

1. φ(θ,α) 2r forall «ΕΜ. 

2. p(t + s,x) = v(t, (s, α)) for all t, s € Rand all z € M. 

3. φ(-,:) is continuously differentiable (r + 1) times with respect to its first argument and r times with 
respect to the second one. 

4. For every fixed x € M, (+, x) is a solution of (17.1) on the whole of R. 

Then the C"-smooth flow generated by (17.1) can be defined by φί: = y(t,-). The motions φ(-, x): 
R — M of a flow of (17.1) are called integral curves. 

W The equation 


£=o(y-2), y=rz—y—uz, z=a2y—bz (17.2) 
is called a Lorenz system of convective turbulence (see also 17.2.4.3, p. 887). Here øo > 0, r > 0 and 
b > 0 are parameters. A C^ flow on M = R? corresponds to the Lorenz system . 

3. Discrete Dynamical System 
Consider the difference equation 

Tua = (αι), (17.3) 
which can also be written as an assignment x —> y(x). Here y: M — M is a continuous or r times 
continuously differentiable mapping, where in the second case M C R”. If o is invertible, then (17.3) 
defines an invertible discrete dynamical system through the iteration of y, namely, 

φί = yo---oy, for t>0, g = φ Ίο...οφἹ, for t«0, q? — id. 
—— —— (17.4) 


t times, —t times, 

If y is not invertible, then the mappings v are defined only for t > 0. For the realization of φί see 
(5.74), p. 333. 
W A: The difference equation 

αμ = ax, (1— αι), ei 1, ous (17.5) 
with parameter a € (0, 4] is called a logistic equation. Here M = [0,1], and φ: [0,1] — [0,1] is the 
function φ(α) = απ] — x) for a fixed a. Obviously, ¢ is infinitely many times differentiable, but not 
invertible. Hence (17.5) defines a non-invertible dynamical system. 


W B: The difference equation 
Tur =Y +l- ar, Y =bn t=0,41,..., (17.6) 


with parameters a > 0 and b Z 0 is called a Hénon mapping. The mapping y: R? > R? corresponding 
to (17.6) is defined by y(x, y) = (y+1—az’, bx), it is infinitely many times differentiable and invertible. 
4. Volume Contracting and Volume Preserving Systems 

The invertible dynamical system {%" }er on M C R” is called volume depressing or dissipative (volume 
preserving or conservative), if the relation vol(y'(A)) < vol(A) (vol(y'(A)) = vol(A)) holds for every 
set A C M with a positive n-dimensional volume vol(A) and every t > 0 (t€ I). 

W A: Let yin (17.3) bea C"-diffeomorphism (1.6.: o: M — M is invertible, M C R” open, and 
yt are C"-smooth mappings) and let Do(x) be the Jacobi matrix of e in x € M. The discrete system 
(17.3) is dissipative if | det Dy(x)| < 1 for all x € M, and conservative if | det Dy(x)| = 1 in M. 


17.1 Ordinary Differential Equations and Mappings 859 


W D: For the system (17.6) Dy(x,y) = ες a and so | det Dy(x,y)| = b. Hence, (17.6) is 


dissipative if |b| < 1, and conservative if |b| = 1. 
The Hénon mapping can be decomposed into three mappings (Fig. 17.1): First, the initial domain 
is stretched and bent by the mapping z^ ig = y + 1 — az? in a area- preserving way, then it is 
contracted in the direction of the x’ -axis by a ei i be Ly” = y' (at |b| < 1), and finally it is reflected with 

IH ΜΙ T a". 


respect to the line y" = z" by xz" = y", y" = 


y y y” yuh 
> > > > 
: : 3i a 


Figure 17.1 


17.1.1.2 Invariant Sets 


1. a-andw-Limit Sets, Absorbing Sets 
Let {y'}ier be a dynamical system on M. The set A C M is invariant under (i), if e! (A) = A holds 
for all t € I’, and positively invariant under (q*), if c^ (A) C A holds for all t > 0 from I. 
For every x € M, the w-limit set of the orbit passing through x is the set 

ω(α)-- {νε M:dt, ET, t, ου, φ'"(α) y asn— +0}. (17.7) 
The elements of w(x) are called w-limit points of the orbit. If the dynamical system is invertible, then 
for every x € M, the set 

a(z)—-(y€ M:3t, ΕΤ, t, —oo, φ'"(α) y asn +00} (17.8) 
is called the a-limit set of the orbit passing through x; the elements of o (x) are called the a-limit points 
of the orbit. 
For many systems which are volume decreasing under the flow there exists a bounded set in phase 
space such that every orbit reaching it stays there as time increases. A bounded, open and connected 
set U C M is called absorbing with respect to {y' hier, if e! (U) C U holds for all positive t from T. (U 
is the closure of U.) 
E Consider the system of differential equations 

x y-z(l-a?—-4?), y=aty(—2’?-y’) (17.9a) 
in the plane. Using the polar coordinates x = rcosv, y = rsin, the solution of (17.9a) with initial 
state (ro, vo) at time t = 0 has the form 

r(t,ro) = [1 + (rg? — 1) e”, (t9) — t + Vo. (17.9b) 
This representation of the solution shows that the flow of (17.9a) has a periodic orbit with period 27, 


which can be given in the form ?((1,0)) = ((cost,sint),t € [0, 27]}. The limit sets of an orbit through 
p are: 


(0,0), [»| «1. 
AP) = PI = and wln) = 7((1,0)), p A (0,0), 
ΠΕ... ο τα... 


Every open sphere B, = { (x, y): a? + y? < r°} with r > 1 is an absorbing set for (17.9a). 

2. Stability of Invariant Sets 

Let A be an invariant set of the dynamical system {φί}ιε- defined on (M, p). The set A is called stable, 
if every neighborhood U of A contains another neighborhood U, C U of A such that (01) C U holds 
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for all t > 0. The set A, which is invariant under {y'}, is called asymptotically stable if it is stable and 
the following relations are satisfied: 

νας M 
dist(z, A) < A 


Here, dist(x, A) = inf plz, y). 
yes 


3Δ»0 ) το dist(y'(x), A) — 0 for t— +00. (17.10) 


3. Compact Sets 

Let (M, p) be a metric space. A system {U;}ier of open sets is called an open covering of M if every 
point of M belongs to at least one U;. The metric space (M, p) is called compact if it is possible to 
choose finitely many U;,,..., Ui, from every open covering (U;);ey of M such that M = U U -++ U Ui 
holds. The set K C M is called compact if it is compact as a subspace. 

4. Attractor, Domain of Attraction 

Let {y' her be a dynamical system on (M, p) and A an invariant set for (v). Then W(A) = (x € 
M: w(x) C A} is called the domain of attraction of A. A compact set A C M is called an attractor of 
(oer on M if A is invariant under {ο} and there is an open neighborhood U of A such that w(x) = A 
for almost every (in the sense of Lebesgue measure) x € U. 

B ^-—5((1,0))isanattractor of the flow of (17.9a). Here W (A) = R? \ ((0,0)). For some dynamical 
systems, a more general notion of an attractor makes sense. So, there are invariant sets A which have 
periodic orbits in every neighborhood of A which are not attracted by A, e.g., the Feigenbaum attractor. 
The set A may not be generated by a single limit set w. A compact set A is called an attractor in the 
sense of Milnor of the dynamical system [oer on M if A is invariant under {ρ!} and the domain of 
attraction of A contains a set with positive Lebesgue measure. 


17.1.2 Qualitative Theory of Ordinary Differential Equations 
17.1.2.1 Existence of Flows, Phase Space Structure 


1. Extensibility of Solutions 
Besides the differential equation (17.1), which is called autonomous, there are differential equations 
whose right-hand side depends explicitly on the time and they are called non-autonomous: 


“= f(t,x). (17.11) 
Let f: Rx M — M bea C"-mapping with M C R”. By the new variable z,,, :— t, (17.11) can 
be interpreted as the autonomous differential equation ὦ = f(%n41,2), $441 = 1. The solution of 


(17.11) starting from x at time to is denoted by (-, to, £o). In order to show the global existence of 
the solutions and with this the existence of the flow of (17.1), the following theorems are useful. 
1. Criterion of Wintner and Conti If M — R" in (17.1) and there exists a continuous function 


m 
w: [0, +00) > [1, +00), such that || f(@)|] < w (||a'l]) for all x € R” and f v dr = +00 holds, then 
ο wlr 


every solution of (17.1) can be extended onto the whole of IR... 

W For example, the following functions satisfy the criterion of Wintner and Conti: w(r) = Cr + 1 or 
w(r) = C r|ln r| +1, where C > 0 is a constant. 
2. Extension Principle If a solution of (17.1) stays bounded as time increases, then it can be ex- 
tended to the whole of R4. 
Assumption: In the following discussion the existence of the flow {y' Jer of (17.1) is always assumed. 
2. Phase Portrait 
a) If y(t) is a solution of (17.1), then the function y(t+c) with an arbitrary constant c is also a solution. 


b) Two arbitrary orbits of (17.1) have no common point or they coincide. Hence, the phase space of 
(17.1) is decomposed into disjoint orbits. The decomposition of the phase space into disjoint orbits is 
called a phase portrait. 
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c) Every orbit, different from a steady state, is a regular smooth curve, which can be closed or not 
closed. 

3. Liouville's Theorem 

Let {p'her be the flow of (17.1), D C M C R” be an arbitrary bounded and measurable set, D, : 


= q'(D) and V, :— vol(D;) be the n-dimensional volume of D, (Fig. 17.2a). Then the relation 
1 
a Y= Jj div f (x) dx holds for arbitrary t € R. For n = 3, Liouville's theorem states: 
Di 
1 
x E hi div f (2, v2, 3) da πο daa. (17.12) 
^] 


T 


322 
a) 
Figure 17.2 


Corollary: If div/(r) < 0 in M holds for (17.1), then the flow of (17.1) is volume contracting. If 
divf(x) Ξ 0 in M holds, then the flow of (17.1) is volume preserving. 
W A: For the Lorenz system (17.2), div f (x,y,z) = —(o4- 12-0). Sinceo > 0 and b > 0, div f(x,y,z) < 


d 
0 holds. With Liouville's theorem, T V / / | (σ 4- 1-- 0) dx, dz dg = —(e + 1+ b) V; obviously 
^]. 


holds for any arbitrary bounded and measurable set D C IR?. The solution of the linear differential 
equation V, = —(c + 1 + b) V; is V; = Vo et", so that V, — 0 follows for t + +00. 


, : . 9Η i 
B B: Let C R” xR” bean open subset and H: U > R a C?-function. Then, i; = 7 (x,y), Yi = 
Yi 
OH : ; — ; f 
- 85; ον y) (@=1,2,...,n) iscalleda Hamiltonian differential equation. The function H is called the 
Ti 


Hamiltonian of the system. If f denotes the right-hand side of this differential equation, then obviously 
πο 394 Hr OH 
divf(x,y) = M; Bx (x,y) (x, y)| = 0. Hence, the Hamiltonian differential equations are 
ici LO iOVi T 
volume preserving. 


17.1.2.2 Linear Differential Equations 


1. General Statements 
Let A(t) = [a;;(t)]?;-, be a matrix function on R , where every component a;;: R — R is a continuous 
function, and let b: IR — R” be a continuous vector function on R. Then 


t = A(t)r + b(t) (17.13a) 
is called an inhomogeneous linear first-order differential equation in IR", and 
t = A(t)r (17.13b) 


is the corresponding homogeneous linear first-order differential equation. 

1. Fundamental Theorem for Homogeneous Linear Differential Equations Every solution 
of (17.13a) exists on the whole of R. The set of all solutions of (17.13b) forms an n-dimensional vector 
subspace Ly of the C!-smooth vector functions over R.. 
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2. Fundamental Theorem for Inhomogeneous Linear Differential Equations The set of al 
solutions Ly of (17.13a) is an n-dimensional affine vector subspace of the C'-smooth vector functions 
over R of the form Ly = qo + Ly, where ¢ is an arbitrary solution of (17.13a). 


Let q1,..., Yn be arbitrary solutions of (17.13b) and 9 = [y1,..., Φε] the corresponding solution ma- 
triv. Then satisfies the matrix differential equation Z(t) = A(t)Z(t) on R, where Z € R"". I 
the solutions q,...,q, form a basis of Ly, then ® = [y1,..., Pn] is called the fundamental matrix 


of (17.13b). W(t) = det &(t) is the Wronskian determinant with respect to the solution matrix Φ ο 
(17.13b). The formula of Liouville states that: 


W(t) = rank A(t) W(t) (t € R). (17.13c 


For a solution matrix, either W(t) = 0 on R or W(t) Z 0 for allt € R. The system φι...., n isa 
basis of Ly, if and only if det[p1 (t), - - - , Yn(t)] 4 0 for a t (and so for all t). 


3. Theorem (Variation of Constants Formula) Let Φ be an arbitrary fundamental matrix ο 
(17.13b). Then the solution y of (17.13a) with initial point p at time t = 7 can be represented in the 
form 


y(t) = (t)ó(r) p+ | Φ(ι)Φ(5) ! i(s)ds (te R). (17.13d) 


T 


2. Autonomous Linear Differential Equations 
Consider the differential equation 
$—Am, (17.14) 


where A is a constant matrix of type (n, n). The operator norm (see also 12.5.1.1, p. 677) of a matrix 
A is given by || A|| = max(||Az||, x € R”, ||| € 1}, where for the vectors of R” the Euclidean norm 
is again considered. 

Let A and B be two arbitrary matrices of type (n, n). Then 
a) |A+ Bl € Al + Bl, Ὁ) IAAI = A) All (A € R) 


c) |Azl| € |Al|z| x € R”), — d) ABI € [AMI BI), 
e) || ΑΙ = Vua, where Amax is the greatest eigenvalue of AT A . 


The fundamental matrix with initial value E, at time t = 0 of (17.14) is the matrix-exponential function 


(see 5.3.6.4,4., p. 356) 


At AP ox Att 
„At 1 | E. ax 
e" — Ert Tp = H (17.15) 


with the following properties: 

a) The series of e^t is uniformly convergent with respect to t on an arbitrary compact time interval 
and absolutely convergent for every fixed t; 

b) [ες εἰ“ (t > 0); 


a2 


(e^*) = (e") = Ae"! = eA (te R); 


dt 

d) GE! = et^ es^ (s, te R); 

e) e^ is regular for all t and (e^*)-! = e-^'; 

f) i zd n äre commutative matrices of type (n,n), i.e., AB = BA holds, then B e^ = e^ B and 
e^ F9 = ee”: 


D 


g) if A and B are matrices of type (n, n) and B is regular, then eP4P" = B e^ B^. 
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3. Linear Differential Equations with Periodic Coefficients 

Considered is the homogeneous linear differential equation (17.13b), where A(t) = σος τἶδα T- 
periodic matrix function, i.e., αμ) = aij (t-- T) (Vt € R, à,j =1,...,n). In this case (17.13b) is 
called a linear T-periodic differential equation. Then every fundamental matrix Φ of (17.13b) can be 
written in the form &(t) = G(t)e'^, where G(t) is a smooth, regular T-periodic matrix function and R 
is a constant matrix of type (n, n) (Floquet's theorem). 

Let P(t) be the fundamental matrix of the T-periodic differential equation (17.13b), normed at t = 0, 
i.e., Φ(0) = En, and let (t) = G(t)e'* be a representation of it according to Floquet’s theorem. The 
matrix Φ(Τ) = ef is called the monodromy matrix of (17.13b); the eigenvalues p; of P(T) are the 
multipliers of (17.13b). A number p € C is a multiplier of (17.13b) if and only if there exists a solution 
yp £ 0 of (17.13b) such that y(t + T) = py(t) (t € IR) holds. 


17.1.2.3 Stability Theory 


1. Lyapunov Stability and Orbital Stability 
Consider the non-autonomous differential equation (17.11). The solution c(t, to, £o) of (17.11) is said 
to be stable in the sense of Lyapunov if: 


Vt; >to Ve 20 Ad —Óó(eti) pore o " z 
ο to, zol] < 8 J ᾿ IPE ti ο... (17.16a) 


Vt>t. 


The solution y(t, to, xo) is called asymptotically stable in the sense of Lyapunov, if it is stable and: 


Vt, >to IA — A(ti) Vai€eM)|. ; . 
[1ι — lti, to, v0)|| < me lolt t, σι) — plt tozo) 20 — (7 185 
for t + +00. 

For the autonomous differential equation (17.1), there are other important notions of stability besides 
the Lyapunov stability. The solution y(t, xo) of (17.1) is called orbitally stable (asymptotically orbitally 
stable), if the orbit γ(αο) = {y(t, xo), t € R} is stable (asymptotically stable) as an invariant set. A 
solution of (17.1) which represents an equilibrium point is Lyapunov stable exactly if it is orbitally 
stable. The two types of stability can be different for periodic solutions of (17.1). 
Β Let a flow be given in RË, whose invariant set is the torus Τ᾽, Locally, let the flow be described in 
a rectangular coordinate system by O, = 0, @2 = f2(01), where fo: R — R is a 2π periodic smooth 
function, for which: 


VO;ic R. JU, 'ighborhood of O Vó1, ὃ € U 
du ME C EU HAG) ΜΑ). 

An arbitrary solution satisfying the initial conditions (Θι(0). ©2(0)) can be given on the torus by 

80 260). 60) = 60) + (G0). (t€ R). 
From this representation it can be seen that every solution is orbitally stable but not Lyapunov stable 
(Fig. 17.2b). 
2. Asymptotical Stability Theorem of Lyapunov 
A scalar-valued function V is called positive definite in a neighborhood U of a point p € M C R”, if: 
1.V:UC M > R is continuous. 
2. V(x) > 0 for alla € U \ {p} and V(p) — 0. 
Let U C M be an open subset and V : U — R a continuous function. The function V is called a 
Lyapunov function of (17.1) in U, if V(o(t)) does not increase while for the solution y(t) € U holds. 
Let V: U + R be a Lyapunov function of (17.1) and let V be positive definite in a neighborhood U 
of p. Then p is stable. If the condition V(w(t, xo)) = constant (t > 0) always yields y(t, vo) = p for 
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a solution of (17.1) with (t,x) € U (t > 0), i.e., if the Lyapunov function is constant along a 
complete trajectory, then this trajectory can only be an equilibrium point, and the equilibrium point 
pis also asymptotically stable. 

W The point (0,0) is 2 steady point of the planar differential equation è = y, ý = —x — a?y. The 
function V(x, y) = = a? + y? is positive definite in every neighborhood of (0,0) and for its derivative 


3i V (x(t), y(t)) = —2x(t)?y(t)? < 0 holds along an arbitrary solution for x(t)y(t) 4 0. Hence, (0, 0) is 


asymptotically stable. 

3. Classification and Stability of Steady States 

Let xo be an equilibrium point of (17.1). In the neighborhood of xo the local behavior of the orbits of 
(17.1) can be described under certain assumptions by the variational equation y = D f(xo)y, where 
D f (vo) is the Jacobian matrix of f in zo. If D f (ro) does not have an eigenvalue A; with Re A; = 0, 
then the equilibrium point zo is called hyperbolic. The hyperbolic equilibrium point αρ is of type (m, k) 
if D f (zo) has exactly m eigenvalues with negative real parts and k = n — 1 — m eigenvalues with 
positive real parts. The hyperbolic equilibrium point of type (m, k) is called a sink, ifm — n, a source, 
if k = n, and a saddle point, if m 4 0 and k 4 0 (Fig. 17.3). A sink is asymptotically stable; sources 
and saddles are unstable (theorem on stability in the first approximation). Within the three topological 
basic types of hyperbolic equilibrium points (sink, source, saddle point) further algebraic distinctions 
can be made. Α sink (source) is called a stable node (unstable node) if every eigenvalue of the Jacobian 
matrix is real, and a stable focus (unstable focus) if there are eigenvalues with non-vanishing imaginary 
parts. For n — 3 a classification of saddle points is obtained as saddle nodes and saddle foci. 


Type of equi- 


librina point Sink Source Saddle point 
e 9 9 

Eigenvalues of the e " ο 

Jacobian matrix ô . è 


Phase portrait 


Figure 17.3 
4. Stability of Periodic Orbits 
Let y(t, £o) be a T-periodic solution of (17.1) and 4(xo) = {p(t,20), t € [0.Τ]} its orbit. Under 
certain assumptions, the phase portrait in a neighborhood of y(xo) can be described by the variational 
equation ἡ = D f(p(t,xo))y. Since A(t) = D f (y(t, xo)) is a T-periodic continuous matrix function 
of type (n, n), it follows from the Floquet theorem (see p. 863) that the fundamental matrix Py, (t) of 
the variational equation can be written in the form ®,,(t) = G(t)eP', where G is a T-periodic regular 
smooth matrix function with G(0) = E,, and R represents a constant matrix of type (n, n) which is not 
uniquely given. The matrix ρα = eT is called the monodromy matrix of the periodic orbit (xo), 
and the eigenvalues pi, . . . , Pn of e*? are called multipliers of the periodic orbit y(xo). If the orbit (xo) 
is represented by another solution y(t, x1), i.e., if (xo) = (21), then the multipliers of (29) and γ(σι) 
coincide. One of the multipliers of a periodic orbit is always equal to one (Andronov-Witt theorem). 
Let p1,.... pna; Pn = 1 be the multipliers of the periodic orbit ?(xo) and let 4,,(T) be the monodromy 
matrix of γαρ). Then 


T 
2» pj = Tr®,,(T) and I pj = det Pa, (T) = exp (/ TrD fea) 


j=l 
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= exp (a div f (y(t, xo)) i) : (17.17) 


Hence, if n = 2, then py = 1 and p, = exp (E div f (y(t, xo)) dt). 


W Let y(t, (1,0)) = (cost, sin t) bea27-periodic solution of (17.9a). The matrix A(t) of the variational 
equation with respect to this solution is 


—2cos?t —1-— sin2t 
αι = Ρε) = (1. aat) 


The fundamental matrix 4(, 9)(t) normed at t = 0 is given by 


"m e" cost —sint| (cost —sintV (e? 0 

αος) = eC sit cost]  Vsint cost 0 1)’ 

where the last product is a Floquet representation of 8q o)(t). Thus, ρι = €^ and p» = 1. The multi- 
pliers can be determined without the Floquet representation. For system (17.9a) div f (x, y) = 2—4z?— 
4y? holds, and hence divf(cost,sint) = —2. According to the formula above, pı = exp ( fy" —2dt) = 
exp(—47). 

5. Classification of Periodic Orbits 

If the periodic orbit y of (17.1) has no further multiplier on the complex unit circle besides p, = 1, then 
η is called hyperbolic. The hyperbolic periodic orbit is of type (m, k) if there are m multipliers inside 
and k = n — 1 — m multipliers outside the unit circle. If πι > 0 and k > 0, then the periodic orbit of 
type (m, k) is called a saddle point. 

According to the Andronov-Witt theorem, a hyperbolic periodic orbit y of (17.1) of type (n — 1,0) is 
asymptotically stable. Hyperbolic periodic orbits of type (m, k) with k > 0 are unstable. 

W A: A periodic orbit y = {(t),t € [0, T]) in the plane with multipliers p; and p» = 1 is asymptoti- 
cally stable if [ρι] < 1, i.e., if f div f(p(t)) dt < 0. 

W B: If there is a further multiplier besides p, = 1 on the complex unit circle, then the Andronov— 
Witt theorem cannot be applied. The information about the multipliers is not sufficient for the stability 
analysis of the periodic orbit. 
E C: As an example, let the planar system ¢ = —y + x f(x? +y?), ý = z-yf(a? 4 y?) be given 
by the smooth function f: (0, +00) > R, which additionally satisfies the properties f(1) = f'(1) = 0 
and f(r)(r — 1) < 0 for all r Z 1,r > 0. Obviously, y(t) = (cost, sin t) is a 27-periodic solution of the 
system and 


aad 


$t) = ή ο... 10 is the Floquet representation of the fundamental matrix. It follows 
; sint cost / VO 

that pı = p2 = 1. The use of polar coordinates results in the system 7 = r f(r), ) = 1. This repre- 
sentation yields that the periodic orbit y((1,0)) is asymptotically stable. 

6. Properties of Limit Sets, Limit Cycles 

The a- and w-limit sets defined in 17.1.1.2, p. 859, have with respect to the flow of the differential 
equation (17.1) with M C R" the following properties. Let x € M be an arbitrary point. Then: 

a) The sets a(x) and w(x) are closed. 

b) If y*(z) (respectively y7 (x)) is bounded, then w(x) 4 0 (respectively a(x) # Ø) holds. Furthermore, 
w(x) (respectively a(x)) are in this case invariant under the flow (17.1) and connected. 

W If for instance, y+ (x) is not bounded, then w(x) is not necessarily connected (Fig. 17.4a). 

For a planar autonomous differential equation (17.1), (i.e., M C R?) the Poincaré-Bendixson theorem 
is valid. 

Poincaré-Bendixson Theorem: Let y(-,p) be a non-periodic solution of (17.1), for which Υ (p) is 
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b) c) 


Figure 17.4 


a) 


bounded. If w(p) contains no equilibrium point of (17.1), then w(p) is a periodic orbit of (17.1). 
Hence, for autonomous differential equations in the plane, attractors more complicated than an equi- 
librium point or a periodic orbit are not possible. 

A periodic orbit y of (17.1) is called a limit cycle, if there exists an x € y such that either y C w(x) or 
y C a(x) holds. A limit cycle is called a stable limit cycle if there exists a neighborhood U of y such 
that y = w(x) holds for all x € U, and an unstable limit cycle if there exists a neighborhood U of ^ such 
that y = a(x) holds for all x € U. 

W A: For the flow of (17.9a), the property y = w(p) for all p - (0,0) is valid for the periodic orbit 
y = {(cost,sint), t € [0,27)}. Hence, U = R?\{(0,0)} is a neighborhood of y such that with it, y is 
a stable limit cycle (Fig. 17.4b). 

W B: In contrast, for the linear differential equation « = —y, ἢ = x, the orbit y = ((cost, sint), t € 
(0, 2π]} is a periodic orbit, but not a limit cycle (Fig. 17.4c). 

7. m-Dimensional Embedded Tori as Invariant Sets 

A differential equation (17.1) can have an m-dimensional torus as an invariant set. An m-dimensional 
torus T" embedded into the phase space M C R” is defined by a differentiable mapping g: IR" > R”, 
which is supposed to be 27-periodic in every coordinate O; as a function (O;,...,Om) > g(€1....,0,,). 
W In simple cases, the motion of the system (17.1) on the torus can be described in a rightangular co- 
ordinate system by the differential equations Θ, = ui (i = 1,2,..., m). The solution of this system 
with initial values (Θι(0)...., Om(0)) at timet = 0 is O;(t) = wit + O;(0) (¢=1,2,...,m;t € R). 
A continuous function f: R — R” is called quasiperiodic if f has a representation of the form f(t) = 
g (wit, wot, ..., Wnt), where g is also a differentiable function as above, which is 27-periodic in every 
component, and the frequencies w; are incommensurable, i.e., there are no such integers n; with 


m 
E n? > 0 for which niu ++ + wy = 0 holds. 
i 


17.1.2.4 Invariant Manifolds 
1. Definition, Separatrix Surfaces 
Let y be a hyperbolic equilibrium point or a hyperbolic periodic orbit of (17.1). The stable manifold 
W?*(«) (respectively unstable manifold W" ()) of y is the set of all points of the phase space such tha 
the orbits tending to y ast + --oo (respectively t — --οο) pass through these points: 

W*(q) = (x € M: w(x) = y) and W"(*) = (x € M: a(x) = 4}. (17.18 
Stable and unstable manifolds are also called separatrix surfaces. 
W In the plane, the differential equation 


ο pyoyt2 (17.19a 
is considered. The solution of (17.19a) with initial state (ao, yo) at time t = 0 is explicitly given by 
no 
p(t, xo, yo) = (e xo, eyo + τ ei — e ?t)), (17.19b 
For the stable and unstable manifolds of the equilibrium point (0,0) of (17.19a) one gets: 
2 
To 


WP?((0, 0)) = (xo, Yo): im, p(t, xo, Vo) = (0, 0)} = ro; yo): Yo + 3 0), 
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W"((0,0)) = {(ωο, yo): lim g(t, £o, yo) = (0, 0)} = (o; yo): zo = 0, yo € R} (Fig. 17.5a). 


YL-W ((0,0)) 


x 
| 7 | [N Ww’ (0/0) M 
a 


b) 


Figure 17.5 


Let M and N be two smooth surfaces in R”, and let L, M and L} N be the corresponding tangent planes 
to M and N through x. The surfaces M and N are transversal to each other if for alla € M A N the 
following relation holds: 

dim L, M + dim L,N — n = dim (L,M N L,N). 
W For the section represented in Fig. 17.5b one can see that dim L, M = 2, dim L,N = 1 and 
dim(L, M N L,N) = 0 holds. Hence, the section represented in Fig. 17.5b is transversal. 
2. Theorem of Hadamard and Perron 
Important properties of separatrix surfaces are given by the Theorem of Hadamard and Perron: 
Let y be a hyperbolic equilibrium point or a hyperbolic periodic orbit of (17.1). 
a) The manifolds W*(7) and W" (7) are generalized C"-surfaces, which locally look like C’-smooth ele- 
mentary surfaces. Every orbit of (17.1), which does not tend to y for t — +00 or t + —oo, respectively, 
leaves a sufficiently small neighborhood of y for t + --oo or t + —oo, respectively. 
b) If y = τρ is an equilibrium point of type (m, k), then W*(rg) and W" (xo) are surfaces of dimension 
m and k, respectively. The surfaces W*(xg) and W” (xg) are tangent at xo to the stable vector subspace 


E* = (y € R^: ο αν — 0 for t 2 +00} of equation ὦ = D f(xo)y (17.202) 


and the unstable vector subspace 


E" = (y € R”: cP $y 0 for t+ --οο} of equation ἡ = D f(ro)y, respectively. (17.20b) 
c) If y is a hyperbolic periodic orbit of type (m, k), then W*(7) and W"(») are surfaces of dimension 
m + 1 and k + 1, respectively, and they intersect each other transversally along y (Fig. 17.62). 
W A: To determine a local stable manifold through the steady state (0, 0) of the differential equation 
(17.19a) here it is supposed that W5,.((0,0)) has the following form: 
W5-((0,0)) = ((z.y): y = h(x), |x| < A, h: (CA, Δ) > R differentiable}. 
Let (x(t), y(t)) be a solution of (17.19a) lying in W,((0,0)). Based on the invariance, for times s near 
to t y(s) = h(x(s)) holds. By differentiation and representation of ὦ and y from the system (17.19a) 
one gets the initial value problem h'(x) (~x) = h(x) + x°, h(0) = 0 for the unknown function h(x). If 


; E s 5 2 93 , 03 : i 
looking for the solution in the form of a series expansion h(x) = σα) + Pil +--+, where h’(0) = 015 
"—- l , - 3 
taken under consideration, then one gets by comparing the coefficients a; = D and a, = 0 for k > 3. 


W B: For the system 

E y +r(l -xr -— y’), y=rty(l—2—-y’), 2— az (17.21) 
with a parameter a > 0, the orbit y = { (cost, sin t, 0), t € [0, 2π]} is a periodic orbit with multipliers 
p — €75, py — e?" and p; = 1. 
In cylindrical coordinates x = r cos 0, y = rsin ð, z = z, with initial values (ro, Vo, zo) at time t = 0, 
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the solution of (17.21) has the representation (r(t, ro), V(t, Jo), e? zo), where r(t, ro) and V(t, Jo) is the 
solution of (17.9a) in polar coordinates. Consequently, 

W*(y) = f(,y,2): z = 0} V1(0,0,0)) and W"(y) = {(2,y,2): α’ -- y? — 1). (cylinder). 
Both separatrix surfaces are shown in Fig. 17.6b. 


Figure 17.6 


3. Local Phase Portraits Near Steady States for n — 3 

Considered is the differential equation (17.1) with the hyperbolic equilibrium point 0 for n — 3. Set 
A= D f(0) and let det[AE — Αἱ] = A? + pA? + qÀ + r be the characteristic polynomial of A. With the 
notation 6 = pq — r and A = —p?q? + Ap?r + Ag? — 18pqr + 27r? (discriminant of the characteristic 
polynomial), the different equilibrium point types are characterized in Table 17.1. 

4. Homoclinic and Heteroclinic Orbits 

Suppose γι and γο are two hyperbolic equilibrium points or periodic orbits of (17.1). If the separatrix 
surfaces W*(y,) and W” (q2) intersect each other, then the intersection consists of complete orbits. 
For two equilibrium points or periodic orbits, the orbit y C W*(94) n W"(^9) is called heteroclinic if 
^n Æ ^s (Fig. 17.7a), and homoclinicif γι = y2. Homoclinic orbits of equilibrium points are also called 
separatrix loops (Fig. 17.7b). 


Figure 17.7 


E Consider the Lorenz system (17.2) with fixed parameters ø = 10, b = 8/3 and with variable r. The 
equilibrium point (0, 0, 0) of (17.2) is a saddle for 1 < r < 13.926..., which is characterized by a two- 
dimensional stable manifold W* and a one-dimensional unstable manifold W”. If r = 13.926..., then 
there are two separatrix loops at (0,0,0), i.e., as t + +00 branches of the unstable manifold return 
(over the stable manifold) to the origin (see [17.9]). 


17.1.2.5 Poincaré Mapping 


1. Poincaré Mapping for Autonomous Differential Equations 

Let y = {y(t, xo), t € (0, T]) be a T-periodic orbit of (17.1) and £ a (n — 1)-dimensional smooth 
hypersurface, which intersects the orbit ^ transversally in τρ (Fig. 17.8a). Then, there is a neighbor- 
hood U of αρ and a smooth function τ: U — R such that τ(αρ) = T and (T(x), x) € X for all x € U. 
The mapping P: UN} > Ð with P(x) = φ(τ(α). x) is called the Poincaré mapping of y at xo. If 
the right-hand side f of (17.1) is r times continuously differentiable, then P is also r times continu- 
ously differentiable . The eigenvalues of the Jacobi matrix DP(xọ) are the multipliers pj, ..., 951 of 
the periodic orbit. They do not depend on the choice of 20 on y and on the choice of the transversal 
surface. 
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Table 17.1 Steady state types in three-dimensional phase spaces 


Parameter A Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W* and μυ 
stable ImA; = 0 UMS ο τος. 
ies th A<0 node A <0,9 1,58 dim W* = 3, dinW" = 0 
450 stable ReA, 2 < 0 
iis focus àz «0 
WY 
A <0: A> 0: 
Parameter A Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W€ and μυ 
unstable ImA; = 0 εν aa 
6<0;r<0, ο, node λ > 0, 7 =1,2,3 dim W° = 0, dim W" = 3 
q>0 unstable Ἡολιο > 0 
sr focus à4»0 
WY 
A <0: A> 0: 
Parameter A Type of equili- | Roots of the charac- | Dimension of 
domain brium point teristic polynomial | W* and μυ 
E = saddle ImA; = 0 ; "T . κα 
6>0;r<0, | A<0 node Mio <0, M20 dim W* = 2, dim W” = 1 
q<0or z - - 
r<0,q>0 saddle eà12 < 
ud focus »3 >0 
I 
UT EAR DT 
Parameter ^ Type of equili- | Roots ofthe charac- | Dimension of 
domain brium point teristic polynomial | W* and W" 
ΝΗ saddle ImA; = 0 -— ση 
: - na >0, | A<0 node "ΠΡ dim W* = 1, dim W” = 2 
r>0,q>0 | axg| saddle Βολι» > 0 
focus A4 <0 
A <0: A> 0: 


E 


870 17. Dynamical Systems and Chaos 


A system (17.3) in M = U can be connected with the Poincaré mapping, which makes sense until the 
iterates stay in U. The periodic orbits of (17.1) correspond to the equilibrium points of this discrete 
system, and the stability of these equilibrium points corresponds to the stability of the periodic orbits 
of (17.1). 


Ey 


9693) ^ "E 
a) b) steady state periodig 


Figure 17.8 


H Considered is for the system (17.9a) the transversal hyperplanes 

X = {(7,9): r > 0,0 = vo} 
in polar coordinate form. For these planes U = Ð can be chosen. Obviously, T(r) = 27 (Vr > 0) and 
so 

P(r) = [+ (r? -= κα 
where the solution representation of (17.9a) is used. It is also valid that P(X) = £X, P(1) = 1 and 
Pi) =e <4, 
2. Poincaré Mapping for Non-Autonomous Time-Periodic Differential Equations 
A non-autonomous differential equation (17.11), whose right-hand side f has period T with respect 
to t, i.e., for which f(t + T, z) = f(t,z) (Vt € R, Vx € M) holds, is interpreted as an autonomous 
differential equation t = f(s,a), ὁ = 1 with cylindrical phase space M x (s mod T}. Let sọ € (s mod 
T} be arbitrary. Then, > = M x {so} is a transversal plane (Fig. 17.8b). The Poincaré mapping is 
given globally as P: £ + È over x +> q(so + T, so, xo), where y(t, so, £o) is the solution of (17.11) 
with the initial state xo at time sp. 


17.1.2.6 Topological Equivalence of Differential Equations 


1. Definition 
Suppose, besides (17.1) with the corresponding flow (v');en, that a further autonomous differential 
equation 

d = g(x), (17.22) 
is given, where g: N + R” is a C’-mapping on the open set N C R”. Of course, the flow (uen of 
(17.22) should exist. 
The differential equations (17.1) and (17.22) (or their flows) are called topologically equivalent if there 
exists a homeomorphism h: M — N (i.e., h is bijective, h and h^! are continuous), which transforms 
each orbit of (17.1) to an orbit of (17.22) preserving the orientation, but not necessarily preserving 
the parametrization. The systems (17.1) and (17.22) are topologically equivalent if there also exists a 
continuous mapping τ: R x M — R , besides the homeomorphism h: M — N, such that 7 is strictly 
monotonically increasing at every fixed x € M, maps R onto R, with 7(0,x) = 0 for all x € M and 
satisfies the relation π(φ!(α)) = v7 ^? (h(x)) for alla € M and t € R. 
In the case of topological equivalence, the equilibrium points of (17.1) go over into steady states of 
(17.22) and periodic orbits of (17.1) go over into periodic orbits of (17.22), where the periods are not 
necessarily coincident. Hence, if two systems (17.1) and (17.22) are topologically equivalent, then the 
topological structure of the decomposition of the phase spaces into orbits is the same. If two systems 
(17.1) and (17.22) are topologically equivalent with the homeomorphism h: M — Ν and if h preserves 
the parametrization, i.e., h(y'(x)) = v'(h(x)) holds for every t, x, then (17.1) and (17.22) are called 
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topologically conjugate. 

Topological equivalence or conjugation can also refer to subsets of the phase spaces M and N. Sup- 
pose, e.g., (17.1) is defined on U, C M and (17.22) on Uz C N. Then (17.1) on U; is called topologically 
equivalent to (17.22) on [ο if there exists a homeomorphism A: U; — Uz which transforms the inter- 
section of the orbits of (17.1) with U; into the intersection of the orbits of (17.22) with Uz preserving 
the orientation. 

W A: Homeomorphisms for (17.1) and (17.22) are mappings where, e.g., stretching and shrinking of 
the orbits are allowed; cutting and closing are not. 

The flows corresponding to phase portraits of Fig. 17.9a and Fig. 17.9b are topologically equivalent; 
the flows shown in Fig. 17.9a and Fig. 17.9c are not. 


Figure 17.9 
W B: Consider the two linear planar differential equations (see [17.11]) 
i = Ar and i = Br with A = ( 7 , E and B — [o 5. ) The phase portraits of these systems 
close to (0,0) are shown in Fig. 17.10a and Fig. 17.10b. 


, : 1 = 
The homeomorphism h: R? — R? with h(x) = Ra, where R = — Ia v and the function 


V2 1 1 
; 1 
T: Rx R? > R with r(t, x) = -- t transform the orbits of the first system into the orbits of the second 
3 À 


one. Hence, the two systems are topologically equivalent. 


: M. 


AN S ὙΓ᾽ 


Figure 17.10 


2. Theorem of Grobman and Hartman 
Let p be a hyperbolic equilibrium point of (17.1). Then, in a neighborhood of p the differential equation 
(17.1) is topologically equivalent to its linearization y = D f(p)y. 


17.1.3 Discrete Dynamical Systems 
17.1.3.1 Steady States, Periodic Orbits and Limit Sets 


1. Types of Steady State Points 

Let xo be an equilibrium point of (17.3) with M C R”. The local behavior of the iteration (17.3) 
close to zo is given, under certain assumptions, by the variational equation yi41 = Ῥφ(πο)νι, t € D. 1f 
Dq(ro) has no eigenvalue A; with |A;| = 1, then the steady state point xo, analogously to the differential 
equation case, is called hyperbolic. The hyperbolic equilibrium point xo is of type (m, k) if D f (xo) has 
exactly m eigenvalues inside and k = n — m eigenvalues outside the complex unit circle. The hyperbolic 
equilibrium point of type (m, k) is called a sink for m — n, a source for k — n and a saddle point for 
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m > Qand k > 0. A sink is asymptotically stable; sources and saddles are unstable (theorem on stability 
in the first approximation for discrete systems). 

2. Periodic Orbits 

Let γ(αο) = {y* (xo), k = 0,---, T — 1} be a T-periodic orbit (T > 2) of (17.3). If xo is a hyperbolic 
equilibrium point of the mapping y”, then y(xo) is called hyperbolic. 

The matrix Dy" (zo) = De(qT- (αρ) «+» Dip(xo) is called the monodromy matriz, the eigenvalues p; 
of Dy" (xo) are the multipliers of γ(το). 

If all multipliers p; of (xo) have an absolute value less than one, then the periodic orbit y(xo) is asymp- 
totically stable. 

3. Properties of w -Limit Set 

Every w -limit set w(x) of (17.3) with M = R” is closed, and w(y(x)) = w(x). If the semiorbit y+ (x) is 
bounded, then w(x) # Ø and w(x) is invariant under y. Analogous properties are valid for a-limit sets. 
W Suppose the difference equation z,,4 = —2,, t = 0,£1,---, is given on R with y(x) = —r. Ob- 
viously, the relations w(1) = {1,—1}, w(y(1)) = ω(--1) = w(1), and φ(ω(1)) = w(1) are satisfied 
for r = 1. It is to be mentioned that w(1) is not connected, is different from the case of differential 
equations. 


17.1.3.2 Invariant Manifolds 


1. Separatrix Surfaces 

Let αρ be an equilibrium point of (17.3). Then W*(29) = (y € M: ¢'(y) — xo for i + +00} is called 
a stable manifold and W” (xo) = (y € M: (y) — xo for i > —oo} an unstable manifold of o. Stable 
and unstable manifolds are also called separatrix surfaces. 


2. Theorem of Hadamard and Perron 
The theorem of Hadamard and Perron describes the properties of separatrix surfaces for discrete sys- 
tems in M C R”: 
If zo is a hyperbolic equilibrium point of (17.3) of type (m, k), then W*(xg) and W” (xo) are generalized 
C"-smooth surfaces of dimension m and k, respectively, which locally look like C’-smooth elementary 
surfaces. The orbits of (17.3), which do not tend to 20 for i — --oo or i — --οο, leave a sufficiently 
small neighborhood of xo for i — Ἔσο or i — —oo, respectively. The surfaces W?(zrg) and W” (xo) 
are tangent at xo to the stable vector subspace E° = (y € R” : [De(xo)! y — 0 for i + --οο} of 
Uix1 = De(vo)y; and the unstable vector subspace E" = (y € R” : [Do(xo)|/'y + 0 for i + --οο}, 
respectively. 
W Considered is the following time discrete dynamical system from the family of Hénon mappings: 
tipi =? Ἔνι--ο, Yar =T i € Z. (17.23) 


Both hyperbolic equilibrium points of (17.23) are P, = (V2, V2) and P; = (-V2, — V2). 
Determination of the local stable and unstable manifolds of Pj: The variable transformation v; = 
& + V2, y; = yi + V2 transforms system (17.23) into the system £4 = €? + 2/2 & +m, mu = € 
with the equilibrium point (0,0). The eigenvectors αι = (V2 + νὸ, 1) and ay = (V2 — V3, 1) of the 
Jacobian matrix D f((0,0)) correspond to the eigenvalues λι» = V2 + V3, so Ε΄ = (tas,t € R} 
and E" = {ta,,t € R}. Supposing that Wz.((0,0)) = {(6.η): η = 8(€). |E] < A,B: (CA, A) > 
R differentiable}, one looks for β in the form of a power series B(€) = (V3 — V2) €+ kê? +--+. From 
(£i, ni) € Wi, ((0,0)), (S41. M41) € Wis, ((0,0)) follows. This leads to an equation for the coefficients 
of the decomposition of β, where k « 0. The theoretical shape of the stable and unstable manifolds is 
shown in Fig. 17.11a (see [17.12]). 

3. Transverse Homoclinic Points 

The separatrix surfaces W*(xo) and W" (xo) of a hyperbolic equilibrium point αρ of (17.3) can intersect 
each other. If the intersection W*(xo) N W” (xo) is transversal, then every point y € W*(xg) O W"(xg) 
is called a transversal homoclinic point. 
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Figure 17.11 


Fact: Ify is a transversal homoclinic point, then the orbit {y'(y)} of the invertible system (17.3) consists 
only of transversal homoclinic points (Fig. 17.11b). 


17.1.3.3 Topological Conjugation of Discrete Systems 


1. Definition 
Suppose, besides (17.3), a further discrete system 

Tipi = Y(t) (17.24) 
with Y: N — N is given, where N C R” is an arbitrary set and ᾧ is continuous (M and N can be 
general metric spaces). The discrete systems (17.3) and (17.24) (or the mappings y and v) are called 
topologically conjugate if there exists a homeomorphism h: M — N such that p = ht opoh. If (17.3) 
and (17.24) are topologically conjugated, then the homeomorphism h transforms the orbits of (17.3) 
into orbits of (17.24). 
2. Theorem of Grobman and Hartman 
If y in (17.3) is a diffeomorphism e: R” — R”, and τρ a hyperbolic equilibrium point of (17.3), then 
in a neighborhood of xo (17.3) is topologically conjugate to the linearization y41 = Dy(xo) ye. 


17.1.4 Structural Stability (Robustness) 


17.1.4.1 Structurally Stable Differential Equations 


1. Definition 

The differential equation (17.1), i.e., the vector field f: M — R”, is called structurally stable or robust, 
if small perturbations of f result in topologically equivalent differential equations. The precise defini- 
tion of robustness requires the notion of distance between two vector fields defined on M. The further 
investigations are restricted to smooth vector fields on M, which have a common open connected ab- 
sorbing set U C M. Let the boundary QU of U be a smooth (n — 1)-dimensional hypersurface and 
suppose that it can be represented as OU = {a € R”: h(x) = 0}, where h: R” > R is a C!-function 
with grad h(x) 4 0 in a neighborhood of QU. Let X! (U) be the metric space of all smooth vector fields 
on M with the C! metric 


p(fig) = sup || F(x) — σα) + sup || D f(z) — D g(£)|l. (17.25 


(In the first term of the right-hand side || - || means the Euclidean vector norm, in the second one the 
operator norm.) The smooth vector fields f intersecting transversally the boundary OU in the direction 
U, i.e., for which grad h(x)" f(x) Z 0, (x € QU) and t(x) € U (x € OU, t > 0) hold, form the se 
X! (U) C X'(U). The vector field f € X} (U) is called structurally stable if there is a ô > 0 such tha: 
every other vector field g € X} (U) with p(f, g) < ὃ is topologically equivalent to f. 

W Consider the planar differential equation g (-, a) 


x y-z(a—a?—3?), ἡ----γ(α--α)-- y’) (17.26 
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with parameter a, where |a| < 1. The differential equation g belongs, e.g., to X} (U) with U = {(α, y): 


x? +y? < 2) (Fig. 17.12a). Obviously, p(g (,0), g (,0)) = la| (V2 + 1). The vector field g (-, 0) is 
structurally unstable; there exist vector fields arbitrarily close to g(-,0), which are not topologically 


equivalent to g (-,0) (Fig. 17.12b,c). This is clear, if the polar coordinate representation ἡ = —r? + 
ar, ὃ = 1 of (17.26) is considered. For o > 0 there always exists a stable limit cycle r = γα. 
y y, y 
oU 
X X X 
U 
a) b) α-0 c) a0 


Figure 17.12 


2. Structurally Stable Systems in the Plane 

Suppose the planar differential equation (17.1) with f € X} (U) is structurally stable. Then: 

a) (17.1) has only a finite number of equilibrium points and periodic orbits. 

b) All w-limit sets w(x) with x € U of (17.1) consist of equilibrium points and periodic orbits only. 
Theorem of Andronov and Pontryagin: The planar differential equation (17.1) with f € X} (U) 
is structurally stable if and only if: 

a) All equilibrium points and periodic orbits in U are hyperbolic. 

b) There are no separatrices, i.e., no heteroclinic or homoclinic orbits, coming from a saddle and tending 
to a saddle point. 


17.1.4.2 Structurally Stable Time Discrete Systems 

In the case of time discrete systems (17.3), i.e., of mappings y : M — M, let U C M C R” be 
a bounded, open, and connected set with a smooth boundary. Let Diff (U) be the metric space of 
all diffeomorphisms on M with the corresponding U defined C'-metric. Suppose the set Diff! (U) C 
Diff(U) consists of the diffeomorphisms ¢, for which e(U) C U is valid. The mapping e € Diff (U) 
(and the corresponding dynamical system (17.3)) is called structurally stable if there exists a ὃ > 0 such 
that every other mapping v € Diff τ ) with p(y, v) < ὃ is topologically conjugate to y. 


17.1.4.3 Generic Properties 


1. Definition 
A property of elements of a metric space (M, p) is called generic (or typical) if the set of the elements 
B of M with this property form a set of the second Baire category, i.e., it can be represented as B = 


(| Bm, where every set B,, is open and dense in M. 
m=1,2.;. 


WB A: The sets R and I C R (irrational numbers) are sets of second Baire category, but Q C R is not. 
W B: Density alone as a property of “typical” is not enough: Q C R and I C Rare both dense, but 
they cannot be typical at the same time. 

W C: There is no connection between the Lebesgue measure A (see 12.9.1, 2., p. 694) of a set from R 


1 1 
and the Baire category of this set. Theset B= fN B,withB, = U (o. οι) απ + i x) , where 
k=1,2,... n>0 ; 2^ C 


Q = {a,}%p represents the rational numbers, is a set of second Baire category (see [17.5], [17.10]). 
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1 
On the other hand, since By D Bry1 and A( Bx) < +00 also A(B) = jim (Br) < jim 0 
πάν : 


2 

coe k 1— 1/2 
holds. 
2. Generic Properties of Planar Systems, Hamiltonian Systems 
For planar differential equations the set of all structurally stable systems from xl (U) is open and dense 
in XL (U). Hence, structurally stable systems are typical for the plane. It is also typical that every orbit 
of a planar system from X1 (U) for increasing time tends to one of a finite number of equilibrium points 
and periodic orbits. Quasiperiodic orbits are not typical. Under certain assumptions, in the case of 
Hamiltonian systems, the quasiperiodic orbits of differential equation are preserved in the case of small 
perturbations. Hence, Hamiltonian systems are not typical systems. A $ 

: : : Ho : H, 
Bl Given in R* a Hamiltonian system in action-angle variables jı = 0, j2 = 0, 6, = 5 2 Θα = E ae 

JA J2 
where the Hamiltonian Ho(ji, j2) is analytical. Obviously, this system has the solutions j; = c, jo = 
C9, O1 = wit + c3, Og = wat + c4 with constants c,,...,C4, where w, and wg can depend on c, and co. 
The relation (j1, j2) = (οι, c2) defines an invariant torus 7?. Consider now the perturbed Hamiltonian 
Ηο({1, j2) + εΠι (ji, j2, O1, O2) 

instead of Ho, where H; is analytical and £ > 0 is a small parameter. 
The Kolmogorov-Arnold-Moser theorem (KAM theorem) says in this case that if Ho is non-degenerate, 


. 0 : δ : f ; 
i.e., det ( 95 # 0, then in the perturbed Hamiltonian system most of the invariant non-resonant 
Jk 


tori will not vanish for sufficiently small € > 0 but will be only slightly deformed. “Most of the tori” 
means that the Lebesgue measure of the complement set with respect to the tori tends to zero if € tends 
to 0. A torus, defined as above and characterized by w, and wy, is called non-resonant if there exists a 
ω [64 
constant c > 0 such that the inequality ELLE > τας holds for all positive integers p and q. 
Ua 4 q 

3. Non-Wandering Points, Morse-Smale Systems 
Let (t! bier be a dynamical system on the n-dimensional compact orientable manifold M. The point 
p € M is called non-wandering with respect to {φ!} if 

VT»0 Jt, στ. v(U)nU,z0 (17.27) 
holds for an arbitrary neighborhood U, C M of p. 


W Steady states and periodic orbits consist only of non-wandering points. 

The set Ω(φ!) of all non-wandering points of the dynamical systems generated by (17.1) is closed, in- 
variant under {y'} and contains all periodic orbits and all w -limit sets of points from M. 

The dynamical system (v! Τε on M generated by a smooth vector field is called a Morse-Smale system 
if the following conditions are fulfilled: 

1. The system has finitely many equilibrium points and periodic orbits and they are all hyperbolic. 

2. All stable and unstable manifolds of equilibrium points and periodic orbits are transversal to each 
other. 

3. The set of all non-wandering points consists only of equilibrium points and periodic orbits. 
Theorem of Palis and Smale: Morse-Smale systems are structurally stable. 

The converse statement of the theorem of Palis and Smale is not true: In the case of n > 3, there exist 
structurally stable systems with infinitely many periodic orbits. 

For n > 3, structurally stable systems are not typical. 
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17.2 Quantitative Description of Attractors 
17.2.1 Probability Measures on Attractors 


17.2.1.1 Invariant Measure 


1. Definition, Measure Concentrated on the Attractor 
Let {y'}ier be a dynamical system on (M, p). Let B be the o-algebra of Borel sets on M (12.9.1, 2., 
p. 694) and let μι B — [0, +00] be a measure on B. Me mapping φί is supposed to be u measurable. 
The measure p is called invariant under (y bier if u(y *(A)) = μ(Α) holds for all A € B and t > 0. 
If the dynamical system {y'}yer is invertible, then the property of the measure being invariant under 
the dynamical system can be expressed as µ(φί(4)) = (A) (A € B, t > 0). The measure μ. is said 
to be concentrated on the Borel set A C M if w(M \ A) = 0. If A is also an attractor of {y'}ier and u 
is an invariant measure under {y'}, then it is concentrated at A, if u(B) = 0 for every Borel set B with 
ANB=0 
The support of a measure u: B — [0, +00], denoted by supp pu, is the smallest closed subset of M on 
which the measure μ is concentrated. 
ΒΛ: The Bernoulli shift mapping is considered on M = [0,1]: 
444,1 = 2%, (mod1). (17.28a) 
In this case the map g: [0, 1] — [0, 1] is defined as 
μα. f 22, 0x x € 1/2, 
g(x) = 2v 1, 1/2« r € 1. 


The definition yields that the Lebesgue measure is invariant under the Bernoulli shift mapping. If a 


(17.28b) 


number x € [0, 1) is written in dyadic form x = X` a,:27" (an = Oor 1), then this representation can 
n=1 
be identified with x = .a,aga3.... The result of the operation 2r( mod 1) can be written as . a^, a5 a5... 
with a; = à;,4 , i.e., all digits a; are shifted to the left by one position and the first digit is omitted. 
E B: The mapping V: (0, 1] > [0,1] with 
2y, 0 € y « 1/2, 
σι) = . 
2(1— y), 1/2€ y €1 


is called a tent mapping and the Lebesgue measure is an invariant measure. The homeomorphism 


(17.29) 


2 
h: (0,1) > [0, 1) with y = — arcsin yx transforms the mapping ᾧ from (17.5) into (17.29). Hence, 
T 


in the case of a = 4, (17.5) has an invariant measure which is absolutely continuous. For the density 
pi(y) = 1 of (17.29) and p(x) of (17.5) at a = 4 it is valid that ρι(ῳ) = p(h-! (y)) |( 1) (y)]. It follows 


directly that p(x) = —————. 
πνα(1 -- x) 


HC: If 2 is a stable periodic point of period T of the invertible discrete dynamical system {1}, 
then u = 5 J yi(ap) iS a probability measure for {y'}. Here, 6,, is the Dirac measure concentrated 


at xo (see τον p. 694). 

2. Natural Measure 

Let A be an attractor of {φ'}ιερ in M with domain of attraction W. For an arbitrary Borel set A C W 
and an arbitrary point πο € W define the number: 


t(T, A, xo) 
UT 


L(A; το) := jim (17.30) 


Τ-λοο 
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Here, t(T, A, xo) is that part of the time T > 0 for which the orbit portion {y! (xo) }7.9 lies in the set A. 
If (A; το) = a for A-a.e.* xo from W, then let μ(Α) :— μ(Α; το). Since almost all orbits with initial 
points αρ € W tend to A for t + +00, µ is a probability measure concentrated at A. 


17.2.1.2 Elements of Ergodic Theory 


1. Ergodic Dynamical Systems 
A dynamical system {y'}ier on (M, p) with invariant measure y is called ergodic (one also says that the 
measure is ergodic) if either μ(Α) = 0 or μ(ΜΝ A) = 0 for every Borel set A with y™(A) = A (Yt > 0). 
If {φ!} is a discrete dynamical system (17.3), p: M — M is a homeomorphism and M is a compact 
metric space, then there always exists an invariant ergodic measure. 
WB A: Suppose there is given the rotation mapping of the circle S! 
31,17 3; ΕΦ (πιο 2π),. t=0,1,..., (17.31) 
with v: [0, 27) — [0, 2), defined by p(x) = «+ $ (mod27). The Lebesgue measure is invariant under 
Φ Φ 
i. IE p is irrational, then (17.31) is ergodic; if pm is rational, then (17.31) is not ergodic. 
W B: Dynamical systems with stable equilibrium points or stable periodic orbits as attractors are 
ergodic with respect to the natural measure. 
Birkhoff Ergodic Theorem: Suppose that the dynamical system {y'}icr is ergodic with respect 
to the invariant probability measure jz. Then, for every integrable function h € L'(M, B, ji), the time 


" 1 yt 
average along the positive semiorbits {φ!(σο)}}5ο 1:6. h(a) = um T [ h (ie (a9)) dt for flows and 
+00 T Jo 


n—1l 
h(x) = ]im — Y ἡ (ie (vo) for discrete systems, coincide with the space average f hd for p-a.e. 
CUP OR ig ae 


points zo € M. 
2. Physical or SBR Measure 
The statement of the ergodic theorem is useful only if the support of the measure p is large. Let y: 
M — M bea continuous mapping, and u: B — R be an invariant measure. The measure p is called 
a SBR measure (according to Sinai, Bowen and Ruelle, see also [17.9]) if for any continuous function 
h: M — R the set of all points πρ € M, for which 

n—1l 


im : Y A(y'(xo)) = [^ du (17.32a) 
a=0 M 
holds, has a positive Lebesgue measure for this. It is sufficient that the sequence of measures 
1 n-i 
ΠΡ Σ δρ) (11.929) 


where à, is the Dirac measure, weakly converges to µ for almost all x € M, i.e., for every continuous 
function / h dp, > / hdp as n => +00. 

M M 
W For some important attractors, such as the Hénon attractor, the existence of an SBR measure is 
proven. 
3. Mixing Dynamical Systems 
A dynamical system (ver on (M, p) with invariant probability measure ju is called mizing if 
lim n CAD c *(B)) = n( A)u(B) holds for arbitrary Borel sets A, B C M. For a mixing system, the 
—roo 


measure of the set of all points which are at t = 0 in A and under ọ* for large t in B, depends only on 


*Here and in the following a.e. is an abbreviation for “almost everywhere". 
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the product μ(Α)μ(Β). 
A mixing system is also ergodic: Let (o!) be a mixing system and A be a Borel set with (4) = 
A (t > 0). Then p(A)? = jim µίφ * (A) n A) = μ(Α) holds and μ(Α) is 0 or 1. 
A flow {φ!} of (17.1) is mixing if and only if the relation 
lim /[g(e'(x)) — g][h(z) -- h] du — 0 (17.33) 


t—++00 
M 


holds for arbitrary quadratically integrable functions g,h € L?(M, B, u). Here, g and h denote the 
space average, which is replaced by the time average. 
W The modulo mapping (17.28a) is mixing. The rotation mapping (17.31) is not mixing with respect 


A 
to the probability measure on 
π 


4. Autocorrelation Function 
Suppose the dynamical system (her on M with invariant measure p is ergodic. Leth: M > R 
be an arbitrary continuous function, {'(x)}i50 be an arbitrary semiorbit and let the space average 


h be replaced by the time average, i.e., by Jim. m x fm h(y'(x)) dt in the time-continuous case and by 


n—l 


lim — x h (ο) (x)) in the time-discrete case. With respect to h the autocorrelation function along 
noo η}, ^ 
i=0 


the semiorbit {9 ales to a time point 7 > 0 is defined for a flow by 


CT) = = lim wi 7 (r)) h (e! (x)) dt —h? (17.34a 
and for a discrete system by 
n—1l 
C,(r) = lim T3 d pitt (x)) h (v (1) — 2. (17.34b 
noo n i=0 


The autocorrelation function is defined also for negative time, where Ch (+) is considered as an even 
function on R or Z. 


Periodic or quasiperiodic orbits lead to periodic or quasiperiodic behavior of Ch. A quicker descent o 
C;,(7) for increasing 7 and arbitrary test function h refers to chaotic behavior. If C; (7) decreases for 
increasing 7 with an exponential speed, then it means mixed behavior. 

5. Power Spectrum 

The Fourier transform of C; (7) is called a power spectrum (see also 15.3.1.2, 5., p. 786) and is denoted 


+00 
by Pa(w). In the time-continuous case, under the assumption that / |Ομ(τ)]άτ < oo, 
J —oo 
Ph(w = [αι Cx(7)e ^" dr = 2 [ας ) εοβ(ωτ) dr. (17.35a) 
Too 
In the time-discrete case, if ο. |Ci,(k)| < +00 holds, then 


P, (w) = Cr (0 ) 232040) ) cos wk (17.35b) 


holds. If the absolute TM or summability of C, (-) does not hold, then, in the most impor- 
tant cases, Ph can be considered as a distribution. The power spectrum corresponding to the periodic 
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motions of a dynamical system is characterized by equidistant impulses. For quasiperiodic motions, 
there occur impulses in the power spectrum, which are linear combinations with integer coefficients of 
the basic impulses of the quasiperiodic motion. A “wide-band spectrum with singular peaks” can be 
considered as an indicator of chaotic behavior. 

ΒΑ: Let y be a T-periodic orbit of (17.1), h be a test function such that the time average of h(p(t)) 
is zero, and suppose h(y(t)) has the Fourier representation 


2 
h(y(t)) = S apet with wo = a 
k-—oo T 
Then with 6 as the 6 distribution, holds 
Too Too 
C,(r)- M Jes cos(kwor) and Ρι(ω) -- π V; 26(w — kwo). 
k-—oo k=- 


W B: Suppose o is a quasiperiodic orbit of (17.1), h is a test function such that the time average is 
zero along Φ, and let id be the representation (double Fourier series) 


Too 
=> Y Ok, e i(kiwi+kowa)t 


ki——oo kg=—0o 


Then, 


+00 
C,(7)- M Y lon, ky |" οοβ(Κιωι + Kawo)r, 
Κιξ--οο Κ2---οο 
Too 
ῬΡι(ω) =27 M 3 [αμ] δ(ω — kiwi — kawa). 


ky=—00 kg=—00 


17.2.2 Entropies 
17.2.2.1 Topological Entropy 


Let (M, p) be a compact metric space and {y"},er be a continuous dynamical system with discrete 
time on M. A distance function p, on M for arbitrary n € IN is defined by 


par, y) = max p (eG), e ())- (17.36) 


Furthermore, let N (e, pn) be the largest number of points from M which have in the metric p, a distance 
at least ε from each other. The topological entropy of the discrete dynamical system (17.3) or of the 


1 
mapping φ is h(y) = lim lim sup — In N(e, pn). The topological entropy is a measure for the complexity 
E>0 noo Nn 


of the mapping. Let (Mi, ρι) be a further compact metric space and yı: Mı — Mı be a continuous 
mapping. If both mappings y and φι are topologically conjugate, then their topological entropies 
coincide. In particular, the topological entropy does not depend on the metric. For arbitrary n € N, 
h(y") = nh() holds. If p is a homeomorphism, then A (z^) = |k| h (p) for allk € Z. Based on the 
last property the topological entropy h(y') := h (vy!) is defined for a flow v! = y(t,-) of (17.1) on 
McR". 


17.2.2.2 Metric Entropy 

Let {y'}ier be a dynamical system on M with attractor A and with an invariant probability mea- 
sure p ο ανν. on A. For an arbitrary € > 0 consider the cubes Ωι(ε)..... Qne) (£) of the form 
(Gri... n): kie < v; < (ki + De (i = 1,2,...,n)) with k; € Z, for which u(Q;) > 0. For arbitrary 
g from a 'Q. the semiorbit {φ!(α}}5 fo İS followed for increasing t. In time- distances of r > 0 (r = 1 
in discrete systems), the N cubes, in which the semiorbit is found is denoted by 44,..., in after each 
other. Let Ej, ;, be the set of all starting points in the neighborhood of A whose semiorbits at the 


880 17 Dynamical Systems and Chaos 


times t; = ir (i = 1,2,..., N) are always in Qi, ..., Qi, and let ρ(έι,:::. ἐν) = u( Ein,- iy) be the 
probability that a (typical) starting point is in Εν. 


'The entropy gives the increment of the information on average by an experiment which shows that 
among a finite number of disjoint events which one has really happened. In the above situation this is 


Ην-- M »(ι.:::.ἱν) In plih, sin) (17.37) 
(isein) 
where the summation is over all symbol sequences (i4, +- - , iy) with length N, which are realized by the 


orbits described above. 
The metric entropy or Kolmogorov-Sinai entropy h, of the attractor A of (i) with respect to the in- 


Hx 
variant measure μ is the quantity h, = lim lim ~=. For discrete systems, the limit as € — 0 is 
- £20 Noo ΤΝ 


omitted. For the topological entropy h(y) of o: A — A the inequality h, < h(y) holds. In several 
cases h(y) = sup{h,: pi invariant probability measure on A}. 

W A: Suppose A = {29} is a stable equilibrium point of (17.1) as an attractor, with the natural mea- 
sure jt concentrated on zo. For these attractors h, = 0. 

ΒΒ: For the shift mapping (17.28a), h(y) = h, = In 2, where µ is the invariant Lebesgue measure. 


17.2.3 Lyapunov Exponents 


1. Singular Values of a Matrix 

Let L be an arbitrary matrix of type (n, n). The singular values o1 > σι > +++ > ση of L are the 
non-negative roots of the eigenvalues αι > -++ > a, > 0 of the positive semidefinite matrix LTL. The 
eigenvalues o; are enumerated according to their multiplicity. 

The singular values can be interpreted geometrically. If ΜΚ. is a sphere with center at 0 and with radius 
€ > 0, then the image L(:) is an ellipsoid with semi-axis lengths oj¢ (i = 1,2,...,n) (Fig. 17.13a). 


© (Xy) 


σ;Ε σιε ey (txygeV) 
ss 
9 (xy*£V) 


a) b) Xy*£V 


Figure 17.13 


2. Definition of Lyapunov Exponents 

Let {y'}ier be a smooth dynamical system on M C R”, which has an attractor A with an invariant 
ergodic probability measure p concentrated on A. Let σι({, x) > +++ > o, (t, x) be the singular values 
of the Jacobian matrix D οί (a) of y! at the point x for arbitrary t > 0 and x € A. Then there exists 


1 
a sequence of numbers A, > .:: > An, the Lyapunov exponents, such that F In σι(ί,α) — A; for 


t — +00 μ-α.ο. in the sense of L'. According to the theorem of Oseledec, there exists µ-α.ο. a sequence 
of subspaces of R” 
R” = Ef D Ες D+ D Eg, = {0}, (17.38) 


1 
such that for jj-a.e. x the quantity F In | D e (z)v|| tends to an element As, € {A1,.-., An} uniformly 
with respect to v € BY V Ez 


$jpi" 
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3. Calculation of Lyapunov Exponents 
Suppose o;(t, x) are the semi-axis lengths of the ellipsoid got by deformation of the unit sphere with 
1 
center at x by D t(x). The formula y;(z) = Jim Sup, In o;(t, x) can be used to calculate the Lya- 
oo 

punov exponents, if additionally a reorthonormalization method, such as Householder, is used. The 
function y(t, x, 0) = D q'(x)v is the solution of the variational equation with v at t = 0 associated to 
the semiorbit y+ (x) of the flow {φ!}. Actually, (o! Jer is the flow of (17.1), so the variational equation 
is y = D f(g'(x)) y. The solution of this equation with initial v at time t = 0 can be represented as 
y(t, x, v) = &,(t)v, where 9, (1) is the normed fundamental matrix of the variational equation at t = 0, 
which is a solution of the matrix differential equation Z = D f (q'(x)) Z with initial Z(0) = E, accord- 
ing to the theorem about differentiability with respect to the initial state (see 17.1.1.1, 2., p. 857). 


il 
The number x(x, v) = jim sup 7 In [12 φ'(υγυ] describes the behavior of the orbit y(x+ev), 0 <£ « 1 
—00 8 


with initial 2+ ev with respect to the initial orbit y(x) in the direction v. If x(x, v) < 0, then the orbits 
move nearer to x for increasing t in the direction v. If, on the contrary, x(x,v) > 0, then the orbits 
move away (Fig. 17.13b). 

Let A be the attractor of the dynamical system {y'}ier and y the invariant ergodic measure concen- 
trated on it. Then, the sum of all Lyapunov exponents pi-a.e. x € A is 


t 
Ys Jim + f divf(o (2) ds (17.392) 


i=1 0 


in the case of flows (17.1) and for a discrete system (17.3), it is 

pe i 
YA = lim +X In| det D y(y'(z))|. (17.39b) 
" k—oo k io 


Hence, in dissipative systems 2 Ai < 0 holds. Considering that one of the Lyapunov exponents is 
= 

equal to zero if the attractor is not an equilibrium point, the calculation of Lyapunov exponents can be 
simplified (see [17.9]). 

W A: Let be xp an equilibrium point of the flow of (17.1) and let αι be the eigenvalues of the Jacobian 
matrix at 2. With the measure concentrated on zo, the following holds for the Lyapunov exponents: 
A; = Rea; (i = 1,2,...,n). 

E B: Let (ας) = (v'(xo), t € [0.Τ]} be a T-periodic orbit of (17.1) and let p; be the multipliers of 


1 
(zo). With the measure concentrated on (xo) there is A; = T In |p;| for i = 1,2...,n. 


4. Metric Entropy and Lyapunov Exponents 
If {ther is a dynamical system on M C R” with attractor A and an ergodic probability measure p 


concentrated on A, then the inequality h, < Y A; holds for the metric entropy h,, where in the sum 
X0 
the Lyapunov exponents are repeated according to their multiplicity. 
The equality 
h,— MA (Pesin’s formula) (17.40) 
A20 
is not valid in general (see also 17.2.4.4, BI B, p. 888). If the measure μ is absolutely continuous with 
respect to the Lebesgue measure and y: M — M is a C?-diffeomorphism, then Pesin's formula is valid. 
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17.2.4 Dimensions 
17.2.4.1 Metric Dimensions 
1. Fractals 


Attractors or other invariant sets of dynamical systems can be geometrically more complicated than 
point, line or torus. Fractals, independently of dynamics, are sets which distinguish themselves by 
one or several characteristics such as fraying, porosity, complexity, and self-similarity. Since the usual 
notion of dimension used for smooth surfaces and curves cannot be applied to fractals, a generalized 
definition of the dimension is necessary. For more details see [17.2], [17.12]. 

E The interval Go = [0, 1] is divided into three subintervals with the same length and the middle open 
third is removed, so the set Gi = [0, 4] U (2, 1] is obtained. Then from both subintervals of ΟἽ the open 


middle third ones are removed, which yields the set Gz = (0, i U B i U B i U δ. 1]. Continuing 


this procedure, Gg is obtained from (ας. ι by removing the open middle thirds from the subintervals. 
So, a sequence of sets Go D Gi Ὁ: D Gn 2 ::- is obtained, where every Gn consists of 2" intervals 


1 
f length —. 
of length 30 


The Cantor set C is defined as the set of all points belonging to all Gn, i.e., C = ἢ Gn. The set C is 


n=1 
compact, uncountable, its Lebesgue measure is zero and it is perfect, i.e., C is closed and every point 
is an accumulation point. The Cantor set can be an example of a fractal. 


2. Hausdorff Dimension 
The motivation for this dimension comes from volume calculation based on Lebesgue measure. If sup- 
posing that a bounded set A C RŽ? is covered by a finite number of spheres B,, with radius r; € e, 


4 
so that U; B,, D A holds, then for A a “rough volume" is 5 rule Now, one defines the quantity 
t 


4 
με(Α) = inf{> 37i) over all finite coverings of A by spheres with radius r; < £. If € tends to zero, 
i 


then one gets the Lebesgue outer measure A( A) of A. If A is measurable, the outer measure is equal to 
the volume vol(A). 

Suppose M is the Euclidean space IR" or, more generally, a separable metric space with metric p and 
let A C M bea subset of it. For arbitrary parameters d > 0 and & > 0, the quantity 


i 


µας (A) = inf [Eton : A C | Βι, diamB; € e} (17.41a) 


is determined, where B; C M are arbitrary subsets with diameter diamB; = sup p(r,y). 
T,y€ Bi 


The Hausdorff outer measure of dimension d of A is defined by 
μα) = lim μας) = Sup Ua, (A) (17.41b) 
= E>! 


and it can be either finite or infinite. The Hausdorff dimension dy (A) of the set A is then the (unique) 
critical value of the Hausdorff measure: 
_ f +œ, if μα(Α) #0 forall d> 0, : 

ἀπά) = { inf (d > 0: ual A) = 0}. (17.416) 
Remark: The quantities pa (A) can be defined with coverings of spheres with radius r; € £ or, in the 
case of R”, of cubes with side length < e. 
Important Properties of the Hausdorff Dimension: 
(HD1) dg (0) — 0. 
(HD2) If AC R”, then 0 € dy(A) € n. 
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(HD3) From A C B, it follows that dy(A) € dx(B). 
(HD4) If A= Ù Αι, then dy(A) = sup dy (A). 
i=1 i 


(HD5) If A is finite or countable, then dj (A) = 0. 

(HD6) Πρ: M — M is Lipschitz continuous, i.e., there exists a constant L > 0 such that p(y(x), y(y)) 

< Lp(z, y) Vx, y € M, then dy(yp(A)) € dy(A). If the inverse mapping q^! exists as well, and it is also 

Lipschitz continuous, then dg (A) = dy(y(A)). 

E For the set Q of all rational numbers dg(Q) = 0 because of (HD5). The dimension of the Cantor 
In2 

set C is dy(C) = — & 0.6309.... 

cere ate) In3 

3. Box-Counting Dimension or Capacity 

Let A be a compact set of the metric space (M, p) and let Ν.(Α) be the minimal number of sets ο 

diameter < €, necessary to cover A. The quantity 


E In Ν.(Α 
dp(A) = lim sup nN) (17.42a 
ε--θ In = 
is called the upper bor-counting dimension or upper capacity, the quantity 
In N.(A 
dits limni DAA (17.42b 
Ε-»0 


Int 


is called the lower boz-counting dimension or lower capacity (then dc) of A. Ifdg(A) = dg(A) :-- dg(A 
holds, then dg(A) is called the boz-counting dimension of A. In IR" the box-counting dimension can be 
considered also for bounded sets which are not closed. 

For a bounded set Ας R”, the number N.(A) in the above definition can also be defined in the following 
way: Let R” be covered by a grid from n-dimensional cubes with side length e. Then, N.(A) can be 
the number of cubes of the grid having a non-empty intersecting A. 

Important Properties of the Box-Counting Dimension: 

(BD1) dj (A) € dg(A) always holds. 

(802) For m-dimensional surfaces F C R” holds dg(F) = dg(F) = m. 

(BD3) dp(A) = dp (A) holds for the closure A of A, while often dz (A) < dg (A) is valid. 

(BD4) If A = UAn, then, in general, the formula dg(A) = sup dg(An) does not hold for the box- 


counting dimension. 

11 3 
W Suppose A = (0,1, PUE .}. Then dg(A4) = 0 and dg(A) = ο 
If A is the set of all rational points in (0, 1], then because of BD2 and BD3 d5(A) = 1 holds. On the 
other hand dy(A) = 0. 
4. Self-Similarity 
Several geometric figures, which are called self-similar, can be derived by the following procedure: An 
initial figure is replaced by a new one which is composed of p copies of the original, any of them scaled 
linearly by a factor q > 1. All figures that are k times scalings of the initial figure in the k-th step are 
handled as the in the first step. 


^ Λ W A: Cantor set: p= 2, q = 3. 
— E B: Koch curve: p = 4, q = 3. The 


first three steps are shown in Fig. 17.14. 
Figure 17.14 
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Figure 17.15 
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ZA EM 


Figure 17.16 


W C: Sierpinski gasket: p = 3, q = 2. The 
first three steps are shown in Fig. 17.15. 
(The white triangles are always removed.) 


W D: Sierpinski carpet: p = 8, q = 3. The 
first three steps are shown in Fig. 17.16. 
(The white squares are always removed.) 


For the sets in the examples A-D: 


17.2.4.2 Dimensions Defined by Invariant Measures 


1. Dimension of a Measure 
Let u be a probability measure in (M, p), concentrated on A. If x € A is an arbitrary point and B;(x) 
is a sphere with radius ὁ and center at 2, then 


usu ο 
d,(x) = lims 17.43a 
ux) im sup ind (17.43a) 
denotes the upper and 
ο ως παμθι(α)) 
ἁμία) = lim inf EE τον (17.43b) 


denotes the lower point-wise dimension of ui in x. If d, (x) = d(x) :— d (x), then d, (x) is called the 
dimension of the measure ji in x. 
Young Theorem 1: If the relation d,(x) = a holds for µ-α.ο.", x € A, then 
=d = inf dy(X)}. 17.44 
a= daly) = nf. (du) (17.44) 


The quantity dg (μ) is called the Hausdorff dimension of the measure p . 
E Suppose M = R” and let A C R” be a compact sphere with Lebesgue measure A(A) > 0. The 


λ 
restriction of jj to A is pa = XA)’ Then 
μ(Βε(α)) ~ 9" and απ(μ) = n. 
2. Information Dimension 
Suppose, the attractor A of (o! er is covered by cubes Q1 (£), . . . , Qn(e)(€) of side length £ as in 17.2.2.2, 
p. 879. Let u be an invariant probability measure on A. 


The entropy of the covering Qi(e), ..., Qs (2) is 
n(e) 
H(e) = — Mipi(s)Inpi(e), with ρε) = u(Qi(e)) (i= 1,...,n(e)). (17.45) 
i=l 


€ 


exists, then this quantity has the property of a dimension and is called 


If the limit d;(u) = — lim me 


the information dimension. 


*Here and in the following a.e. is an abbreviation for *almost everywhere". 
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Young Theorem 2: If the relation d,(x) = a holds for µ-α.ο. x € A, then 
a = ἆπ(μ) = dr(p). (17.46) 


E A: Let the measure µ be concentrated at the equilibrium point αρ of {y}. Since H-() = —1ln 1 = 
0 always holds for € > 0, so αμ) = 0. 

E B: Suppose the measure p is concentrated on the limit cycle of {1}. For € > 0, Η.(μ) = —1ne 
holds and so d;(1) = 1. 

3. Correlation Dimension 

Let {y;}%, bea typical sequence of points of the attractor A C IR" of {y'}ier, p an invariant probability 


measure on A and let m € IN be arbitrary. For the vectors x; :— (yi... ium) let the distance be 
defined as dist(z;, xj) :— ax luis — Yj+s||, where || - || is the Euclidean vector norm. If O denotes 
«scm 
- . 0, x <0, : 
the Heaviside function O(x) — 150 then the expression 


C™(e) = lim sup — card((z;, xj): dist(z;, £j) < €} 


N-++00 + [2 
1x 
= lim sup “ΤΣ X O (e = dist(z;, v;)) (17.47a) 
—oo 4 πας] 


is called the correlation integral. The quantity 


1 Gm E 
dy = lim a) (17.47b) 
(if it exists) is the correlation dimension. 
4. Generalized Dimension 


Let the attractor A of {y'}:er on M with invariant probability measure u be covered by cubes with 
side length € as in 17.2.2.2, p. 879. For an arbitrary parameter q € R, q Z 1, the sum 


π(ε) 
Hy,(e) = 1-4 In X` pi(e)! where pi(e) = u(Qi(e)) (17.48a) 
i=1 
is called the generalized entropy of q-th order with respect to the covering Qi(€),---, Qnie)(€)- 
The Rényi dimension of q-th order is 
LH Hs) 
d, =— lim he (17.48b) 


if this limit exists. 


Special Cases of the Rényi Dimension: 


a) g=0: do = dc(supp p). (17.49a) 

b) g=1: d:= lim d, = di(y). (17.49b) 
q—1 

c) q= 2: dy = dx. ( Τ.496) 


5. Lyapunov Dimension 
Let {y'}:er be a smooth dynamical system on M C R” with attractor A (or invariant set) and with the 
invariant ergodic probability measure p concentrated on A. If Ay > Ap > +++ > A, are the Lyapunov 


k ΚΕΙ 
exponents with respect to u and if k is the greatest index for which X A; > 0 and X A; < 0 hold, then 


i=1 i=1 
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the value k 
2A 
dr(u) = k + EET (17.50) 


is called the Lyapunov dimension of the measure p. 


If Y: A; > 0, then dr(u) = n; if A, < 0, then dr (u) = 0. 
ici 


Ledrappier Theorem: Let (!) be a discrete system (17.3) on M C R” with a C?-function y and 
ji, as above, an invariant ergodic probability measure concentrated on the attractor A of {ρ'}. Then 
αμ(μ) € dz(u) holds. 


Wl A: Suppose the attractor A C IR? ofa smooth dynamical system (9 is covered by N, squares with 
side length e. Let σι > 1 > σι be the singular values of D i. Then for the dg-dimensional volume of 
the attractor ma, c ΝΕ: £^? holds. Every square of side length ε is transformed by y approximately 
into a parallelogram with side length σοε and σιε. If the covering is made by rhombi with side length 


o£, then Noze S N. holds. From the relation Nee’? ~ Nose (£02)? one gets directly 
σα 
] λ 
ato Sp (17.51) 


^ Inos |λ2|᾽ 


This heuristic calculation gives a hint of the origin of the formula for the Lyapunov dimension. 

WB B: Suppose the Hénon system (17.6) is given with a = 1.4 and b = 0.3. The system (17.6) has 
an attractor A (Hénon attractor) with a complicated structure for these parameter values. The nu- 
merically determined box-counting dimension is dg(A) ~ 1.26. It can be shown that there exists 
an SBR measure for the Hénon attractor A. For the Lyapunov exponents A; and A, the formula 
Ai + λο = In| det Dy(x)| = Inb = In0.3 ~ —1.204 holds. With the numerically determined value 
λι & 0.42 one gets Ay ~ —1.62. So, 

0.42 


ἀν(μ) = 1+ τος ~ 1-26. (17.52 


17.2.4.3 Local Hausdorff Dimension According to Douady and Oesterlé 
Let {y'}ier be a smooth dynamical system on M C R” and A a compact invariant set. Suppose tha: 
an arbitrary to > 0 is fixed and let Φ := qf. 


Theorem of Douady and Oesterlé: Let σι(α) > -:: > on(x) be the singular values of D (x 
and let d € (0,n] be a number written as d = do + s with dọ € {0,1,...,n — 1} and s € [0,1]. I 
sup [σι (x)es(x) . . . ca (x)03,,4(x)] < 1 holds, then dg (A) < d. 
αΕΛ 


Special Version for Differential Equations: Let {y'}icr be the flow of (17.1), A be a compac 
invariant set and let a(x) > ::: > a(x) be the eigenvalues of the symmetrized Jacobian matrix 


1 
zP f(x)" + D f(x)] at an arbitrary point x € A. If d € (0,n] is a number of the form d = do + s 


where dy € (0,...,n — 1} and s € [0,1], and sup[oi(x) + +++ + oa (x) + soya (x)] < 0 holds, then 
2ΕΛ 


αμ(Δ) < d. The quantity 
πω 0, if a(x) < 0, 

DOT) =  sup(d: 0 <d < n, αι(α) t+ agg (x) + (d — [d])a+ (x) 2 0} otherwise, 
where x € A is arbitrary and [d] is the integer part of d, is called the Douady- Oesterlé dimension at the 
point x. Under the assumptions of the Douady-Oesterlé theorem for differential equations, da (^) < 
sup dpo(x). 
αΕΛ 


(17.53) 
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W The Lorenz system (17.2), in the case of ø = 10, b = 8/3, r = 
28, has an attractor A (Lorenz attractor) with the numerically de- 
termined dimension dy(A) ~ 2.06 (Fig. 17.17 is generated by 
Mathematica). With the Douady-Oesterlé theorem, for arbitrary 
b>1,0>0andr > 0 one gets the estimation 

dy(A) < 3- ziti where (17.54a) 


k= 


1 
2 


2 b ῃ " 54b 
ZA mE (17.54b) 


17.2.4.4 Examples of Attractors 

ΒΑ: The horseshoe mapping p occurs in connection with 
Poincaré mappings containing the transversal intersections of sta- 
ble and unstable manifolds. The unit square M = [0,1] x [0,1] 
is stretched linearly in one coordinate direction and contracted in 
Figure 17.17 the other direction. Finally, this rectangle is bent at the middle 
(Fig. 17.18). Repeating this procedure 
infinitely many times, a sequence of sets 
M D (M) D ::: is produced, for which 


A- f e" (M) (17.55) 


k=0 


isa compact set and an invariant set with 

M φ(Μ) °M) respect to y. A attracts all points of M. 

Apart from one point, A can be described 

Figure 17.18 locally as a product “line x Cantor set". 

W B: Leto € (0,1/2) bea parameter and M = [0, 1] x [0, 1] be the unit square. The mapping e: M > 
M given by 


1 
(2x, ay), if0<a< 57€ [0,1], 


ία.) = (17.56a) 


1 1 
(2x —layt+ 7 if 5 « r € 1l, y € [0,1] 


is called the dissipative baker’s mapping. Two iterations ofthe baker’s mapping are shown in Fig. 17.19. 


i 


1x 1x 1x 


The “flaky pastry structure” is recogniz- 


y y able. The set 
1 A= N en) (17.56b) 
k=0 


is invariant under y and all points from 
M are attracted by A. The value of the 
Hausdorff dimension is 


In2 
ἁμι md —25— 
Figure 17.19 —Ina 


(17.56c) 


For the dynamical system (t^) there exists an invariant measure js on M, which is different from the 
Lebesgue measure. At the points where the derivative exists, the Jacobian matrix is D e^((x,y)) = 


k 
t i) Hence, the singular values are e1(k, (z,y)) = Qk o2(k, (a, y)) = αξ and. consequently, 
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the Lyapunov exponents are Ay = In2, A» = Ina (with respect to the invariant measure p). For the 
Lyapunov dimension 


In2 
dilu) =1+ — cadat) (17.56d) 
—lna 
is obtained. Pesin's formula (see 17.2.3,4., (17.40), p. 881) for the metric entropy is valid here, i.e., 
hy = YF M=. (17.56e) 


Ai>0 


E C: Let T be the whole torus with local coordinates (O, x, y), as is shown in Fig. 17.20a. 


© 


Figure 17.20 


Let a mapping y: T - T be defined by 


Tk+1 1 /cos Ok Tk 
+1 = 20k, Ss ν km; 1, 1T. 
e -29. ας ο (17.57) 
with parameter a € (0, 1/2). The image y(T), with the intersections (T) à D(@) and y?(T) n D(@), 
is shown in Fig. 17.20b and Fig. 17.20c. The result of infinitely many intersections is the set A = 


oo 
N e (T), which is called a solenoid. The attractor A consists of a continuum of curves in the length 
ορ 


direction, and each of them is dense in A, and unstable. The cross-section of the A transversal to these 
curves is a Cantor set. 
' -- In2 : E ' 
The Hausdorff dimension is dg (A) = 1 — ime The set A has a neighborhood which is a domain of at- 
πα 
traction. Furthermore, the attractor A is structurally stable, i.e., the qualitative properties formulated 
above do not change for C'-small perturbations of y. 


W D: The solenoid is an example of a hyperbolic attractor. 


17.2.5 Strange Attractors and Chaos 


1. Chaotic Attractor 

Let {y'}icr be a dynamical system in the metric space (M, p). The attractor A of this system is called 
chaotic if there is a sensitive dependence on the initial condition in A. 

The property “ sensitive dependence on the initial conditions ” will be made more precise in different 
ways. It is given, e.g., if one of the two following conditions is fulfilled: 

a) All motions of {y'} on A are unstable in a certain sense. 

b) The greatest Lyapunov exponent of (i) is positive with respect to an invariant ergodic probability 
measure concentrated on A. 

W Sensitive dependence in the sense of a) occurs for the solenoid. Property b) can be found, e.g., for 
Hénon attractors. 

2. Fractals and Strange Attractors 

An attractor A of {γιεν is called fractal if it represents neither a finite number of points or a piecewise 
differentiable curve or surface nor a set which is bounded by some closed piecewise differentiable surface. 
An attractor is called strange if it is chaotic, fractal or both. The notions chaotic, fractal and strange 
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are used for compact invariant sets analogously even if they are not attractors. A dynamical system is 
called chaotic if it has a compact invariant chaotic set. 
WB The mapping 
X441 2%, +Yn  (modl), Yny1=Ln+Yn (mod 1) (17.58) 
(Anosov diffeomorphism) is considered on the unit square. The adequate phase space for this system 
is the torus T?. It is conservative, has the Lebesgue measure as invariant measure, has a countable 
number of periodic orbits whose union is dense and is mixing. Otherwise, A = Τ is an invariant set 
with integer dimension 2. 
3. Systems Chaotic in the Sense of Devaney 
Let {y'}rer be a dynamical system in the metric space (M, p) with a compact invariant set A. The 
system {y'}ier (or the set A) is called chaotic in the sense of Devaney, if: 
a) {y'}ier is topologically transitive on A, i.e., there is a positive semiorbit, which is dense in A. 
b) The periodic orbits of (v! γιεν are dense in A. 
c) (e! hier is sensitive with respect to the initial conditions in the sense of Guckenheimer on A, i.e., 
J3e»0VzeAVó»0 JYEANU (x) 3120 :ρ(φ'(α), φ'(υ)) Σ ε (17.59) 
where Us(x) = {σε M: p(x, z) < ô}. 
W Consider the space of the 0-1-sequences 


So = {s = sosiso-.., si € {0,1} (i =0,1...)}. 
For two sequences s = 808152... and 5' = sjs/s),..., their distance is defined by 
0, ifs=s' 
> ΠΕΝ n ? ? 
als, 5} = 123. if sz s, 
where j is the smallest index for which s; # s}. So, (2, p) is a complete metric space which is also 
compact. 
The mapping p: 8 = 605185... > o(s) = s' = 815983... is called a Bernoulli shift. 
The Bernoulli shift is chaotic in the sense of Devaney. 


17.2.6 Chaos in One-Dimensional Mappings 

For continuous mappings of a compact interval into itself, there exist several sufficient conditions for 

the existence of chaotic invariant sets. Three examples are to be considered. 

Shinai Theorem: Let o: I — I be a continuous mapping of a compact interval J, e.g., J = [0, 1] 

into itself. Then the system (v on J is chaotic in the sense of Devaney if and only if the topological 

entropy of y on J, i.e., h(y), is positive. 

Sharkovsky Theorem: Consider the following ordering of positive integer numbers: 
3-5»-7-..-2:.3-2.5-...-2.3-2*. 5»... 2:25:21. (17.60) 

Let p: I — I be a continuous mapping of a compact interval into itself and suppose {y*} has an n- 

periodic orbit on J. Then (q^) also has an m-periodic orbit if n + m. 

Block, Guckenheimer and Misiuriewicz Theorem: Let o: I — I be a continuous mapping 

of the compact interval J into itself such that (q^) has a 2"m-periodic orbit (m > 1, odd). Then 

In2 


h(y) > σι holds. 


17.2.7 Reconstruction of Dynamics from Time Series 
17.2.7.1 Foundations, Reconstruction with Basic Properties 


1. Measuring Function, Time Series 
Considered is a dynamic system {y'}ier with P € {Z}, R4}, generated by a mapping o € Diff} (U) 
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(see 17.1.4.2, p. 874) or a vector field f € X} (U) (see 17.1.4.1. p. 873). A Cl-function h: U > R, 
the so called measuring function is needed to reconstruct the dynamics from measurements. Since in 
practice only discrete time measurements can be obtained, this will be done in time periods (kr, k = 
1,2,...} C P, where 7 > 0 is a fixed interval of time. For m € N and & € {—1, 1} the time sequence 
of order m measured with fixed time step 7 > 0 


{ (h Gr (9) , h (p *9"(p)) ...., & (07097 (9))) 


is called the backward (« = —1) or forward (κ. = 1) coordinates of the orbit {y'(p)}ier with p € U 
based on the measuring function h. 


oo 


(17.61) 


k=m-1 


2. Immersion, Embedding, Theorem of Whitney 

Let U C R” be an open set. The C!-mapping Φ: U — R” is called an immersion if the Jacobian 
matrix D®(u) has rank n for every u € U. The immersion Φ: U — R” is called embedding if Φ is a 
homeomorphism of U into (U) ($(U) is considered with the subspace topology of IR"). 


Whitney’s theorem states that with respect to the open and bounded set U C IR" for every m > 2n 4-1 
the set of all embeddings Φ: U + R™ form an open and dense subset of C!-mappings U — IR". Hence, 
for m > 2n + 1 is  generically an embedding. 

3. Reconstruction Theorem of Takens, Theorem of Kupka and Smale 

Considering Diff: (U) with an arbitrary natural number m 2n + 1 the set of all pairs (p, h) € 
Diff! (U) x C! (U, R) , for which the reconstruction mapping (in forward coordinates) 


p EU Φρικ) = (Alp), h(e (p)... A= (p) (17.62 
is an embedding, is open and dense in Diff (U ) x C!(U, R). Hence, the property of p,n, to be an 
embedding is generic for m > 2n + 1. Such a number m is called embedding dimension (see theorem 
of Takens [17.13]). The theorem of Takens can be applied for differential equations from X1 (U): I 
m > 2n + Lis an arbitrary natural number, then the set of all pairs (f, h) € X! (U) x C*(U, R) for 
which the reconstruction mapping (in forward coordinates) 


p E€ U ++ ο Αα.) (17.63 


(with {%" }>0 as the semi-flow belonging to f) is an embedding, constitutes an open and dense set in 


X} (U) x C!(U, R). Hence, the property of ®,,, to be an embedding is generic. 

W Let the differential equation « = —x = f(a) be given in an interval (C1 — ¢,1 + £) (e > 0, fora 
sufficiently small £). Since f is continuously differentiable and f(—1) = 1 > 0 and f(1) = —1, clearly 
f € X} (U) with U = (—1,1). This also follows from the explicite solution y'(x) = xe~'(t > 0,2 € U). 
The theorem of Takens states that for m > 3 the reconstruction function Φρῃ is a generic embedding 
with continuously differentiable measuring function h : (—1,1) — R. For example the measuring 
function h(x) = « for the mapping x € (—1,1) œ> Bfn, (£) = (x, xe *, xe?) is obviously an em- 
bedding in R?. However with the measuring function ^: (—1,1) — R. the reconstruction mapping 
x € (-1,1) 9 Φρη,(α) = (x?,2?e ?, xe) is not injective and therefore is not an embedding. 

The reconstruction theorem of Takens is based on the theorem of Kupka and Smale: The set of all 
diffeomorphisms y € Diff! (U ), such that their period points are hyperbolic and W*(p) is transver- 
sal to W"(q) for arbitrary period point, forms a set of second Baire category, that is, such diffeomor- 


phisms are typical in Diff; (U . The condition m > 2n + 1 follows from the fact that for typical 
(o, h) € Diff 1(U ) x C! (U, R) mapping 9, ;, is an immersion in the neighborhood of the period points, 
therefore it can be extended into an embedding on the entire U. 

4. Dynamic in the Reconstruction Space 

The theorem of Takens implies that for a generic (y,h) € Difi (U) x C'(U, R) the set $(U) (re- 
construction space) with 9 = 4, is an immerse and homeomorphic image of U and the mapping 
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V = ® o p o P™ is defined on &(U). The topologic properties of the stationary points and periodic 
orbits of the (unknown) system (v^ reż, defined on U and the system {yk} reg, defined on (U) are 
identical to those of the eigenvalues of the Jacobi matrices. Similarly the entropies and dimensions, e.g. 
correlation dimension (see 17.2.7.2, p. 891), correspond to the Lyapunov exponents associated with in- 
variant measure. The mapping v on @(u) is completely described in all points being given in the time 
series. For example select τ = 1. Let point x, = (n(p*(p)), dg h (71 (p))) € R”, k € Z, begiven. 
Clearly x, = (qr), with qe = y! (p). Then v(z,) = (Φ ο p o O!)(@(q)) = zia, ie, v (Alar), 

h(qiim-1)) = (νι). R(dk+2); - - -, A(dk+m)) - From the measurements of the orbits (with l = Z4) of 
w defined on &(U) the entire dynamics of y defined on U can be obtained. 


17.2.7.2 Reconstructions with Prevalent Properties 


1. Prevalence as a Generic Metric 

Prevalence or generic metric is an extension to infinite dimension of the wellknown concept "almost 
everywhere in Lebesgue-measure” (see 12.9.1,2., p. 694) known in finite dimensions, so it differs from 
the corresponding notions for the sets of the second Baire category. A Borel set S in a Banach space 
B is prevalent (see [17.13]) if there is a finite Borel measure u with support K (see 17.2.1.1,1., p. 876), 
such that (5 + x) = μ(5) = (K) for all x € B. 

WA: Every Borel set of a finite dimensional vector space with measure zero complement is prevalent. 
WB: The union and intersection of finitely many prevalent sets are also prevalent. 

EC: Let C^(U, R), U C R” be the Banach space of all scalar valued functions from C^(U, IR) whose 
partial derivatives up to order k are continuous on U. If U € R” is open and connected, then the 
prevalent subsets of C*(U, R) , are dense in this space. 


2. Reconstruction Theorems of Sauer, Yorke and Casdagli 

Let {y'}:50 bea continuous dynamics generated by the vector field f € X mi (see 17.1.4.1, p. 873) and let 
A be a compact subset in U with fractal dimension do(A) = d. Furthermore let m > 2d be an integer 
and r > 0 arbitrary. The process {φί}ι»ο restricted to A might have only finitely many stationary 
points, there is no orbit with period 7 or 27, and only finitely many orbits with periods 37, 47, ...,mT 
where the multiplyers of these periodic orbits (except by 1) are all different. The set of all measuring 
functions h : U + R is a prevalent set Ο (U, R) if the reconstruction function ®;»,, 


p€U + pp) = (h (g "79" (p)) a (e? (p) ,..., (p) (17.64) 


satisfies the following conditions: 
a) Pfnr is injective on A; 
V(W), whith W = Gn R* x 


b) Φις is an immersion on every subset Üc A, such that Ü- 
< d. (Theorem of Sauer, Yorke, Cas- 


19... .,0},G C R” open, V: G — R” is a C'-mapping and k 


Tm 


dagli see [17.13].) 


3. Estimation of Correlation Dimension 
Given the dynamic system {y"}ier mit P € (Z4, R4], generated either by y € Diff (U) or f € X1(U). 
Let {y'}ier have an attractor A in U with invariant probability measure μ. Let h: U 4 R bea 
πο. function, m € N the order parameter, 7 = 1 the time stepsize and for i = 1,2,... 

= (Ui Visas Vim) € ROH (17.65) 


with y; = h(y'(p)) being a forward coordinate of the m + 1 order time series of the orbit (er with 
p € U. The distance of vectors x; and v; is defined as dist(r;, £j) = max |Yirs — Vjss]. Let N > m 
<s<m 
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denote a natural number and £ > 0 a real number, then the expression 


1 
O™(y) = lim sup a number of ordered pairs (rj, xj): i, j € {1,...,N},dist(xi,x;) <€} (17.66) 


In C" (e 
is called (discrete) correlation integral (with respect to m and £). If the d (m) = lim 
E> ne 


then it is an estimation of the correlation dimension dg. The Takens theorem implies that h € C ! (U, R) 
is generic for m > 2n, and the theorem of Sauer, Yorke, Casdagli implies that it is prevalent for m 4-1 > 
dc (M) with backward coordinates. 

Ε The Lorenz system (17.2) (see 17.1.1.1,2., p.858) belongs to X! (U), where U can be selected as 
U --- ((,9,2) € R? : 1a? +y? + (z— 0 — τ)] < c} (e > 0, sufficiently large). Clearly the Lorenz 
attractor A (with c = 10, b = 8/3, r = 28) is in U. The theorem of Douady-Oesterlé (see 17.2.4.3, 
p. 886) gives the upperbound dg (A) € 2.421. Numerical integration with the Box-Counting-Method 
gives dj (A) & 2.06. Estimation of the correlation dimension in natural measure using the embedding 
method with time series in backward coordinates (τ & 0.12) gives the value dg & 2.03 (Grassberger, 
(17.12]) for the Lorenz attractor . 


exists, 


17.3 Bifurcation Theory and Routes to Chaos 


17.3.1 Bifurcations in Morse-Smale Systems 

Let {φέγιεγ be a dynamical system generated by a differential equation or by a mapping on M C R”, 
which additionally depends on a parameter e € V C R}. Every change of the topological structure 
of the phase portrait of the dynamical system for small changes of the parameter is called bifurcation. 
The parameter £ = 0 € V is called the bifurcation value if there exist parameter values e € V in every 
neighborhood of 0 such that the dynamical systems {yt} and {yb} are not topologically equivalent 
or conjugated on M. The smallest dimension of the parameter space for which a bifurcation can be 
observed is called the codimension of the bifurcation. 

One distinguishes between local bifurcations, which occur in the neighborhood of a single orbit of the 
dynamical system, and global bifurcations, which affect a large part of the phase space. 


17.3.1.1 Local Bifurcations in Neighborhoods of Steady States 
1. Center Manifold Theorem 


Consider a parameter-dependent differential equation 
qm) or r= fies im ebene οθονη) (17.67) 


with f: M x V — R”, where M C R” and V C R' are open sets and f is supposed to be r times con- 
tinuously differentiable. Equation (17.67) can be interpreted as a parameter-free differential equation 
č = f(r,2), €= 0 in the phase space M x V. From the Picard-Lindelóf theorem and the theorem on 
differentiability with respect to the initial values (see 17.1.1.1, 2., p. 857) it follows that (17.67) has a 
locally unique solution y(-, p, €) with initial point p at time t = 0 for arbitrary p € M and e € V, which 
is r times continuously differentiable with respect to p and e. Suppose all solutions exist on the whole 
of R. 
Furthermore, it is supposed that the system (17.67) has the equilibrium point x = 0 at € = 0, i.e., 
e n 
δή: (0, o) with Πολ; = 0. 
Ox; ee 

Jj 
Furthermore, suppose, D, f (0,0) has exactly m eigenvalues with negative real part and k = n — s — m 
eigenvalues with positive real part. 
According to the center manifold theorem for differential equations (theorem of Shoshitaishvili, see 
(17.12)), the differential equation (17.67), for e with a sufficiently small norm ||e|| in the neighborhood 


f(0,0) = 0 holds. Let À1,..., λε be the eigenvalues of D, (0,0) = | 
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of 0, is topologically equivalent to the system 
$-—F(ze)meAzr-9g(ze) y=-y, ż=z (17.68) 


with x € R*, y € R” and z € R^, where A is a matrix of type (s, s) with eigenvalues A,,..., λε, and g 
represents a C"-function with g(0,0) = 0 and D,9(0,0) = 0. 
It follows from representation (17.68) that the bifurcations of (17.67) in a neighborhood of 0 are 
uniquely described by the differential equation 

t= F(a, ¢). (17.69 
Equation (17.69) represents the reduced differential equation to the local center manifold Wg. = (x, y, z: 
y = 0,z = 0} of (17.68). The reduced differential equation (17.69) can often be transformed into 
a relatively simple form, e.g., with polynomials on the right-hand side, by a non-linear parameter- 
dependent coordinate transformation so that the topological structure of its phase portrait does no 
change close to the considered equilibrium point. This form is a so-called normal form. A normal form 
cannot be determined uniquely; in general, a bifurcation can be described equivalently by differen 
normal forms. 
2. Saddle-Node Bifurcation and Transcritical Bifurcation 
Suppose (17.67) is given with | = 1, where f is continuously differentiable at least twice and D, f (0,0 
has the eigenvalue A; = 0 and n — 1 eigenvalues A; with Πολ; 7 0. According to the center manifold 
theorem, in this case, all bifurcations (17.67) near 0 are described by a one-dimensional reduced differ- 


F 
ential equation (17.69). Obviously, here F(0,0) — ^. 0) = 0. If, additionally, it is supposed tha: 
x 
93 OF 
8 10,0) # 0, δε (0,0) A 0 and the right-hand side of (17.69) is expanded according to the Taylor 
z 


formula, then this representation can be transformed by coordinate transformation (see [17.6]) into 
the normal form 


b=ata2? t (17.70) 
OF ' OF 
for — (0,0) > 0] ori =a—a?+--- | for =— (0,0) <0), where a = a(e) is a differentiable 
Ox? Ox? 
function with a(0) = 0 and the points indicate higher-order terms. For a < 0, (17.70) has two equi- 
librium points close to r = 0, among which one is stable, the other is unstable. For a = 0, these 


equilibrium points fuse into x = 0, which is unstable. For a > 0, (17.70) has no equilibrium point near 
to 0 (Fig. 17.21b). 
The multidimensional case results in a saddle-node bifurcation in a neighborhood of 0 in (17.67). This 
bifurcation is represented in Fig. 17.22 for n = 2 and λι = 0,2 < 0. The representation of the 
saddle-node bifurcation in the extended phase space is shown in Fig. 17.21a. For sufficiently smooth 
vector fields (17.67), the saddle-node bifurcations are generic. 


a a<0 a=0 α»0 


Figure 17.21 


F 
If among the conditions which yield a saddle-node bifurcation for F, the condition FO 0) Z 0 is 


oF 2p 
replaced by the conditions a (0,0) = 0 and a 


(0,0) Z 0, then one gets from (17.69) the truncated 
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"TE 


1 0 
a<0 a=0 a>0 


Figure 17.22 


normal form (without higher-order terms) 2 = ox — 27 of a transcritical bifurcation. For n = 2 and 
λο < 0, the transcritical bifurcation together with the bifurcation diagram is shown in Fig. 17.23. 
Saddle-node and transcritical bifurcations have codimension 1-bifurcations. 


3. 


Ξ0 α»0 
Figure 17.23 


Hopf Bifurcation 


Consider (17.67) with n > 2, | = 1 andr > 4. Suppose that /(0. ε) = 0 is valid for all e with |e 
(£o > 0 sufficiently small). Suppose the Jacobian matrix D, f (0,0) has the eigenvalues λι = Ay = iw 
with w 7 0 and n — 2 eigenvalues A; with ReA; # 0. According to the center manifold theorem, the 


bifurcation is described by a two-dimensional reduced differential equation (17.69) of the form 

i = a(e)x — w(e)y + g(z, y, E), y=wle)etale)y + go(2,y, ε) (17.71 
where a, w, gi and g are differentiable functions and ω(0) = w and also a(0) = 0 hold. By a non-linear 
complex coordinate transformation and by the introduction of polar coordinates (r, Ὁ), (17.71) can be 
written in the normal form 

? — α(ε)τ ο +, 0 — w(e) - bler? +- (17.72 
where dots denote the terms of higher order. The Taylor expansion of the coefficient functions of (17.72 
yields the truncated normal form 

ἡ — o'(0)er -- a(0)?, ὃ — ω(0) + ω(θ)ε + b(0)r?. (17.73 


K 


neighborhood of the equilibrium point for € = 0. 


T. 


1e following cases occur for (17.73) under the assumption α (0) > 0: 


1. a(0) <0 (Fig. 17.24a). : 2. a(0) > 0 (Fig. 17.24b) : 


a) € > 0: Stable limit cycle and a) € < 0: Unstable limit cycle. 
unstable equilibrium point. 


< £9 


he theorem of Andronov and Hopf guarantees that (17.73) describes the bifurcations of (17.72) in a 


b) e — 0: Cycle and equilibrium point fuse b) ε — 0: Cycle and equilibrium point fuse 


into a stable equilibrium point. 
c)e <0: All orbits close to (0,0) tend asin b) — c)&e > 0: Spiral type unstable 

fort + +00 spirally to the equilibri- equilibrium point as in b). 

um point (0, 0). 


into an unstable equilibrium point. 


The interpretation of the above cases for the initial system (17.67) shows the bifurcation of a limit cycle 
of a compound equilibrium point (compound focus point of multiplicity 1), which is called a Hopf bifur- 


17.9 Bifurcation Theory and Routes to Chaos 895 


a) e>0 e<0 b)  e«0 £20 


Figure 17.24 


cation (or Andronov-Hopf bifurcation). The case a(0) « 0 is also called supercritical, the case a(0) > 0 
subcritical (supposing that a'/(0) > 0). The case n = 3, λι = Às = i, Az < 0, α(0) > 0 and a(0) < 0 is 
shown in Fig. 17.25. 


e>0 e<0 


Figure 17.25 


Hopf bifurcations are generic and have codimension 1. The cases above illustrate the fact that a super- 
critical Hopf bifurcation under the above assumptions can be recognized by the stability of a focus: 
Suppose the eigenvalues A; (c) and λο(ε) of the Jacobian matrix on the right-hand side of (17.67) at 0 
are pure imaginary for € = 0, and for the other eigenvalues A; ReA; 4 0 holds. Suppose furthermore 
d 
that d Βολι(ε}ι--ο 0 and let 0 be an asymptotically stable focus for (17.67) at € = 0. Then there is 
E 
a supercritical Hopf bifurcation in (17.67) at ¢ = 0. 
E The van der Pol differential equation 3 + £(z? — 1)t + x = 0 with parameter £ can be written as a 
planar differential equation 
$-—y, y= --ε(α" -- 1η -- τα. (17.74) 
For & = 0, (17.74) becomes the harmonic oscillator equation and it has only periodic solutions and 
an equilibrium point, which is stable but not asymptotically stable. With the transformation u — 
Ve v, v = Vey fore > 0 (17.74) is transformed into the planar differential equation 


ù=v, i--—uc- (u? --ε)υ. (17.75) [ 


For the eigenvalues of the Jacobian matrix at the equilibrium point (0,0) of (17.75): 


€ e d 1 
λιρ(ε) = 3*V4-7 land so λι2(0) = +i and T ReAi(€)je=0 = 27 0. 
As shown in the example of 17.1.2.3, 1., p. 863, (0, 0) is an asymptotically stable equilibrium point of 
(17.75) for e = 0. There is a supercritical Hopf bifurcation for € = 0, and for small ε > 0, (0,0) is an 


unstable focus surrounded by a limit cycle whose amplitude is increasing as € increases. 

4. Bifurcations in Two-Parameter Differential Equations 

1. Cusp Bifurcation Suppose the differential equation (17.67) is given with r > 4 and | = 2. Let 
the Jacobian matrix D, f (0,0) have the eigenvalue λι = 0 and n — 1 eigenvalues A; with Πολ; z 0 and 


OF 
p, (0,0) — 0 und 


F 
suppose that for the reduced differential equation (17.69) F(0,0) = 2 (0,0) = 
Ῥ 
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OF 
l3 := ES ——— (0,0) Z 0. Then the Taylor expansion of F close to (0, 0) leads to the truncated normal form 
(without higher-order terms see, [17.1]) 


d = a + œz + sign laa? (17.76) 
with the parameters o4 and o3. The set { (a1, 0, £): αι + αὐ + sign lax? = 0} represents a surface in 
extended phase space and this surface is called a cusp (Fig. 17.26a). 

In the following, it is supposed l3 < 0. The non-hyperbolic equilibrium points of (17.76) are defined 
by the system αι + aax — a? = 0, a — 3a? = 0 and thus they lie on the curves Sı and S5, which are 
determined by the set {(a1, αὐ): 27a? — 4a3 = 0} and form a cusp (Fig. 17.26b). If (αι, a2) = (0,0) 
then the equilibrium point 0 of (17.76) is stable. The phase portrait of (17.67) in a neighborhood of 0, 
e.g., for n = 2, ls < 0 and λι = 0 is shown in Fig. 17.26c for A» < 0 (triple node) and in Fig. 17.26d 
for As > 0 (triple saddle) (see [17.6]). 

At transition from (αι, αὐ) = (0,0) into the interior of domain 1 (Fig. 17.26b) the compound node- 
type non-hyperbolic equilibrium point 0 of (17.67) splits into three hyperbolic equilibrium points (two 
stable nodes and a saddle) (supercritical pitchfork bifurcation). 


In the case of the two-dimensional phase space of (17.67) the phase portraits are shown in Fig. 17.26c,e. 
When the parameter pair of S; V {(0,0)} (i = 1,2) traverse from 1 into 2 then a double saddle node- 
type equilibrium point is formed which finally vanishes. A stable hyperbolic equilibrium point remains. 


i -— E JA 
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Figure 17.26 
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Figure 17.27 


2. Bogdanov-Takens Bifurcation Suppose, for (17.67), n > 2, | = 2, r > 2 hold and the matrix 
D, f (0,0) has two eigenvalues λι = λο = 0 and n — 2 eigenvalues A; with ReA; Æ 0. Let the reduced 
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two-dimensional differential equation (17.69) be topologically equivalent to the planar system 


$—y, ý= +T +T? — xy. (17.77) 
Then there is a saddle-node bifurcation on the curve S; = {(a1, αν): αὖ — 4a, = 0}. At the transition 
on the curve $5 = {(a1, αὐ): αι = 0,a2 < 0} from the domain αι < 0 into the domain a, > 0 a stable 
limit cycle arises by a Hopf bifurcation and on the curve $5 = {(a1,a2): αι = —kaZ+---} (k > 0, 
constant) there exists a separatrix loop for the original system (Fig. 17.27), which bifurcates into a 
stable limit cycle in domain 3 (see [17.6], [17.9]). 

This bifurcation is of a global nature and one says that a single periodic orbit arises from the homoclinic 
orbit of a saddle or a separatrix loop vanishes. 

3. Generalized Hopf Bifurcation Suppose that the assumptions of the Hopf bifurcation with 
r > Gare fulfilled for (17.67), and the two-dimensional reduced differential equation after a coordinate 


" 


transformation into polar coordinates has the normal form ? = eyr + £r? — τὸ +--+, 0 =1+---. The 
bifurcation diagram (Fig. 17.28) of this system contains the line $1 = {(€1,€2): ει = 0, £2 Æ 0}, whose 
points represent a Hopf bifurcation (see [17.4], [17.6]). There exist two periodic orbits in domain 3, 
among which one is stable, the other one is unstable. On the curve Sy = {(€1,€2): €3+4€2 > 0, ει < 0}, 
these non-hyperbolic cycles fuse into a compound cycle which disappears in domain 2. 


ο... 6 (C 6 6 


Figure 17.28 


5. Breaking Symmetry 

Some differential equations (17.67) have symmetries in the following sense: There exists a linear trans- 
formation T (or a group of transformations) such that f(T'v,c) = T f (r,£) holds for all x € M and 
€ € V. An orbit y of (17.67) is called symmetric with respect to T if Ty = y. 

One talks about a symmetry breaking bifurcation at € = 0, e.g., in (17.67) (for | = 1), if there is a 
stable equilibrium point or a stable limit cycle for ε < 0, which is always symmetric with respect to T, 
and for ε = 0 two further stable steady states or limit cycles arise, which are no-longer symmetric with 
respect to T. 

E For system (17.67) with f(x,¢) = &x—2? the transformation T defined as T: x + --ᾱ is a symmetry, 


since f(—a,e) = —f(a,e) (x € R,e € R). Fore < 0 the point x, = 0 is a stable equilibrium point. 
For £ > 0, besides xı = 0, there exist the two other equilibrium points £23 = de; both are non- 
symmetric. 


17.3.1.2 Local Bifurcations in a Neighborhood of a Periodic Orbit 


1. Center Manifold Theorem for Mappings 
Let y be periodic orbit of (17.67) for e = 0 with multipliers p1, ..., Pn-1; Pn = 1. A bifurcation close to 
η is possible, if when changing e, at least one of the multipliers lies on the complex unit circle. The use 
of a surface transversal to y leads to the parameter-dependent Poincaré mapping 

zı P(a,¢). (17.78) 
Then, with open sets E C IR"! and V C R! let P: Ex V — ΒΓ! bea C"-mapping where the mapping 
P:ExV > R'!xR!with Ρ(α,ε) = (P(x, ε), ε) should be a C’-diffeomorphism. Furthermore, 
let P(0,0) = 0 and suppose the Jacobian matrix D; P(0, 0) has s eigenvalues ρι,.... ps with |p;| = 1, 
m eigenvalues p.41,..., Ps+m With |p;| < 1 and k = n — s — m — 1 eigenvalues Psym4i,--+;Pn—1 
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with |p;| > 1. Then, according to the the center manifold theorem for mappings (see [17.4]), close to 
(0,0) € E x V, the mapping P is topologically conjugate to the mapping 

(z,y, z, €) -> (F(a, ε), Αν, A"z, ε) (17.79) 
near (0,0) € R"^! x R? with Ε(α,ε) = A*r + g(z, ε). Here g is a C"-differentiable mapping satisfying 
the relations g(0,0) = 0 and D,g(0,0) = 0. The matrices Α΄, A* and A" are of type (s, s). (m, m) and 
(k, k), respectively. 
It follows from (17.79) that bifurcations of (17.78) close to (0,0) are described only by the reduced 
mapping 


zı Ε(α,ε) (17.80) 
on the local center manifold Wf. = { (x,y,z): y = 0, z = 0}. 
F(,0)4 F(a) F(,o) 
» » » 
X X x 
a) 
b) a<0 α-θ a>0 


Figure 17.29 


2. Bifurcation of Double Semistable Periodic Orbits 
Let the system (17.67) be given with n > 2, r > 3 and! = 1. Suppose, at € = 0, the system (17.67) has 


periodic orbit y with multipliers pı = +1, |p;| # 1 (i —2,3,...,n — 1) and p, = 1. According to the 
center manifold theorem for mappings, the bifurcations of the Poincaré mapping (17.78) are described 
OF 


F 
by the one-dimensional reduced mapping (17.80) with A* — 1. If δι (0,0) Æ 0 and ad (0,0) # 0 15 


τὰ 
supposed, then it leads to the normal forms 


2 
rey Εἴα, α) —-a- zz? (i 3 F (0,0) > 0) or (17.81a) 
r 
2 
r—arr-zi? [ο s (0. 0) « ) : (17.81b) 
v? 


The iterations from (17.81a) close to 0 and the corresponding phase portraits are represented in 
Fig. 17.29a and in Fig. 17.29b for different a (see [17.6]). Close to x — 0 there are fora «θα 
stable and an unstable equilibrium point, which fuse for a = 0 into the unstable steady state x = 0. 
For a > 0 there exists no equilibrium point close to x = 0. The bifurcation described by (17.81a) in 
(17.80) is called a subcritical saddle node bifurcation for mappings. 

In the case of the differential equation (17.67), the properties of the mapping (17.81a) describe the 
bifurcation of a double semistable periodic orbit: For a < 0 there exists a stable periodic orbit γι and an 
unstable periodic orbit γο, which fuse for œ = 0 into a semistable orbit y, which disappears for a > 0 
(Fig. 17.30a,b). 

3. Period Doubling or Flip Bifurcation 

Let system (17.67) be given with n > 2, r > 4and/ = 1. Considered is a periodic orbit y of (17.67) at 
εΞ 0 with the multipliers pı = —1, |p;| A 1 (i —2,...,n — 1), and p, = 1. The bifurcation behavior 
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a) 471 b) 9 


Figure 17.30 


of the Poincaré mapping in the neighborhood of 0 is described by the one-dimensional mapping (17.80) 
with A* = —1, if supposing the normal form 

re F(z,o) = (--1 -- α)α + αὖ. (17.82) 
The steady state x = 0 of (17.82) is stable for small a > 0 and unstable for a < 0. The second iterated 


mapping F? has for a < 0 two further stable fixed points besides x = 0 for σι» = +V—a + ο(|α]), 
which are not fixed points of F’. Consequently, they must be points of period 2 of (17.82). 

In general, for a C^-mapping (17.80) there is a two-periodic orbit at £ = 0, if the following conditions 
are fulfilled (see [17.2]): 


F OF? 
F(0,0) = 0, 2 (0, 0) 1; - (0,0) =0, 
Ox Oe 
0? F? op 03 F? (17.83) 
ὅσδε (0, 0) # 0, Ox? (0,0) =0, Ox? (0:0) #0 
: 2 OF τος . . 
Since à (0,0) = +1 holds (because of δε» 0) = —1), the conditions for a pitchfork bifurcation are 
a x 


formulated for the mapping F?. 
For the differential equation (17.67) the properties of the mapping (17.82) imply that at a = 0 a stable 
periodic orbit γα splits from y with approximately a double period (period doubling), where y loses its 
stability (Fig. 17.30c). 
Β The logistic mapping ga: [0,1] > [0,1] is given for 0 < a € 4 by ga(x) = ax(1 — x), i.e., by the 
discrete dynamical system 

Tiy = απι(1 -- σι). (17.84) 
The mapping has the following bifurcation behavior (see [17.10]): For 0 < a € 1 system (17.84) has 
the equilibrium point 0 with domain of attraction [0,1]. For 1 « a < 3, (17.84) has the unstable 
equilibrium point 0 and the stable equilibrium point 1 — 1/a , where this last one has the domain of 
attraction (0, 1). For a, = 3 the equilibrium point 1 — 1/α is unstable and leads to a stable two-periodic 
orbit. 
At the value αν = 1 + V6, the two-periodic orbit is also unstable and leads to a stable 2?-periodic or- 
bit. The period doubling continues, and stable 2?-periodic orbits arise at a = ag. Numerical evidence 
shows that ag > os © 3.570... as q > +00. 
For a = o, there is an attractor F (the Feigenbaum attractor), which has a structure similar to the 
Cantor set. There are points arbitrarily close to the attractor which are not iterated towards the at- 
tractor, but towards an unstable periodic orbit. The attractor F has dense orbits and the Hausdorff 
dimension is dg (F) ~ 0.538.... On the other hand, the dependence on initial conditions is not sen- 
sitive. In the domain a, < a < 4, there exists a parameter set A with positive Lebesgue measure 
such that system (17.84) has a chaotic attractor of positive Lebesgue measure for a € A. The set A is 
interspersed with windows in which period doublings occur. 


The bifurcation behavior of the logistic mapping can also be found in a class of unimodal mappings, 
i.e., of mappings of the interval J into itself, which has a single maximum in J. Although the param- 
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eter values αι, for which period doubling occurs, are different from each other for different unimodal 
mappings, the rate of convergence by which these parameters tend to digg, is equal: ἂχ — o, & CO-*, 
where ô = 4.6692... is the Feigenbaum constant (C depends on the concrete mapping). The Hausdorff 
dimension is the same for all attractors F at à = a: dy(F) ~ 0.538... . 


4. Creation of a Torus 
Consider system (17.67) with n > 3,r > 6and/ = 1. Suppose that for all e close to 0 system (17.67) has 
2 

a periodic orbit γε. Let the multipliers of yo be p1,2 = οὖν with V g {ο. 7 F τ). pj (j 5 3,...,n-1) 

with |p;| A 1 and p, = 1. 

According to the center manifold theorem, in this case there exists a two-dimensional reduced C®- 

mapping 
zı Ε(α,ε) (17.85) 

with ΡΟ, ε) = 0 for e close to 0. 

If the Jacobian matrix D,F(0,2) has the conjugate complex eigenvalues p(£) and p(e) with |p(0)| = 1 


d 
for all ar 0, if d :— — |p(& 
or all ε near 0, if a de lp(e) 


i-o > 0 holds and p(0) is not a q-th root of 1 for q = 1,2,3,4, 


then (17.85) can be transformed by a smooth ε dependent coordinate transformation into the form 


zc Ε(α,ε) = F(x, €) + O(|x||7) (O Landau symbol), where F, is given in polar coordinates by 


r ΠΟΙΟΣ 
: 17. 
(7) . (, + w(e) + b(e)r? Ὁ 
Here a, w and bare differentiable functions. Suppose α(0) < 0 holds. Then, the equilibrium point r = 0 
of (17.86) is asymptotically stable for all € < 0 and unstable for € > 0. Furthermore, for € > 0 there 


de 
exists the circle r = ,/— πο , which is invariant under the mapping (17.86) and asymptotically stable 
\ a 


ο. Qo 


a) εΞ0 £»0 b) 


(Fig. 17.31a). 


Figure 17.31 


The Neimark-Sacker Theorem (see [17.10], [17.1]) states that the bifurcation behavior of (17.86) 
is similar to that of F (supercritical Hopf bifurcation for mappings). 
W In mapping (17.85), given by 


x 1 f/(+e)rt+yt2? -- 2y? 
y γοί-α (1 γεν ἠ- α2 — a3)’ 


there is a supercritical Hopf bifurcation at € = 0. 
With respect to the differential equation (17.67), the existence of a closed invariant curve of mapping 
(17.85) means that the periodic orbit yo is unstable for a(0) < 0 and for £ > 0 a torus arises which is 
invariant with respect to (17.67) (Fig. 17.31b). 
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17.3.1.3 Global Bifurcation 

Besides the periodic creation orbit which arises if a separatrix loop disappears, (17.67) can have further 
global bifurcations. Two of them are shown in [17.12] by examples. 

1. Emergence of a Periodic Orbit due to the Disappearance of a Saddle-Node 

W The parameter-dependent system 

i-—zcz(l—a?—3?) ἠ- y(14- 2 - a), j z(l4 z-40)-cy(1-z*-y) 
has in polar coordinates x = r cos V, y = r sin Ὁ the following form: 

t =r(1— r?°), Ý = -(1 +a + rcos?). (17.87) 
Obviously, the circle r = 1 is invariant under (17.87) for an arbitrary parameter œ, and all orbits (except 
the equilibrium point (0, 0)) tend to this circle for t + --oo. For a < 0 there are a saddle and a stable 
node on the circle, which fuse into a compound saddle-node type equilibrium point at œ = 0. For a > 0, 
there is no equilibrium point on the circle, which is a periodic orbit (Fig. 17.32). 


X2 €) €) 


Figure 17.32 


2. Disappearance of a Saddle-Saddle Separatrix in the Plane 
W Consider the parameter-dependent planar differential equation 

$-—oac-2zy, y=1lt+2’?-y’. (17.88) 
For a = 0, equation (17.88) has the two saddles (0, 1) and (0, —1) and the y-axis as invariant sets. The 
heteroclinic orbit is part of this invariant set. For small |a| 4 0, the saddle-points are retained while 
the heteroclinic orbit disappears (Fig. 17.33). 


iid 


a=0 α»0 


Figure 17.33 


17.3.2 Transitions to Chaos 

Often a strange attractor does not arise suddenly but as the result of a sequence of bifurcations, from 
which the typical ones are represented in Section 17.3.1. The most important ways to create strange 
attractors or strange invariant sets are described in the following. 

17.3.2.1 Cascade of Period Doublings 

Analogously to the logistic equation (17.84), a cascade of period doublings can also occur in time- 
continuous systems in the following way. Suppose system (17.67) has the stable periodic orbit y for 
€ < €. For e = ει a period doubling occurs near a, at which the periodic orbit 7{ loses its stability 


for & > ει. A periodic orbit 49 with approximately double period splits from it. At € = £5, there is a 
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new period doubling, where 49 loses its stability and a stable orbit γ9 with an approximately double 
period arises. For important classes of systems (17.67) this procedure of period doubling continues, so 
a sequence of parameter values (2;) arises. 

Numerical calculations for certain differential equations (17.67), e.g., for hydrodynamical differential 
equations such as the Lorenz system, show the existence of the limit 


i Ej+1 — Ej " 
lim ----------- ὃ, whereó (17.89) 
7719 Ezy ει 
is again the Feigenbaum constant. 
For e, = lim ej, the cycle with infinite period loses its stability, and a strange attractor appears. 
29ο 


The geometric background for this strange attractor in (17.67) by a cascade of period doubling is shown 
in Fig. 17.34a. The Poincaré section shows approximately a baker mapping, which suggests that a 
Cantor set-like structure is created. 


F(a) 


A Y. 


b) 


a 


Figure 17.34 


17.3.2.2 Intermittency 

Consider a stable periodic orbit of (17.67), which loses its stability for € = 0 when exactly one of 
the multipliers, for € < 0 inside the unit circle takes the value +1. According to the center manifold 
theorem, the corresponding saddle-node bifurcation of the Poincaré mapping can be described by a 


one-dimensional mapping in the normal form x > F (x,a) =a+x+4+a7+4+---. Here a is a parameter 
depending on g, i.e., a = α(ε) with a(0) = 0. The graph of F(-,a) for positive a is represented in 
Fig. 17.34b. 
As can be seen in Fig. 17.34b, the iterates of F (-,@) stay for a relatively long time in the tunnel zone 
for aZ, 0. For equation (17.67), this means that the corresponding orbits stay relatively long in the 
neighborhood of the original periodic orbit. During this time, the behavior of (17.68) is approximately 
periodic (laminar phase). After getting through the tunnel zone the considered orbit escapes, which 
results in irregular motion (turbulent phase). After a certain time the orbit is recovered and a new 
laminar phase begins. A strange attractor arises in this situation if the periodic orbit vanishes and 
its stability goes over to the chaotic set. The saddle-node bifurcation is only one of the typical local 
bifurcations playing a role in the intermittence scenario. Two further ones are period doubling and the 
creation of a torus. 


17.3.2.3 Global Homoclinic Bifurcations 


1. Smale's Theorem 

Let the invariant manifolds of the Poincaré mapping of the differential equation (17.67) in R? near the 
periodic orbit y be as in Fig. 17.11b, p. 873. The transversal homoclinic points P’ (xo) correspond 
to a homoclinic orbit of (17.67) to y. The existence of such a homoclinic orbit in (17.67) leads to 
a sensitive dependence on initial conditions. In connection with the considered Poincaré mapping, 
horseshoe mappings, introduced by Smale, can be constructed. This leads to the following statements: 


a) In every neighborhood of a transversal homoclinic point of the Poincaré mapping (17.80) there exists 
a periodic point of this mapping (Smale’s theorem). Hence, in every neighborhood of a transversal 
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homoclinic point there exists an invariant set of P" (m € IN), A, which is of Cantor type. The restriction 
of P" to A is topologically conjugate to a Bernoulli shift, i.e., to a mixing system. 

b) The invariant set of the differential equation (17.67) close to the homoclinic orbit is like a product of 
a Cantor set with the unit circle. If this invariant set is attracting, then it represents a strange attractor 
of (17.67). 

2. Shilnikov’s Theorem 
Consider the differential equation (17.67) in R? with scalar parameter e. Suppose that the system 
(17.67) has a saddle-node type hyperbolic steady state 0 at € = 0, which exists so long as |e| remains 
small. Suppose, that the Jacobian matrix D, f (0,0) has the eigenvalue A3 > 0 and a pair of conjugate 
complex eigenvalues λιο = a c iw with a < 0. Suppose, additionally, that (17.67) has a separatrix 
loop Yo for € = 0, i.e., a homoclinic orbit which tends to 0 for t — —oo and t + +00 (Fig. 17.35a). 


E€ 


Then, in a neighborhood of a separatrix loop (17.67) has the following phase portrait : 

a) Let As +a < 0. If the separatrix loop breaks at € # 0 according to the variant denoted by A in 
(Fig. 17.35a), then there is exactly one periodic orbit of (17.67) for e = 0. If the separatrix loop breaks 
at € Æ 0 according to the variant denoted by B in (Fig. 17.35a), then there is no periodic orbit. 

b) Let As +a > 0. Then there exist countably many saddle-type periodic orbits at € = 0 (respectively, 
for small |e|) close to the separatrix loop 4 (respectively, close to the destructed loop γρ). The Poincaré 
mapping with respect to a transversal to the γρ plane generates a countable set of horseshoe mappings 
at € = 0, from which there remain finitely many for small |e| 4 0. 


W'(pj) 


b) Wp) 


Figure 17.35 
3. Melnikov's Method 


Consider the planar differential equation 
t = f(x) 4 eg(t, x), (17.90) 
where ε is a small parameter. For ε = 0, let (17.90) be a Hamiltonian system (see 17.1.4.3, 2., p. 875), 


i.e., for f = (fi, fo) fi ae and fy = -- hold, where H: U C R? — R is supposed to be a C?- 
T2 Tı 


function. Suppose the time-dependent vector field g: R x U — R? is twice continuously differentiable, 
and T-periodic with respect to the first argument. Furthermore, let f and g be bounded on bounded 
sets. Suppose that for € = 0 there exists a homoclinic orbit with respect to the saddle point 0, and 
the Poincaré section X; of (17. 90) in the phase space { (£1, x5, t)) for t = to looks as in Fig. 17.35b. 
The Poincaré mapping P-4,: Xr — Σο. for small |e], has a saddle point p. close to x = 0 with the 
invariant manifolds W*(pz) and W " (pz). If the homoclinic orbit of the unperturbed system is given by 
p(t — to), then the distance between the manifold W*(p.) and W“(p-), measured along the line passing 
through y(0) and perpendicular to f(y(0)), can be calculated by the formula 


M (to) 
I FO) I 


Here, M(-) is the Melnikov function which is defined by 


d(to) = 


| 


+ O(e?). (17.91a) 


M(to) = | tte (t — to)) ^ g(t, e(t — to)) dt. (17.915) 


—oo 
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(For a = (a1,a2) and b = (bi,b2), ^ means a ^ b = ayb2 — aby.) If the Melnikov function M has a 
simple root at to, i.e., M (to) = 0 and M'(to) ¥ 0 hold, then the manifolds W*(p.) and W“(p-) intersect 
each other transversally for sufficiently small € > 0. If M has no root, then W*(p.) O W"(p.) = 0, i.e., 
there is no homoclinic point. 

Remark: Suppose the unperturbed system (17.90) has a heteroclinic orbit given by y(t — to), running 
from a saddle point 0; in a saddle 0». Let pl and p? be the saddle points of the Poincaré mapping P- to 
for small |e|. If M, calculated as above, has a simple root at to, then W*(p!) and W” (p) intersect each 
other transversally for small e > 0. 

W Consider the periodically perturbed pendulum equation 2 + sin x = e sin wt, i.e., the system 

t = y, Ὁ = —sinz + esinwt, in which € is a small parameter and w is a further parameter. The 


ME 
unperturbed system i = y, y = — sin x is a Hamiltonian system with H (x,y) = ου” — cosx. It has 


(among others) a pair of heteroclinic orbits through (—7, 0) and (7,0) (in the cylindrical phase space 


1 
S! x R these are homoclinic orbits) given by y*(t) = (22 arctan(sinh t), +2—— ) (t € R). The 
sh 


— 2n sinwto 
' cosh(zu/2) 
at to = 0, the Poincaré mapping of the perturbed system has transversal homoclinic points for small 
€ » 0. 


17.3.2.4 Destruction of a Torus 


1. From Torus to Chaos 

1. Hopf-Landau Model of Turbulence The problem of transition from regular (laminar) behav- 
ior to irregular (turbulent) behavior is especially interesting for systems with distributed parameters, 
which are described, e.g., by partial differential equations. From this viewpoint, chaos can be inter- 
preted as behavior irregular in time but ordered in space. 


direct calculation of the Melnikov function yields M (tọ) = 3 . Since M has a simple root 


On the other hand, turbulence is the behavior of the system, that is irregular in time and in space. The 
Hopf-Landau model explains the arising of turbulence by an infinite cascade of Hopf bifurcations: For 
€ = ει a steady state bifurcates in a limit cycle, which becomes unstable for £2 > ει and leads to a 
torus T?. At the k-th bifurcation of this type a k-dimensional torus arises, generated by non-closed 
orbits which wind on it. The Hopf-Landau model does not lead in general to an attractor which is 
characterized by sensitive dependence on the initial conditions and mixing. 
2. Ruelle-Takens-Newhouse Scenario Suppose that in system (17.67) n > 4 and | = 1 hold. 
Suppose also that changing the parameter ε, the bifurcation sequence “equilibrium point — periodic 
orbit — torus T? — torus T?" is achieved by three consecutive Hopf bifurcations. 
Let the quasiperiodic flow on T? be structurally unstable. Then, certain small perturbation of (17.67) 
can lead to the destruction of T? and to the creation of a structurally stable strange attractor. 
3. Theorem of Afraimovich and Shilnikov on the Loss of Smoothness and the 
Destruction of the Torus 7? Let the sufficiently smooth system (17.67) be given with n > 3 
and | = 2. Suppose that for the parameter value ερ, the system (17.67) has an attracting smooth torus 
T? (£o) spanned by a stable periodic orbit ys, a saddle-type periodic orbit γι and its unstable manifold 
W"" (y) (resonance torus). 
The invariant manifolds of the equilibrium points of the Poincaré mapping computed with respect to 
a surface transversal to the torus in the longitudinal direction, are represented in Fig. 17.36a. The 
multiplier p of the orbit γε, which is the nearest to the unit circle, is assumed to be real and simple. 
Furthermore, let e(-) : [0,1] — V be an arbitrary continuous curve in parameter space, for which 
ε(0) = ερ and for which system (17.67) has no invariant resonance torus for € = &(1). Then the 
following statements are true: 
a) There exists a value 5, € (0, 1) for which T?(e(s.)) loses its smoothness. Here, either the multiplier 
p(s.) is complex or the unstable manifold W” (y4) loses its smoothness close to ys. 
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b) There exists a further parameter value δε, € (5, 1) such that system (17.67) has no resonance torus 
for s € (Sx, 1]. The torus is destroyed in the following way: 

a) The periodic orbit ys loses its stability for € = ε(5..). A local bifurcation arises as period doubling 
or the formation of a torus. 

B) The periodic orbits γι and γε coincide for € = £(s,.) (saddle-node bifurcation) and so they vanish. 
^j) The stable and unstable manifolds of γα intersect each other non-transversally for € = ε(5..) (see 
the bifurcation diagram in Fig. 17.36c). The points of the beak-shaped curve Sı correspond to the 
fused ys and γι (saddle-node bifurcation). The tip C1 of the beak-shaped curve is on a curve Sp, which 
corresponds to a splitting of the torus. 
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Figure 17.36 


The parameter points where the smoothness is lost, are on the curve 55, while the points on $4 charac- 
terize the dissolving of a T? torus. The parameter points for which the stable and unstable manifolds of 
Yu intersect each other non-transversally, are on the curve $4. Let Pj be an arbitrary point in the beaked 
shaped tip of the beak such that for this parameter value a resonance torus 7? arises. The transition 
from P) to P, corresponds to the case a) of the theorem. If the multiplier p becomes —1 on $5, then 
there is a period doubling. A cascade of further period doublings can lead to a strange attractor. Ifa 
pair of complex conjugate multipliers p.» arises on the unit circle passing through S5, then it can result 
in the splitting of a further torus, for which the Afraimovich-Shilnikov theorem can be used again. 
The transition from Pp to P» represents the case 3) of the theorem: The torus loses its smoothness, and 
on passing through on 5;, there is a saddle-node bifurcation. The torus is destroyed, and a transition 
to chaos through intermittence can happen. The transition from P» to P5, finally, corresponds to the 
case y): After the loss of smoothness, a non-robust homoclinic curve forms on passing through on S4. 
The stable cycle ys remains and a hyperbolic set arises which is not attracting for the present. If +, 
vanishes, then a strange attractor arises from this set. 
2. Mappings on the Unit Circle and Rotation Number 
1. Equivalent and Lifted Mappings The properties of the invariant curves of the Poincaré map- 
ping play an important role in the loss of smoothness and destruction of a torus. If the Poincaré mapping 
is represented in polar coordinates, then, under certain assumptions, one gets decoupled mappings of 
the angular variables as informative auxiliary mappings on the unit circle. These are invertible in the 
case of smooth invariant curves (Fig. 17.36a) and in the case of non-smooth curves (Fig. 17.36b) 
they are not invertible. A mapping F: R > R with F(O + 1) = Ρ(Θ) + 1forall O € R, which 
generates the dynamical system 

Oni = F(0,), (17.92) 
is called equivariant. For every such mapping, an associated mapping of the unit circle f: 6} — St 
can be assigned where St = R \ Z = {@ mod 1,0 € R}. Here f(x) := F(O) if the relation x = [O] 
holds for the equivalence class [O]. F is called a lifted mapping of f. Obviously, this construction is not 
unique. In contrast to (17.92) 


Tua = f(x) (17.93) 
is a dynamical system on ο). 


E Fortwo parameters w and K let the mapping F (τω, K) be defined by F (σιω, K) = o+w-— K sino 
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for all τ € R. The corresponding dynamical system 
On41 = On +w — K sino; (17.94) 
can be transformed by the transformation ση = 27O, into the system 


K 
Ony = On +Q- = sin 270, (17.95) 
π 
where Q = πη With Ρ(Θ;Ώ, K) = O4-0— 5, sin 270 an equivariant mapping arises, which generates 
π 2π 
the canonical form of the circle mapping. 


2. Rotation Number The orbit γ(Θ) = (F"(O)) of (17.92) is a q-periodic orbit of (17.93) in S! if 
and only if it is a E cycle of (17.92), i.e., if there exists an integer p such that O,,, = On + p, (n € Z) 
q 


holds. The mapping f: S! — S! is called orientation preserving if there exists a corresponding lifted 
mapping F', which is monotone increasing. If F from (17.92) is a monotone increasing homeomorphism, 
n 


then there exists the limit lim for every x € R, and this limit does not depend on x. Hence, 
n| 


oo 
n (oy 


the expression p(F) :— lim can be defined. If f: S1 — S! is a homeomorphism and F and 


|n|>oo 


F are two lifted mappings of f, then p(F) = p(F) + k holds, where k is an integer. Based on this last 
property, the rotation number p( f) of an orientation-preserving homeomorphism f: S! — S! can be 
defined as p( f) = p(F) mod 1, where F is an arbitrary lifted mapping of f. 

If f: S! > S! in (17.93) is an orientation-preserving homeomorphism, then the rotation number has 
the following properties (see [17.4]): 


a) If (17.93) has a q-periodic orbit, then there exists an integer p such that p(f) — P holds. 
q 


b) If p(f) = 0, then (17.93) has an equilibrium point. 
c) If (f) = £ , where p ¥ 0 is an integer and q is a natural number (p and q are co-primes), then 
q 


(17.93) has a q-periodic orbit. 
d) p(f) is irrational if and only if (17.93) has neither a periodic orbit nor an equilibrium point. 
Theorem of Denjoy: If f: St — S! is an orientation-preserving C?-diffeomorphism and the rotation 
number a = p( f) is irrational, then f is topologically conjugate to a pure rotation whose lifted mapping 
is F(r) 2 r4 a. 
3. Differential Equations on the Torus 7? 
Let 

€; = fi(61,05), €» = f2(61,02) (17.96) 
be a planar differential equation, where fı and f» are differentiable and 1-periodic functions in both 
arguments. In this case (17.96) defines a flow, which can also be interpreted as a flow on the torus 
T? = S! x S! with respect to Θι and Os. If f;(O1,@2) > 0 for all (Θι, 95), then (17.96) has no 
equilibrium points and it is equivalent to the scalar first-order differential equation 

dO; Ἠί(Θι, Θα) 


dO, Γίθι.Θα)᾽ (17.97) 
fo 


With the relations ΘΙ = t, O9 = x and f = ^ 
Jı 


(17.97) can be written as a non-autonomous differential 


equation 


t= f(t, Ὁ) (17.98) 
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whose right-hand side has a period of 1 with respect to t and 2. 
Let φ(-, xo) be the solution of (17.98) with initial state zo at time t = 0. So, a mapping ¢'(-) = e(1,-) 
can be defined for (17.98), which can be considered as the lifted mapping of a mapping f: S! > ΟΊ, 


W Let w, w € R be constants and Θι = wi, Oo = ων a differential equation on the torus, which 


, : ; ; Rue Ὁ w; ω: 
is equivalent to the scalar differential equation 3 = 12 for ωι # 0. Thus, e(t, xo) = Et zo and 
Wy Wy 


ez) = γα, 
Wy 
4. Canonical Form of a Circle Mapping 
1. Canonical Form The mapping F from (17.95) is an orientation-preserving diffeomorphism for 
0 € K « 1, because z = ] — Kcos2rv > 0 holds. For K = 1, F is no-longer a diffeomorphism, 


but it is still a homeomorphism, while for K > 1, the mapping is not invertible, and hence no-longer a 
homeomorphism. In the parameter domain 0 < K < 1, the rotation number p(Q, K) := p(F(-,Q, K)) 
is defined for F(-,Q, K). Let K € (0,1) be fixed. Then p(-, K) has the following properties on [0.1]: 
a) The function p(-, K) is not decreasing, it is continuous, but it is not differentiable. 


- : p " ; TRE 
b) For every rational number = € (0, 1) there exists an interval [,/,, whose interior is not empty and 
q 


for which p(Q, K) = 7 holds for all Ω € Ip/q. 


c) For every irrational number a € (0, 1) there exists exactly one Ω with p(Q, K) — a. 

2. Devil'sStaircase and Arnold Tongues For every K € (0, 1), p(-, A’) is a Cantor function. The 
graph of p(-, K), which is represented in Fig. 17.37b, is called the devil’s staircase. The bifurcation 
diagram of (17.95) is represented in Fig. 17.37a. At every rational number on the Q-axis, a beak- 


shaped region ( Arnold tongue) with a non-empty interior starts, where the rotation number is constant 
and equal to the rational number. 
K 1 1 1 pc, KA 
3 2 3 1 E 
: 08Γ ~ 
0.6} E 
0.4 ye 
> 0.2 rand 
1 1 1 Q á L i L L 
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Figure 17.37 


The reason for the formation of the tongue is a synchronization of the frequencies (frequency locking). 
a) For 0 € K < 1, these regions are not overlapping. At every irrational number of the Q-axis, a 
continuous curve starts which always reaches the line K = 1. In the first Arnold tongue with p = 0, 
the dynamical system (17.95) has equilibrium points. If K is fixed and Q increases, then two of these 
equilibrium points fuse on the boundary of the first Arnold tongue and vanish at the same time. Asa 
result of such a saddle-node bifurcation, a dense orbit arises on St. Similar phenomena can be observed 
when leaving other Arnold tongues. 

b) For K > 1 the theory of the rotation numbers is no-longer applicable. The dynamics become 
more complicated, and the transition to chaos takes place. Here, similarly to the case of Feigenbaum 
constants, further constants arise, which are equal for certain classes of mappings to which also the 
standard circle mapping belongs. One of them is described below. 
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Ψ5--1 
2 


3. Golden Mean, Fibonacci Numbers The irrational number is called the golden mean 


; ; . Ρ , 5—1 1 
and it has a simple continued fraction representation = po = [1;1,1,...] (see 1.1.1.4, 
2 Lpi 
tu 
3., p. 4). By successive evaluation of the continued fraction a sequence {1} of rational numbers is got, 


ν5-1 Εν 
2 


. The numbers r,, can be represented in the form rn = , where £j, are 
n+l 

Fibonacci numbers (see 5.4.1.5, p. 375), which are determined by the iteration Fi; = Fa + Fh- (n = 

1,2,---) with initial values Fp = 0 and F; = 1. Now, let Qa be the parameter value of (17.95), for which 


"m 
2 


which converges to 


p(Q5,,1) = and let Ωμ be the closest value to Qc, for which p(Qn, 1) = rn holds. Numerical 


Qn = Ona 


= —2.8336.... 
nH T Qr 


calculation gives the limit lim 
n 


18 Optimization 


18.1 Linear Programming 


18.1.1 Formulation of the Problem and Geometrical 
Representation 


18.1.1.1 The Form of a Linear Programming Problem 
1. TheSubject 


of linear programming is the minimization or maximization of a linear objective function (OF) of finitely 
many variables subject to a finite number of constraints (CT), which are given as linear equations or 
inequalities. 

Many practical problems can be directly formulated as a linear programming problem, or they can be 
modeled approximately by a linear programming problem. 

2. General Form 


A linear programming problem has the following general form: 
OF: f(x) = εἰσι He t em + οργιά γι tod ec, = max! (18.1a) 
CT: 141 κ A "ο. F Ong SR nfa < bi 
Gs 101 Md Asrty T Ag rt 1 Ον RER As nTn < bs 
Asti 101 Ttt Asti rr T Astirtitrq1 Te T Asti nEn = bs+1, (18 1b) 
Am 11 Ue Am, rr T Amrt1r+1 ΣΝ anntn = bm, 
θη. 2 NEM Lr41,---;Zn free. 


In a more compact vector notation this problem becomes: 


1T. 1 


9T 5 D 
OF: f(x)=c x! +c’? x’ = max! (18.2a) CT: Aux! + Appx? < bt, 
: 2 
AnXxb + Αρκ’ = b, (18.2b) 
. . x! 20, x? free 
Here, the following notations are used: 
σι Cr+ Ti Tr41 
C2 i Cr+2 πο 7 Tr42 
c= eU xta. ρα πο g (18.269) 
Be Cn By Ty 
Q11 Q12 "7^ Air Ω1,» 11 1,742 777 Ain 
291 22 '** Gar Q2r41 G2r42 *** Gn 
Αιξ. ; Ap=]. : (18.2d) 
Qs, 1 Qs2 "ΙΙ Agr Asrt1 Asr42 °°" Asn 
@s41,1 Qs+1,2 °°" Astiyr Qs+1,r+1 Qs+1,r+2 777 Astin 
153,1 05422 ''* Qs+2,r 152751 As+2,r42 η 
An =|. An =|]. (18.90) 
Am,1 Gm,2 tt Amr Am,r+1 Am, r+2 ''' Amn 


3. Constraints 

with the inequality sign “ >” will have the above form if they are multiplied by (—1). 
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4. Minimum Problem 
Α minimum problem f(x) — min! becomes an equivalent maximum problem by multiplying the ob- 
jective function by (—1) 

—f(x) = max! (18.3) 


5. Integer Programming 
Sometimes certain variables are restricted to be only integers. This discrete problem is not discussed 
here. 

6. Formulation with only Non-Negative Variables and Slack Variables 

Inapplying certain solution methods, only non-negative variables are considered, and constraints (18.1b), 
(18.2b) given in equality form. 


OF: f(x) = cri t: cux, = max! (184a 
Every free variable 7, must be decomposed into the C b 
difference of two non-negative variables z4 = x} — T: aati bb ms = bi, 
xz. The inequalities become equalities by adding : : : 
non-negative variables; they are called slack vari- Qmit1 b gnis = bm, (18.4b 
ables. That is, the problem is considered in the 
form as given in (18.4a,b), where n is the increased 11 2 0,...,24, 2 0. 
number of variables. In vector form: 
OF: f(x) = εἶχ = max! (18.5a) CT: Ax=b, x70. (18.5b 
The relation m < n can be supposed, otherwise the system of equations contains linearly dependen 


or contradictory equations. 

7. Feasible Set 

The set of all vectors x satisfying constraints (18.2b) is called the feasible set of the original problem. 
If the free variables are rewritten as above, and every inequality of the form “<” into an equation as in 
(18.4a) and (18.4b), then the set of all non-negative vectors x 2 0 satisfying the constraints is called 
the feasible set M: 


M={xe R":x>0, Ax— bJ. (18.6a) 
A point x* € M with the property 
f(x") > f(x) for every x€ M (18.6b) 


is called the maximum point or the solution point of the linear programming problem. Obviously, the 
components of x not belonging to slack variables form the solution of the original problem. 
18.1.1.2 Examples and Graphical Solutions 


1. Example of the Production of Two Products 
Suppose primary materials R,, R2, and R3 are needed to produce two products E; and Ey. Scheme 


18.1 shows how many units of primary materials are needed to produce each unit of the products Ej 
and E», and there are given also the available amount of the primary materials. 

Selling one unit of the products E; or E» results 

in 20 or 60 units of profit, respectively (PR). Scheme 18.1 | Rı / E; | Πο /E; | Πα / Ei 
Determine a production program which yields E 12 8 0 
maximum profit, if at least 10 units must be pro- Í 

duced from product ΕΙ. E 6 12 10 
Denoting by xı and x2 the number of units pro- Amount 630 620 350 


duced from E, and E», the problem is: 


OF: f(x) = 20a, + 601» = max! 
CT: 12%, + 6x2 < 630, 
82, + 121» < 620, 

102» < 350, 

zı > 10. 
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Introducing the slack variables 23, v4, £5, 26, one gets: 


OF: f(x) = 20x, + 60% + 0-23 + 0-24 + 0-45 + 0-26 = max! 
CT: 12%,+ 6%2.+ 23 Ξ 630, 
82, + 121» + αι = 620, 

10x» + ο Ξ 7350), 

—n + ^ rg = —10. 


2. Properties of a Linear Programming Problem 

On the basis of this example, some properties of the linear pro- ^m 
gramming problem can be demonstrated by graphical representa- 
tion. Here the slack variables are not considered; only the original 35 
two variables are used. 

a) Alineaz--aa2x9 = b divides the 2, xə plane into two half-planes. 25 
The points (σι, £2) satisfying the inequality a124--a2r3 < b arein one M 

ofthese half-planes. The graphical representation of this set of points t t 

in a Cartesian coordinate system can be made by a line, and the half- > 
plane containing the solutions of the inequalities is denoted by an ar- OL! 40 χι 
row. The set of feasible solutions M, i.e., the set of points satisfying 
all inequalities is the intersection of these half-planes (Fig. 18.1). Figure 18.1 

In this example the points of M form a polygonal domain. It may happen that M is unbounded or 
empty. If more then two boundary lines go through a vertex of the polygon, this vertex is called a 
degenerate vertex (Fig. 18.2). 


0| τ bl: t 0 χι 


Figure 18.2 


b) Every point in the x1, 20 plane satisfying the equality f(x) = 20x, + 601» = co is on one line, on 
the level line associated to the value cg. With different choices of co, a family of parallel lines is defined, 
on each of which the value of the objective function is constant. Geometrically, those points are the 
solutions of the programming problem, which belong to the feasible set M and also to the level line 
2021 + 6022 = co with maximal value of co. In this example, the solution point is (x1, x2) = (25,35) 
on the line 2011 + 6023 = 2600. The level lines are represented in Fig. 18.3, where the arrows point 
in the direction of increasing values of the objective function. 

Obviously, if the feasible set M is bounded, then there is at least one vertex such that the objective 
function takes its maximum. If the feasible set M is unbounded, it is possible that the objective function 
is unbounded, as well. 


18.1.2 Basic Notions of Linear Programming, Normal Form 
Now, the problem (18.5a,b) is considered with the feasible set M . 
18.1.2.1 Extreme Points and Basis 


1. Definition of the Extreme Point 
A point x € M is called an extreme point or vertex of ΜΙ, if for all xj, x; € M with x, Z x»: 
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xzAxit(l—-A)x, 0«A«1, (18.7) 
i.e., X is not on any line segment connecting two different points of M. 
2. Theorem about Extreme Points 
'The point x € M is an extreme point of M if the columns of matrix A associated to the positive 
components of x are linearly independent. 
If the rank of A is rn, then the maximal number of independent columns in A is m. So, an extreme 
point can have at most m positive components. The other components, at least n — m, are equal to 
zero. In the usual case, there are exactly rn positive components. If the number of positive components 
is less then m, it is called a degenerate extreme point. 


P 47 (42,0) 


Figure 18.3 Figure 18.4 
3. Basis 


To every extreme point m linearly independent column vectors of the matrix A can be assigned, the 
columns belonging to the positive components. This system of linearly independent column vectors is 
called the basis of the extreme point. Usually, exactly one basis belongs to every extreme point. However 
: . TL nog 
several bases can be assigned to a degenerate extreme point. There are at most (5) possibilities to 


choose m linearly independent vectors from n columns of A. Consequently, the number of different 


bases, and therefore the number of different extreme points is ( K ) . If M is not empty, then M has at 
least one extreme point. CT: tı + aot 252 1, 
x: <2, 
. DIS AE = max! T2 S 4, 

W OF: f(x) = 2a, + 3x2 + 4r; = max! ci xs ed (18.8) 


221 — 329 + 213 < 2. 


The feasible set M determined by the constraints is represented in Fig. 18.4. Introduction of slack 
variables r4, £5, xg, 7 leads to: 


CT: πι t+ σα — da SN 
20 + Ts τοι 

-ᾱι + 2x3 + T6 = 2, 

221 — 3x9 + 278 + a7 = 2. 


The extreme point P) = (0,1, 0) of the polyhedron corresponds to the point x = (£1, £2, 13, x4, T5, to, 
$7) = (0, 1,0,0, 1, 2, 5) of the extended system. The columns 2, 5, 6 and 7 of A form the corresponding 
basis. The degenerated extreme point P corresponds to (1, 0, 0, 0, 2, 3, 0). A basis of this extreme point 
contains the columns 1,5, 6 and one of the columns 2, 4 or 7. 
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Remark: Here, the first inequality was a “>” inequality and a4 was not added but subtracted. Fre- 
quently these types of additional variables both with a negative sign and a corresponding b; > 0 are 
called surplus variables, rather than slack variables. As will be seen in 18.1.3.3, p. 916, the occurrence 
of surplus variables requires additional effort in the solution procedure. 


4. Extreme Point with a Maximal Value of the Objective Function 

Theorem: If M is not empty, and the objective function f(x) = c"x is bounded from above on M, 
then there is at least one extreme point of M where it has its maximum. 
A linear programming problem can be solved by determining at least one of the extreme points with 
maximum value of the objective function. Usually, the number of extreme points of M is very large 
in practical problems, so a method is needed by which the solution can be found in a reasonable time. 
Such a method is the simplex method which is also called the simplex algorithm or simplex procedure. 


18.1.2.2 Normal Form of the Linear Programming Problem 


1. Normal Form and Basic Solution 
The linear programming problem (18.4a,b) can always be transformed to the following form with a 
suitable renumbering of the variables: 


OF:  f(x)-— oem c: somos co = max! (18.9a) 
CT: A410, te + G1 n—mEn—m + Un—m+1 = by ] 
EM denas 18.9b 
Om, 131 ERU aT Üm.n—mUn—m Tox = bm ; ( ) 
T1,- , n-m, Tn—m+1;- - -Tn = 0. 


The last m columns of the coefficient matrix are obviously independent, and they form a basis. The 
basic solution (σι, %2,.--,®n—m;€n—m41;+++;%n) = (0,...,0,b1,...,bm) can be determined directly 
from the system of equations, but if b > 0 does not hold, it is not a feasible solution. 

If b > O, then (18.9a,b) is called a normal form or canonical form of the linear programming problem. 
In this case, the basic solution is a feasible solution, as well, i.e., x > 0, and it is an extreme point of M. 
The variables x1, ..., &n-m are called non-basic variables and x, 4,44... . , &n are called basic variables. 
The objective function has the value co at this extreme point, since the non-basic variables are equal 
to zero. 

2. Determination of the Normal Form 

If an extreme point of M is known, then a normal form of the linear programming problem (18.5a,b) 
can be obtained in the following way. A basis is chosen from the columns of A corresponding to the 
extreme point. Usually, these columns are determined by the positive components of the extreme point. 
Suppose the basic variables are collected into the vector xg and the non-basic variables are in xy. The 
columns associated to the basis form the basis matrix Ap, the other columns form the matrix Ay. 
Then, 


Ax = Ayxy + Apxp = b. (18.10) 


The matrix Ag is non-singular and it has an inverse A, the so-called basis inverse. Multiplying 


(18.10) by Ag! and changing the objective function according to the non-basic variables results in the 
canonical form of the linear programming problem: 


OF: f(x) =cyxn +00, (18.1 1a) 
CT: Aglxxy-- xg = Agb with xy >0, xg 70. (18.11b) 


Remark: If the original system (18.1b) has only constraints of type “<” and simultaneously b > 0, 
then the extended system (18.4b) contains no surplus variables (sce 18.1.2.1, p. 911). In this case a 
normal form is immediately known. Selecting all slack variables as basic variables xg the result is 
Ap = Land xg = b and xy = 0 is a feasible extreme point. 
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W In the above example x = (0, 1,0,0, 1,2, 5) is an extreme point. Consequently: 


1000 11 -1 
1100 ELIT 00 0 
Ap- 00 1 07? Aa = ' Ayv=|{-l 2 0J? (18.12a) 
-3 0 0 1 Tc. 22 0 
"ND 3001 keeps Jn 
10 X5 Te T7 πι X3 X4 
1 1-1 
-1-1 1 i 
AgAw-|-12 Of, Az b=| 9 (18.12b) 
5 5 —3 E 
5 
TQ X3 T4 
πι + T2 + X3— T4 = 
Ti 23 Z4 + T5 Ld 
—24 + 223 + ας = 2; (18.13) 
5x1 + ὅσα — 324 Tor; = 5. 
From f(x) = 211 + 313 + 43 the transformed objective function 


f(x) = —z1 + z3 + 374 +3 (18.14) 
is obtained, if the triple of the first constraint is subtracted. 


18.1.3 Simplex Method 
18.1.3.1 Simplex Tableau 


The simplex method is used to produce a sequence of extreme points of the feasible set with increasing 
values of the objective function. The transition to the new extreme point is performed starting from the 
normal form corresponding to the given extreme point, and arriving at the normal form corresponding 
to the new extreme point. In order to get a clear arrangement, and easier numerical performance, the 
normal form (18.9a,b) is represented in the simplex tableau (Scheme 18.2a, 18.2b): 


Scheme 18.2a Scheme 18.2b 
Sp ot nem or briefly XN 
Tn-m+1| 41,1 τπτ Qln-m by XB An b 
: : : € |-0 
Tn Am,1 *** Gmn—m bm 
Ci +7" Cn-m | 60 


The k-th row of the tableau corresponds to the constraint 


Ln—mtk + GkaXi + +++ + kno mUn-m = bp. (18.15a) 
The objective function is 
C11 +*+ + Cn-mZn-m = f(x) — co. (18.15b) 


From this simplex tableau, the extreme point (xy, X5) — (0, b) can befound. The value ofthe objective 
function at this point is f(x) = co. To put down —c into the right below vertex of the tableau is 
advantageous for carrying out the simplex method. In every tableau always exactly one of the following 
three cases can be found: 

a)c; < 0,7 — L...,n — m: The tableau is optimal. The point (xy, X5) = (0, b) is a maximal point. 
If all the c; are positive, then this vertex is the only maximal point. 

b) There exists at least one j such that c; > 0 and αμ € 0, i = 1,...,m: The linear programming 
problem has no solution, since the objective function is not bounded on the feasible set; for increasing 
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values of x; it increases without a bound. 

c) For every j with c; > 0 there exists at least one i with a;; > 0: It is possible to move from the 
extreme point x to a neighboring extreme point x with f(x) > f(x). In the case of a non-degenerate 
extreme point x, the “>” sign always holds. 


18.1.3.2 Transition to the New Simplex Tableau 


1. Non-Degenerate Case 
If a tableau is not in final form (case c)), then a new tableau (Scheme 18.3) is determined. A basic 
variable x, and a non-basic variable x, are interchanged by the following calculations: 


1 
a) ùp = —. (18.164) 
apq 
b) ἄρ; = apj ` Apes j£ by = b, + ἄρα. (18.16b) 
ο) Gig = —ig ἄμα, i p, & = —Cq + ἅμα. (18.16c) 
d) ἄν = αν + Ap; * Gig, i Æp, JAG 
bi = bi + bp Gig, dp Gj = cj + apj Gy, j Æq, —Čo = —co + bp ` 6, (18.164) 


The element aj, is called the pivot element , the p-th row is the pivot row, and the q-th column is the 
pivot column . For the choice of a pivot element the following two requirements are to be considered: 
a) čo > co should hold; 

b) the new tableau must also correspond to a feasible solution, i.e., b > 0 must hold. 

Then, (Xy, Xg) = (0, b) is a new extreme point at which the value of the objective function f(x) = Co 
is not smaller than it was previously. These conditions are satisfied if the pivot element is chosen in the 
following way: 

a) To increase the value of the objective function, a column with c, > 0 can be chosen for a pivot 
column; 


b) to get a feasible solution, the pivot row must be chosen as 


b : bi 

P= min (=) ; (18.17) 
Ώρα 1<i<m | Gig 

αια»0 

If the extreme points of the feasible set are not degenerate, then the simplex method terminates in a 
finite number of steps (case a) or case b)). 
E The normal form in 18.1.2, p. 911ff can be written in a simplex tableau (Scheme 18.4a). This 
tableau is not optimal, since the objective function has a positive coefficient in the third column. The 
third column is assigned as the pivot column (the second column could also be taken under considera- 


tion). The quotients b;/aj, are calculated with every positive element of the pivot column (there is only 
one of them). The quotients are denoted behind the last column. The smallest quotient determines the 
pivot row. 
Scheme 18.3 Scheme 18.4a Scheme 18.4b 
XN πι T3 T4 TQ T3 T5 
XplAn| b gal 1 1-}1| 1 πα. 0 0 1| 2 
€ |-à 75| -1-1 1] 1 133 241-1 1| 1 
r;|—1 2 012 ας -ἶ 2 0 2 2 
v7 5 5-3 δ τ] 2 2 3) 8 
—1 1 3 -3 2 4-3 |--θ 
If it is not unique, then the extreme point corresponding to the new tableau is degenerate. After per- 
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forming the steps of (18.16a)—(18.16d) the tableau in Scheme 18.4b is obtained. This tableau deter- 
mines the extreme point (0,2,0,1,0,2,8), which corresponds to the point P; in Fig. 18.4. Since this 
new tableau is still not optimal, xe and x3 are interchanged (Scheme 18.4c). The extreme point of the 
third tableau corresponds to the point Ps in Fig. 18.4. After an additional change an optimal tableau 


is obtained (Scheme 18.4d) with the maximal point x* — (2, 2, 2,5,0,0, 0), which corresponds to the 
point P5, and the objective function has a maximal value here: f(x*) = 18. 
Scheme 18.4c Scheme 18.4d Scheme 18.5 
πι Te X5 T αρ T5 Tp ot Tn 
w| 0 0 2 22 0 90 I 2 Vp od c Gp bi 
3 1 1 5 i : 
a 2 2 i 2 5 2 Ym | Am Amn bm 
T d E d ἩἹ 
πα |-_ 5 ners OF ὃν 0 
ΝΙΝ. Μα ο... z — —x —z 
απ. 3-1 3| 6 6:3 1 1 OF" 5; aja aj5| ΣΣ bj = —g(0, b) 
= Tı ==> 1 2 j-l j- 251 
4—2 —3 |-10 1 ; 
—-.-=-7 |-18 
3 3 


2. Degenerate Case 

If the next pivot element cannot be chosen uniquely in a simplex tableau, then the new tableau rep- 
resents a degenerate extreme point. A degenerate extreme point can be interpreted geometrically as 
the coincident vertices of the convex polyhedron of the feasible solutions. There are several bases for 
such a vertex. In this case, it can therefore happen that some steps are performed without reaching a 
new extreme point. It is also possible that one gets a tableau that has occurred already before, so an 
infinite cycle may occur. 

In the case of a degenerate extreme point, one possibility is to perturb the constants b; by adding &' 
(with a suitable £^ > 0) such that the resulting extreme points are no longer degenerate. The solution 
can be got from the solution of the perturbed problem, if ¢ = 0 is substituted. 

If the pivot column is chosen “randomly” in the non-uniquely determined case, then the occurrence of 
an infinite cycle is unlikely in practical cases. 


18.1.3.3 Determination of an Initial Simplex Tableau 


1. Secondary Program, Artificial Variables 

If there are equalities among the original constraints (18.1b) or inequalities with negative b;, then it is 
not easy to find a feasible solution to start the simplex method. In this case, one starts with a secondary 
program to produce a feasible solution, which can be a starting point for a simplex procedure for the 
original problem. The system Ax — b is modified by multiplying some of the equations with (—1) in 
order to satisfy the condition b > 0. Now, an artificial variable yy > 0 (k = 1,2,..., m) is added to 
every left-hand side of Ax — b with b 2 0, and the secondary program is considered: 


ΟΕ”: g(x,y) = —y — +++ — Ym = max! (18.18a) 
aiti Fee + aimn + Y1 = b, 
cus Amity +++++ mnn + Ym = bm, (18.18b) 


$155,444 = 0; Uis Um = 0. 
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For this problem, the variables y1, . . . , Ym are basic variables, and one can start the first simplex tableau 
(Scheme 18.5). The last row of the tableau contains the sums of the coefficients of the non-basic 
variables , and these sums are the coefficients of the new secondary objective function OF*. Obviously, 
g(x,y) € 0 always. If g(x*,y*) = 0 for a maximal point (x*, y*) of the secondary problem, then 
obviously y* = 0, and consequently x* is a solution of Ax = b. If g(x*, y*) < 0, then Ax = b does not 
have any solution. B 

2. Solution of the Secondary Program 

Our goal is to eliminate the artificial variables from the basis. A scheme is prepared not only for the 
secondary program separately. The original tableau is completed by columns of the artificial variables 
and the row of the secondary objective function. The secondary objective function now contains the 
sums of the corresponding coefficients from the rows corresponding to the equalities, as shown below. 
If an artificial variable becomes a non-basic variable, its column can be omitted, since it will be never 
chosen again as a basis variable. If a maximal point (x*, y*) is determined, then two cases are distin- 
guished: B 

1. οκ”, y*) < 0: The system Ax = b has no solution, the linear programming problem does not have 
any feasible solution. 

2. g(x*,y*) = 0: If there are no artificial variables among the basic variables, this tableau is an ini- 
tial tableau for the original problem. Otherwise all artificial variables among the basic variables are 
removed by additional steps of the simplex method. 


By introducing the artificial variables, the size of the problem can be increased considerably. It is not 
necessary to introduce artificial variables for every equation. If the system of constraints before in- 
troducing the slack and surplus variables (see Remark in 18.1.2, 3., p. 913) has the form Aix > bj, 
ΑΟΧΞ by, Asx € bs with b, by, bs > 0, then artificial variables must be introduced only for the first 
two systems. For the third system the slack variables can be chosen as basic variables. 

W In the example of 18.1.2, p. 912, only the first equation requires an artificial variable: 


ΟΕ: g(x,y) = — yı = max! 
CT*: zı + T2 + 21 — T4 + Yı zd 
T2 + 26 = 2; 
—=gi + 2x3 + Te = 2; 
211 — ὅπο + 2x3 + 27 = 2. 
The tableau (Scheme 18.6b) is optimal with g(x*, y*) = 0. After omitting the second column the 
first tableau of the original problem is obtained. 
Scheme 18.6a Scheme 18.6b 
πι T2 T3 T4 Z1 Yı T3 T4 1 
γι "11-51 29 lop Τ-1 |. 
25 01002 αι T5 -1-1-1 1) 1 
ας |-1 02 0) 2 26 -1 0 2 0) 2 
Xz 2-32 0] 2 T7 5 3 5-3] 5 
OF 2 3 4 010 OF -1-3 1 3 |-3 
ΟΕ! 1 11-1 1 OF* 0-1 0 0,0 


18.1.3.4 Revised Simplex Method 
1. Revised Simplex Tableau 


Suppose the linear programming problem is given in normal form: 


OF: = f(x) = cz Cu πιῶη- πι + Co = max! (18.19a) 
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CT: 1101 vim Oi;n—mÜn-m + Tn-m+1 = fi, 


(18.19b) 


AmAT1 sr Om,n-mUn-m + t= Bry 


ω ως 0. 
Obviously, the coefficient vectors aj, ,,,; (i= 1,...,n) are the i-th unit vectors. 
In order to change into another normal form and therefore to reach another extreme point, it is suffi- 
cient to multiply the system of equations (18.19b) by the corresponding basis inverse. (Recall the fact 
that if Ag denotes a new basis, then the coordinates of a vector x can be expressed in this new basis 
as Ag!x. If the inverse of the new basis is known, then any column as well as the objective function 
from the very first tableau can be got by simple multiplication.) The simplex method can be modified 
so that only the basis inverse is determined in every step instead of a new tableau. From every tableau 
only those elements are calculated which are Scheme 18.7 
required to find the new pivot element. If 
the number of variables is considerably larger 
than the number of constraints (n > 3m), x? αι πα ° Qin | bi τι 
then the revised simplex method requires : 
considerably less computing cost and there- 
fore has better accuracy. Lm Qmn-m+1 7 Amn | bm | Tm 
The general form of a revised simplex tableau 
is shown in Scheme 18.7. 
The quantities of the scheme have the following meaning: 


νον Tn-m+1 ` Tn Tq 


Cy ++" Cn—m Cn-m+1 "7 Cn το Cq 


: Actual basic variables (in the first step the same as z, 4444: v). 
C1,--+;C€n τ Coefficients of the objective function (the coefficients associated to the basic variables 
are zeros). 


bi,...,6m +: Right-hand side of the actual normal form. 
co : Value of the objective function at the extreme point (zP,... zB) = (δι... Dm). 
Q1n—m+1 77^ Aln - 
Ata . : . Actual basis inverse, where the columns of A* are the columns of 
` Ln—mti;+--;¥n corresponding to the actual normal form; 
Üm,n—m-4 °°" Amn 
r=(rj,...,7%m)*: Actual pivot column. 


2. Revised Simplex Step 


a) The tableau is not optimal when at least one of the coefficients c; (j = 1,2,...,n) is positiv. A 
pivot column q is chosen for a c, 7 0. 

b) One calculates the pivot column r by multiplying the q-th column of the original coefficient matrix 
(18.19b) by A* and introduces the new vector as the last vector of the tableau. 

The pivot row k is determined in the same way as in the simplex algorithm (18.17). 

c) The new tableau is calculated by the pivoting step (18.16a-d), where a;, is formally replaced by r; 
and the indices are restricted for n — m +1 € j < n. The column r is omitted. z, becomes a basic 


variable. For j = 1,...,n — m, the results are čj = cj + αἲ 6, where č = (C, 5a... ., Čn)™, and αι is 
the j-th column of the coefficient matrix of (18.19b). 


W Consider the normal form of the example in 18.1.2, p. 912. One wants to bring x4 into the basis. 
The corresponding pivot column r = αι is placed into the last column of the tableau (Scheme 18.88) 
(initially A* is the unit matrix). 


For j = 1,3, 4 one gets €; = c; — ας: (c1, C3, c4) = (2,4,0). 
The determined extreme point x = (0, 2,0, 1,0, 2, 8) corresponds to the point P; in Fig. 18.4, p. 912. 
The next pivot column can be chosen for j = 3 = q. 
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Scheme 18.8a Scheme 18.8b 
πι Xa L4 (X9 X5 Le X7 T4 Xi X3 T4 πο T5 Le T7 23 
πο 1 0 0 0|1 -1 T2 1 100/2] 0 
25 010 0/1) 1 1:1 X4 0 100/|1/|-1 
Te 00 1901210 T6 0 01 0/2) 2 
στ 0 0 0 1151-53 Ὅτ 0 301|8| 2 
—1 1 3/0 0 0 0|-5ι 3 2 4-δ|0-δ 0 0/|-6 4 
The vector r is determined by 
1100 1 0 
ο ο ΓΩ 2] =| 2 
0301 5 2 


and it is placed into the very last column of the second tableau (Scheme 18.8b). One proceedes 
as above analogously to the method shown in 18.1.3.2, p. 915. If one wants to return to the original 
method, then the matrix of the original columns of the non-basic variables must be multiplied by A* 
and only these columns will be kept. 


18.1.3.5 Duality in Linear Programming 


1. Correspondence 


Το any linear programming problem (primal problem) an other unique linear programming problem 
can be assigned (dual problem): 


Primal problem Dual problem 


OF: f(x)-c[x,- cix, = max! (18.20a) ΟΕ”: g(u) =bru,;+bju,= min! (18.21a) 


CT: AX, + A12X < by, CT*: AT uy, + Ag w Z6, 
As4X, + A22X = by, AT uy + Α ü; — C5, 


0, x free. 18.01 
X 20, % free (18.200) u,>0, u free. (18.21b) 


The coefficients of the objective function of one of the problems form the right-hand side vector of the 
constraints of the other problem. Every free variable corresponds to an equation, and every variable 
with restricted sign corresponds to an inequality of the other problem. 
2. Duality Theorems 
a) If both problems have feasible solutions, i.e., M #0, M* 4 0 (where M and M* denote the feasible 
sets of the primal and dual problems respectively), then 

f(x) <g(u) forall xe M, ue M*, (18.22a) 
and both problems have optimal solutions. 
b) The points x € M and u € M* are optimal solutions for the corresponding problem, if and only if 


f(x) = g). (18.22b) 
c) If f(x) has no upper bound on M or g(u) has no lower bound on ΛΙ”, then ΛΙ” = () or M = 0), i.e., 
the dual problem has no feasible solution. 


d) The points x € M and u € M* are optimal points of the corresponding problems if and only if: 


ui (Aix; + Αρχ — bj) 20. and x[(AT,u; + Agius --ει) = 0. (18.22c) 
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Using these last equations, a solution x of the primal problem can be found from a non-degenerate 
optimal solution u of the dual problem by solving the following linear system of equations: 


AsaX, + A22X2 — by = 0, (18.23a) 
(Aiix; + A12X2 — bı) = 0 for uj > 0, (18.23b) 
z; =0 for (Aiu + AZ, — σι), 4 0. (18.23c) 


The dual problem also can be solved by the simplex method. 
3. Application of the Dual Problem 


Working with the dual problem may have some advantages in the following cases: 

a) If it is simple to find a normal form for the dual problem, one switches from the primal problem to 
the dual. 

b) If the primal problem has a large number of constraints compared to the number of variables, then 
the revised simplex method can be used for the dual problem. 


W Consider the original problem of the example of 18.1.2, p. 912. 


Primal problem Dual problem 
OF: f(x) = 221 + 329 + 41} = max! ΟΕ”: g(u) = —u; + 2ug 20 + 2u, = min! 
CT: Ti 29 z3 € —1, ΟΙ: -—u - u + 2u, È 2, 
T2 = 2 σι + U2 — 3u, 2 3, 
—1i +273 < 2, =u + 218 + 2u4 > 4, 
2%, — 38ο + 243 2, 11, Up, tg, ta > 0. 
@1,%2,%3 > 0. 
If the dual problem is solved by the simplex method after introducing the slack variables, then the 


optimal solution u* = (u1, u2, u3, u4) = (0,7,2/3,4/3) with g(u) = 18 is got. A solution x* of the 
primal problem can be got by solving the system (Ax — b); = 0 for u; > 0, i.e., x9 = 2, — £1 +223 = 2, 
211 — 3x9 + 2x3 = 2, therefore: x* = (2.2.2) with f(x) = 18. 


18.1.4 Special Linear Programming Problems 
18.1.4.1 Transportation Problem 
1. Modeling 


A certain product, produced by m producers ΕἸ, F2,..., Em in quantities a1, a2, . . . , am, is to be trans- 
ported to n consumers Vi, Vo,..., Va with demands b1, b2, . . . , bn. Transportation cost of a unit product 
of producer E; to consumer V; is ος. The amount of the product transported from Æ; to V; is a; units. 
An optimal transportation plan is to be determined with minimum total transportation cost. The 
system is supposed to be balanced, i.e., supply equals demand: 


m 


Σια = Yo by. (18.24) 
i j=l 


The matrix of costs C and the distribution matrix X are constructed: 


E: a 
c- C1 77 Cin Ey X11 cC Lin αι 
7 ] , (18.25a) X= : : ta (18.25b) 
Cmi Gand Em Emi Ymn/ am 
"EE i yee pb, 


If condition (18.24) is not fulfilled, then two cases are distinguished: 

a) If Sa; > È bj, then a fictitious consumer V,+; is introduced with demand ὄρει = a; — Ὁ b; and 
with transportation costs Ci n1 = 0. 

b) If Sa; < Σύ). then introduce a fictitious producer Em+1 is introduced with capacity @m41 = 
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Σ ὁ; — Sa; and with transportation costs Cm+1,; = 0. 
In order to determine an optimal program, the following programming problem should be solved: 


OF: /(Χ) -- ΣΣ) ἄγοι; = min! (18.364) 
i=1 j=1 
CT: Mizg-ai (i21...,m) Yo αμ ὃ G=l,...,n), αὐ > 0). (18.26b) 
j=l i=l 


The minimum of the problem occurs at a vertex of the feasible set. There are m + n — 1 linearly 
independent constraints among the m + n original constraints, so, in the non-degenerate case, the 
solution contains m + n — 1 positive components x;;. To determine an optimal solution the following 
algorithm is used, which is called the transportation algorithm. 

2. Determination of a Basic Feasible Solution 

With the Northwest corner rule an initial basic feasible solution can be determined: 


a) Choose «πι = min(a;, bi). (18.272) 
b) If a, < bi, the first row of X is omitted. (18.27b) 
If a; > bi, the first column of X is omitted. (18.216) 
If αι = b4, either the first row or the first remaining column of X is omitted. (18.27d) 


If there are only one row but several columns, then one column is cancelled. The same applies for the 
rows. 

C) αι is replaced by αι — x; and b, by bı — xy, and the procedure in repeated in the left upper vertex 
of the reduced distribution matrix X. 

The variables obtained in step a) are the basic variables, all the others are non-basic variables with zero 
values. 


E: ‘ye 
5 3 2 T FA πια M2 πι Tis a =9 
C= ( 8211 | Ez, X= ( σοι 1223 T23 T24 | az = 10. 
9263/ Es X31 T32 ἆ T34 da — 3 
V: Vi Va V3 Vi Σ: bi = 4b = 6 bg = 5 b4 


The determination of an initial extreme point with the Northwest corner rule gives 


first step second step further steps 

4 55 4 5 — 50 4 5 — 0 
x-{ E ; κ-{ ) io ; x-( m 109 40 

| Λι MEE 118) ὁ 

4657 0657 0 Bf 

0 1 103 


There are alternative methods to find an initial basic solution which also takes the transportation costs 
into consideration (see, e.g., the Vogel approximation method in [18.13]) and they usually result in a 
better initial solution. 

3. Solution of the Transportation Problem with the Simplex Method 

If the usual simplex tableau is prepared for this problem, then it results in a huge tableau ((m-+n) x (m- 
n)) with a large number of zeros: In each column, only two elements are equal to 1. So, a reduced tableau 
is constructed, and the following steps correspond to the simplex steps working only with the non-zero 
elements of the theoretical simplex tableau. The matrix of the cost data contains the coefficients of 
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the objective function. The basic variables are exchanged for non-basic variables iteratively, while the 
corresponding elements of the cost matrix are modified in each step. The procedure is explained by an 
example. 


a) Determination of the modified cost matrix C from C by 


συ ου tar CS liam FH hee) (18.28a) 
with the conditions 

Gj — 0 for (i,j) if xij is an actual basic variable. (18.28b) 
The elements of C belonging to basic variables are marked and μι = 0 is substituted. The other 


quantities p; and q;, also called potentials or simplex multiplicators, are determined so that the sum of 
Pi, qj and the marked costs c;; should be 0: 


ΓῚ 
(5 (3) 2 7 \ m=0 dama 
C= 8 (2) (1) (Ὁ | m=l = C=| 40 00 |. (18.280) 
9 2 6 (3 /p-2-1 513190 
αι 5 qe 3 qa 2 q4 2 
b) The value 
ἔμ = min{č;;} (18.28d) 


must be determined. If ¢,, > 0, then the given distribution X is optimal; otherwise xp, is chosen as a 
new basic variable. In our example: Gp = €32 = —2. 


c) In C, ὄρη and the costs associated to the basic variables are marked. If C contains rows or columns 
with at most one marked element, then these rows or columns will be omitted. This procedure is 
repeated with the remaining matrix, until no further cancellation is possible. 


(18.28e) 
(0) 
d) The elements x;; associated to the remaining marked elements &;; form a cycle. The new basic 
variable Τρι is to be set to a positive value δ. The other variables Τι; associated to the marked elements 
Čij are determined by the constraints. In practice, ô is subtracted and added from or to every second 
element of the cycle. To keep the variables non-negative, the amount ὃ must be chosen as 


Ô = Trs = min{ Tij: Žij = Tij — ὃ}, (18.28f) 
where z,, will be the non-basic variable. In the example ὃ = min{1,3} = 1. 
Σ 
4 5 9 
"T: 7 45 
Xe vi 5 g4 10 = x-( ση). f(x) -- 53. (18.280) 


ô —33-6] 3 
y 46 5 7 


Then, this procedure is repeated with X — X. 


(5) (3) 2 T pı=0 (0) (0) (=2) 3 
C= 8 2 (1) (1) po=3 = C=] 6 2 (0) (0) |, (1828h) 
9 (2) 6 (3) p3=1 5 (0) 3 (0) 
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. 

a 

| 
ρα 
ή 


432 
( 3 z) , J(X)=49. (18.28) 
3 


1+6 — 2-6 
The next matrix C does not contain any negative element. So, X is an optimal solution. 


18.1.4.2 Assignment Problem 

The representation is made by an example. 

E ~ shipping contracts should be given to n shipping companies so that each company receives exactly 
one contract. The assignment has to be determined which minimizes the total costs, if the i-th company 
charges cj; for the j-th contract. 

An assignment problem is a special transportation problem with m = n and a; = b; = 1 for all i, j: 


OF: f(x ZSS ως = min! (18.29a) 
i=1j=1 
CT: Soa =1 =1,...,n), Soa =1 G=1,...,n), αμ € {0,1}. (18.29b) 
j=l i=1 


Every feasible distribution matrix contains exactly one 1 in every row and every column, all other 
elements are equal to zero. In a general transportation problem of this dimension, however, a non- 
degenerate basic solution would have 2n — 1 positive variables. Thus, basic feasible solutions to the 
assignment problem are highly degenerate, with n — 1 basic variables equal to zero. Starting with 
a feasible distribution matrix X, the assignment problem can be solved by the general transportation 
algorithm. It is time consuming to do so. However, because of the highly degenerate nature of the basic 
feasible solutions, the assignment problem can be solved with the highly efficient Hungarian method 
(see [18.9]). 


18.1.4.3 Distribution Problem 
The problem is represented by an example. 
W m products Ει, E»,..., Em should be produced in quantities a1, a2,...,@m. Every product can be 
produced on any of n machines Mi, M»,..., Mn. The production of a unit of product E; on machine 
Mi; needs processing time bj; and cost ο). The time capacity of machine Mj is bj. Denote the quantity 
produced by machine M; from product E; by πμ. The total production cust should be minimized. 
This distribution pr oblem has the following general model: 
m n 
OF: f(x) = cjr; = min! (18.30a) 
i=1j=1 
m n 
CT: Se vi; =a; (i — l,...,m), Msg <b; G=1,....n), mij 20 for all i, 7. (18.30b) 
j=l = 
The distribution problem is a generalization of the transportation problem and it can be solved by 
the simplex method. If all bj; = 1, then the more effective transportation algorithm can be used (see 
18.1.4.1, p. 921) after introducing a fictitious product E,,,4 (see 18.1.4.1, p. 920). 


18.1.4.4 Travelling Salesman 

Suppose there are n places O1, O2,...,On. The travelling time from O; to O; is cij. Here, ci; A Cji is 
possible. 

One wants to determine the shortest route such that the traveller passes through every place exactly 
once, and returns to the starting point. 

Similarly to the assignment problem, exactly one element is chosen in every row and column of the time 
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matrix C so that the sum of the chosen elements is minimal. The difficulty of the numerical solution of 
this problem is the restriction that the marked elements c;; should be arranged in order of the following 
form: 

Cit iar Cina. sso Cinging, With i, Ay for k Al and inp = à. (18.31) 
The travelling salesman problem can be solved by the branch and bound methods. 


18.1.4.5 Scheduling Problem 


n different products are processed on m different machines in a product-dependent order. At any time 
only one product can be processed on a machine. The processing time of each product on each machine 
is assumed to be known. Waiting times, when a given product is not in process, and machine idle times 
are also possible. 

An optimal scheduling of the processing jobs is determined where the objective function is selected 
as the time when all jobs are finished, or the total waiting time of jobs, or total machine idle time. 
Sometimes the sum of the finishing times for all jobs is chosen as the objective function when no waiting 
time or idle time is allowed. 


18.2 Non-linear Optimization 


18.2.1 Formulation of the Problem, Theoretical Basis 
18.2.1.1 Formulation of the Problem 


1. Non-linear Optimization Problem 
A non-linear optimization problem has the general form 


f(x) = min! subjectto xc" with (18.32a) 

gi(x) < 0, iel-íL..,mh hj;(x)20, je J={l,...,r} (18.32b) 
where at least one of the functions f, gi, hj is non-linear. The set of feasible solutions is denoted by 

M={xeR": g(x) <0, τεῖ, hj(x)=0, j € J}. (18.33) 


The problem is to determine the minimum points. 
2. Minimum Points 
A point x* € M is called the global minimum point if f(x*) < f(x) holds for every x € M. If this 
relation holds for only the points x of a neighborhood U of x*, then x* is called a local minimum point. 
Since the equality constraints h;(x) = 0 can be expressed by two inequalities, 
—hj(x)€0, hj(x) <0, (18.34) 
it can be supposed that the set J is empty, J = 0. 
18.2.1.2 Optimality Conditions 
1. Special Directions 
a) The Cone of the Feasible Directions at x € M is defined by 
Z(x) - (de R^: 3a>0: x+adEM, 0<a<a}, xeM, (18.35) 
where the directions are denoted by d. If d € Z(x), then every point of the ray x + ad belongs to M 
for sufficient small values of a. 
b) A Descent Direction at a point x is a vector d € R” for which there exists an à > 0 such that 
f(x+ad) < f(x) Ya € (0,a). (18.36) 
There exists no feasible descent direction at a minimum point. 
If f is differentiable, then d is a descent direction when V f (οστά < 0. Here, V denotes the nabla 
operator, so V f(x) represents the gradient of the scalar-valued function f at x. 
2. Necessary Optimality Conditions 
If f is differentiable and x* is a local minimum point, then 


Vf(x')'dz0 forevery de Z(x"*). (18.372) 
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In particular, if x* is an interior point of M, then 
V f(x") = 0. (18.37b) 


3. Lagrange Function and Saddle Point 
Optimality conditions (18.37a,b) should be transformed into a more practical form including the con- 
straints. The so-called Lagrange function or Lagrangian is constructed: 

m 


L(x,u) = f(x) + $ wigi(x) = f(x) -u'g(x, xceR, uc R}, (18.38) 


according to the Lagrange multiplier method (see 6.2.5.6, p. 456) for problems with equality constraints. 
A point (x*, u*) € R” x R7 is called a saddle point of L, if 

L(x*,u) < L(x*,u*) < L(x,u' forevery xc R”, uc R7. (18.39) 
4. Global Kuhn-Tucker Conditions 

A point x* € R” satisfies the global Kuhn- Tucker conditions if there is an u* € RIZ, i.e., u* > 0 such 
that (x*, u*) is a saddle point of L. 

For the proof of the Kuhn- Tucker conditions see 12.5.6, p. 683. 

5. Sufficient Optimality Condition 

If (x*, u*) € R” x R? is a saddle point of L, then x* is a global minimum point of (18.32a,b). 

If the functions f and g; are differentiable, then local optimality conditions can be deduced. 

6. Local Kuhn- Tucker Conditions 

A point κ” € M satisfies the local Kuhn- Tucker conditions if there are numbers u; > 0, i € Jo(x*) such 
that 


-Vf(x*)= M, uVg(x), where (18.404) 
i€lo(x*) 
Ίος) = {i € {1,...,m} : g(x) = 0} (18.40b) 


is the index set of the active constraints at x. The point x* is also called a Kuhn- Tucker stationary 
point. 


This means geometrically that a point x* € 
M satisfies the local Kuhn-Tucker condi- 
tions, if the negative gradient —V f(x*) lies 
in the cone spanned by the gradients V g;(x*) 
i € lo(x*) of the constraints active at x* 
(Fig. 18.5). 

The following equivalent formulation for 
(18.40a,b) is also often used: x* € IR" satis- 
fies the local Kuhn- Tucker conditions, if there 
is a u* € R7 such that 


g(x") < 0, (18.414) 
ugi(x*) =0, i=1,...,m, (18.41b) level lines 
m f(x) Ξ const 
Vf(x*) + So uiVgi(x*) = 0. (18.419) 
i=1 


Figure 18.5 


7. Necessary Optimality Conditions and Kuhn-Tucker Conditions 

Ifx* € M isa local minimum point of (18.32a,b) and the feasible set satisfies the regularity condition at 
x*: dd € R” such that Vgi(x*)"d < 0 for every i € Io(x*), then x* satisfies the local Kuhn-Tucker 
conditions. 
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18.2.1.3 Duality in Optimization 
1. Dual Problem 


With the associated Lagrangian (18.38) the maximum problem is formed, the so-called dual of (18.32a,b): 


L(x,u) = max! subject to (x,u)c Μ with (18.42a) 
M* = {(x,u) € R” x RẸ : L(x,u) = min L(z,u)}. (18.42b) 
zcR" 


2. Duality Theorems 

If x, € M and (κο, uy) € M*, then 

a) L(xs, αν) < f(x). 

b) If L(x», αὐ) = f (xi), then x, is a minimum point of (18.32a,b) and (x5, uy) is a maximum point of 
(18.42a,b). 


18.2.2 Special Non-linear Optimization Problems 


18.2.2.1 Convex Optimization 


1. Convex Problem 
The optimization problem 
f(x) = min! subjectto g(x) <0 (i=1,... m) (18.43) 
is called a convex problem if the functions f and g; are convex. In particular, f and g; can be linear 
functions. The following statements are valid for convex problems: 
a) Every local minimum of f over M is also a global minimum. 
b) If M is not empty and bounded, then there exists at least one solution of (18.43). 
c) If f is strictly convex, then there is at most one solution of (18.43). 
1. Optimality Conditions 
a) If f has continuous partial derivatives, then x* € M is a solution of (18.43), if 


(x—x')'Vf(x 50 forevry κε M. (18.44) 
b) The Slater condition is a regularity condition for the feasible set M. It is satisfied if there exists an 
x € M such that g;(x) < 0 for every non-affine linear functions qj. 
c) If the Slater condition is satisfied, then x* is a minimum point of (18.43) if and only if there exists a 
u* > 0 such that (x*, u*) is a saddle point of the Lagrangian. Moreover, if functions f and g; are dif- 
ferentiable, then κ” is a solution of (18.43) if and only if x* satisfies the local Kuhn- Tucker conditions. 
d) The dual problem (18.42a,b) can be formulated easily for a convex optimization problem with dif- 
ferentiable functions f and g;: 

L(x,u) = max!, subjectto (x,u)c M* with (18.45a) 

M* = {(x,u) € R” x R? : VxL(x,u) = 0). (18.45b) 

The gradient of L is calculated here only with respect to x. 
e) For convex optimization problems, the strong duality theorem also holds: 
If M satisfies the Slater condition and if x* € M is a solution of (18.43), then there exists a u* € RT, 
such that (x*, u*) is a solution of the dual problem (18.45a,b), and 


ο ως ο. ο (18.46) 


18.2.2.2 Quadratic Optimization 
1. Formulation of the Problem 
Quadratic optimization problems have the form 
f(x) =x™Cx+ p'x = min!, subjectto xe MCR" with (18.472) 


M=M,: M-(xcR": AxEb, x» 0). (18.47b) 
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Here, C is a symmetric (n, n) matrix, p € R^, A is an (m,n) matrix, and b € R”. 
'The feasible set M can be written alternatively in the following way: 
ΜΞ Μη: M={x: Ax=b, x20] (18.484. 
M=Myy: M={x : Ax< b). (18.48b 
2. Lagrangian and Kuhn-Tucker Conditions 
The Lagrangian to the problem (18.47a,b) is 


L(x,u) = x'Cx+p'x+u"(Ax—b). (18.49 
By introducing the notation 
L L 
v= ah =p+2Cx+A‘tu and — ——Ax-b (18.50 
Ox = = δα 
the Kuhn-Tucker conditions are as follows: 
Case I: Case II: Case III: 
a Ax+y=b, a) Ax=b, a) Ax+y=b, (18.51a) 
b) 2Cx- v + Au = -p, b) 2Cx-v+Au=-p, b) 2Cx+ Au = -p, (18.51b) 
ο) κ2θ,ν20,Υ20,420, c) x>0,v>0, c) u20,yz0, (18.51c) 
d) x'v+yTu=0. d) κἲν --0. d) yTu- 0. (18.51d) 


3. Convexity 

The function f(x) is convex (strictly convex) if and only if the matrix C is positive semidefinite (posi- 
tive definite). Every result on convex optimization problems can be used for quadratic problems with 
a positive semidefinite matrix C; in particular, the Slater condition always holds, so it is necessary 
and sufficient for the optimality of a point x* that there exists a point (x*, y, u, v), which satisfies the 
corresponding system of local Kuhn- Tucker conditions. B 

4. Dual Problem 

If C is positive definite, then the dual problem (18.45a,b) of (18.47a,b) can be expressed explicitly: 


L(x,u) = max!, subject to (x,u)c M*, where (18.52a) 
1 
M* = {(x,u) E R" x RẸ :x--5 7 (ATu 4 p)}. (18.52b) 
1 
If the expression x = -30 (Au + p) is substituted into the dual objective function L(x, u), then 
the equivalent problem is 
1 1 T 1 
φ(α) = - ΑΟ Au — (3407p + b) u- 15Ο Ῥ =max!, τα20. (18.53) 
Hence: If x* € M is a solution of (18.47a,b), then (18.53) has a solution u* > 0, and 
I) = ία". (18.54) 
Problem (18.53) can be replaced by an equivalent formulation: 
y(u) = u’Eu+h'u=min!, subject to u 0 where (18.55a) 


1 1 
E= 190 ^ and h= 5AC™'p+b. (18.55b) 
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18.2.3 Solution Methods for Quadratic Optimization Problems 
18.2.3.1 Wolfe’s Method 


1. Formulation of the Problem and Solution Principle 
The method of Wolfe is to solve quadratic problems of the special form: 


f(x) =x'Cx+p’x=min!, subjectto Ax=b, xD. (18.56 


C is supposed to be positive definite. The basic idea is the determination of a solution (x*, u*, v*) o 
the corresponding system of Kuhn-Tucker conditions, associated to problem (18.56): 


Ax — b, (18.572. 

2Cx — v - ATu = --ρ. (18.57b 
x20, v20; (18.5Το 

x'v=0. (18.58 


Relations (18.57a,b,c) represent a linear equation system with m + n equations and 2n + rn variables. 
Because of relation (18.58), either x; = O or v; = 0 (i = 1,2,...,n) must hold. Therefore, every solution 
of (18.57a,b,c), (18.58) contains at most m+n non-zero components. Hence, it must be a basic solution 
of (18.57a,b,c). 

2. Solution Process 
First, a feasible basic solution (vertex) x of the system Ax — b is determined. The indices belonging 
to the basis variables of X form the set Ig. In order to find a solution of system (18.57a,b,c), which also 
satisfies (18.58), the problem is formulated as 


2C -I AT -q is the zero vector of the corresponding dimension, 

in the following way: 

a) m columns belonging to x; with i € Ip, 

b) n — m columns belonging to v; with i ¢ Ig, 

c) all m columns belonging to wi, 

d) the last column, but then a suitable column determined in b) or c) will be dropped. 

If q — 0, then the interchange according to d) is not possible. Then x is already a solution. 
Now, a first simplex tableau can be constructed. The minimization of the objective function is per- 
formed by the simplex method with an additional rule that guarantees that the relation x'v = 0 is 
satisfied: 

The variables x; and v; (i = 1,2,...,n) must not be simultaneously basic variables. 

In the case of a positive definite C, considering this additional rule the simplex method provides a so- 
lution of problem (18.59), (18.60a,b,c), (18.61) satisfying µ = 0. For a positive semi-definite matrix C, 
because of the restricted pivot choice, it may happen that although u > 0, no more exchange-step can 
be made without violating the additional rules. In this case u cannot be reduced any further. 


—j — min!, (pu € R); (18.59 
Ax — b, (18.60a 
2Cx—v+ATu-pq=—p with q=2Cx+p, (18.60b 
κ20,ν20, μσ0; (18.60c 
xlv -- 0. (18.61 

If (x, v, u, μ) is a solution of this problem also satisfying (18.57a,b,c) and (18.58), then jz = 0. 
The vector (x, v, u, p) = (X,0,0,1) is a known feasible solution of the system (18.60a,b,c), and i 
satisfies the relation (18.61), too. A basis associated to this basic solution is formed from the columns 

of the coefficient matrix 

í A000 | 1 denotes the unit matrix, 0 the zero matrix and 0 (18.62 
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E f(x) = zx? + 403 — 10x) — 3222 = min! with zi-F2x5-- 33 — 7, 2gı + 32-24 — 8. 


1000 -10 
| (1210 (7 | [0400 _ { -32 
Cee b=) σος o 0o oe P= | o 

00 0 0 0 


In this case C is positive semi-definite. A feasible basic solution of Ax = b is x = (0,0,7, 8)", q= 


2Cx + p= (—10, —32, 0, 0)". The choices for the basis vectors are: a) columns 3 and 4 of ear 


0 
b) columns 1 and 2 of ( aa c) the columns of ( ar) and d) column ( F 


= » instead of the first 


0 i esc 
column of ( 1) The basis matrix is formed from 


these columns, and the basis inverse is calculated Scheme 18.9 
(see 18.1, p. 909). Multiplying matrix (18.62) and . . . 
b Tı 29 Ul Us U4 
the vectors ( ὧν ) by the basis inverse, the first T: 1 2 0 0 0 7 
p 3 
simplex tableau (Scheme 18.9) is obtained. τ : 1 : : " - 
Only σι can be interchanged with vs in this tableau vy 64 8 32 12 54 0 
according to the complementary constraints. Af- 10 10 10 10 
ter a few steps, we get the solution x* = ui 0 0 Q = Ἡ 0 0 
(2, 5/2,0, 3/2)" is obtained. The last two equations uz 0 0 0 0-1 0 
of2Cx—v--ATu μα Pare: v3 = U1, U4 = ua. n 2 0 1 1 2 1 
Therefore, by eliminating u; and ug the dimension 10 10 10 10 
of the problem can be reduced. 9 ῇ 1 1 9 i 
18.2.3.2 Hildreth-d’Esopo Method ~ 10 10 10 10) 
1. Principle 
The strictly convex optimization problem 
f(x) =x'Cx+p'’x=min!, Ax<b (18.63) 
has the dual problem (see 1., p. 926) 
ψ(α) = uTEu--h'u-— min! u>0 with (18.64a) 
1 1 
E= 190 A. h= 5ΑΟ (pth. (18.64b) 
Matrix E is positive definite and it has positive diagonal elements e;; > 0, (i = 1,2,...,m). The 
variables x and u satisfy the following relation: 
1 
x= -σο (Αα +p). (18.65) 


2. Solution by Iteration 

The dual problem (18.64a), which contains only the condition u > 0, can be solved by the following 
simple iteration method: 

a) Substitute u! > 0, (e.g., u! = 0), k = 1. 

b) Calculate uft! for i =1,2,...,m according to 


m 


rl À hi τ : ; 
( ἢ eju! + 3 + ἢ z ,(18.66a) ΠΝ = max {0, ως} : (18.66b) 
j=l 


j=i+1 


1 


Cii 


wet! = 
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c) Repeat step b) with k + 1 instead of k until a stopping rule is satisfied, e.g., 
e>0. 

Under the assumption that there is an x such that Ax < b, the sequence {ψ(α}}} converges to the 
minimum value Ymin and sequence {x*} given by (18.65) converges to the solution x* of the original 
problem. The sequence (u^! is not always convergent. 


va) — v(u*)| < e, 


18.2.4 Numerical Search Procedures 

By using non-linear optimization procedures acceptable approximate solutions can be found with rea- 
sonable computing costs for several types of optimization problems. They are based on the principle 
of comparison of function values. 

18.2.4.1 One-Dimensional Search 

Several optimization methods contain the subproblem of finding the minimum of a real function f(x) 
for x € [a,b]. It is often sufficient to find an approximation Τ of the minimum point z*. 

1. Formulation of the Problem 

A function f(x), x € R, is called unimodal in [a,b] if it has exactly one local minimum point on every 
closed subinterval J C [a,b]. Let f be a unimodal function on fa, b] and 2* the global minimum point. 
Then an interval [c, d] C [α, b] should be found with α΄ € [c, d] such that d — c < €, € > 0. 

2. Uniform Search 


—a € . . 

A positive integer n is chosen such that 6 = mud = y and the values f(x") for z^ = a + kô (k = 
n 

1,...,n) are calculated. If f(x) is the smallest value among these function values, then the minimum 


point α” is in the interval [a — ὃ, x + δ]. The number of required function values for the given accuracy 
can be estimated by 
2(b—a 
n> AM Je (18.67) 


E 


3. Golden Section Method, Fibonacci Method 
The interval [a,b] = αι, b1] will be reduced step by step so that the new subinterval always contains 
the minimum point z*. The points λι, μι are determined in the interval (αι, b1] as 


àı = a + (1 — T)(bı — αι), pi a1 7(b —a1) with (18.68a 
1 
T= 5 (V5 — 1) © 0.618. (18.68b 
This corresponds to the golden section. Two cases are distinguished: 
a) If. f(A1) < f(m), then a = a1, by = pı and pg = A, are substituted. (18.69a 
b) If {(λι) > f(m), then ag = Ay, bz = bj and àz = ji are substituted. (18.69b 


If b — a» > ε, then the procedure is repeated with the interval faz, b2], where one value is already 
known, f (A3) in case a) and f (y) in case b), from the first step. To determine an interval [an, bn], which 
contains the minimum point x*, altogether n function values are calculated. From the requirement 

E > bn — an = 7" (by - αι) (18.70 
the necessary number of steps n can be estimated. 
By using the golden section method, at most one more function value should be determined compared to 
the Fibonacci method. Instead of subdividing the interval according to the golden section, the interva 
is subdivided according to the Fibonacci numbers (see 5.4.1.5, p. 375, and 17.3.2.4, 4., p. 908). 


18.2.4.2 Minimum Search in n-Dimensional Euclidean Vector Space 


The search for an approximation of the minimum point x* of the problem f(x) = min!, x € R", can 
be reduced to the solution of a sequence of one-dimensional optimization problems. 
One takes 


a) x=x', k=1, where x! is an appropriate initial approximation of x*. (18.71a) 
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b) The one-dimensional problems 
e(o,) = f(x F*,... rs ο aT) = min! with o, € R (18.71b) 
are solved for r zl 2. ,n. If à is an exact or μμ μκνς minimum point of the r-th problem, 
then zët! = xk + à, are mice. 
c) If two consecutive approximations are close enough to each other, i.e., with some vector norm, 
x -x"|| «e or κ) FE) < ex. (18.710) 
then x**! is an approximation of κ”. Otherwise step b) is repeated with k + 1 instead of k. The one- 
dimensional problem in b) can be solved, by using the methods given in 18.2.4.1, p. 930. 


18.2.5 Methods for Unconstrained Problems 
The general optimization problem 

f(x)- min! for xcmR" (18.72) 
is considered with a continuously differentiable function f. Each method described in this section con- 
structs, in general, an infinite sequence of points (x^) € R”, whose accumulation point is a stationary 
point. The sequence of points will be determined starting with a point x! € IR" and according to the 
formula 


x" = x" + od? (k=1,2,...), (18.73) 
i.e., first a direction d^ € R” is determined at x^ and the step size ap € R indicates how far x**! 
from x* in the direction d*. Such a method is called a descent method, if 

f(x) < f(x") (k=1,2,...). (18.74) 
The equality V f(x) = 0, where V is the nabla operator (sce 13.2.6.1, p. 715), characterizes a stationary 
point and can be used as a stopping rule for the iteration method. 
18.2.5.1 Method of Steepest Descent 


Starting from an actual point x^, the direction d^ in which the function has its steepest descent is 


d* = -V f(x") (18.75a) and consequently x*t! = x" — o4 V f(x"). (18.75b) 


A schematic representation of the steepest descent method with level lines f(x) = f(x’) is shown in 
Fig. 18.6. 


1 The step size αι. is determined by a line search, i.e., 
f(x)=f(x ) αμ is the solution of the one-dimensional problem: 
f(x*-- ad^) 2 min!, o 20. (18.76) 
This problem can be solved by the methods given 
in 18.2.4, p. 930. 

The steepest descent method (18.75b) converges 
. relatively slowly. For every accumulation point x* 
level lines — of the sequence (x^), V f(x) = 0. In the case of a 
quadratic objective function, i.e., f(x) = x Cx + 


Vf(c) 


Figure 18.6 pix, the method has the special form: 
ἀνα’ 
yma agd (18.77a) with d* = —(2Cx* +p) and ak = sip (18.77b) 


18.2.5.2 Application of the Newton Method 
Suppose that at the actual approximation point x* the function f is approximated by a quadratic 
function: 


(x) = f (x^) + (x — x)T V f(x) + σα — x) H(x^)(x — x^). (18.78) 
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Here H(x^) is the Hessian matrix, i.e., the matrix of second partial derivatives of f at the point x*. If 
H(x^) is positive definite, then g(x) has an absolute minimum at x**! with Vq(x**!) = 0, therefore 
one gets the Newton method: 

x" = xt- HT (x*)Vf(x*) (k= 1,2,...), ie, (18.79a) 

d! = —H™(x*)V f(x") and αν in (18.73). (18.79b) 

The Newton method converges fast but it has the following disadvantages: 
a) The matrix H(x^) must be positive definite. 
b) The method converges only for sufficiently good initial points. 
c) The step size can not influenced. 
d) The method is not a descent method. 
e) The computational cost of computing the inverse of H ^! (x^) is fairly high. 
Some of these disadvantages can be reduced by the following version of the damped Newton method (see 
also 19.2.2.2, p. 962): 


x =x aH (xvf) (k-1,2,...). (18.80) 
The relaxation factor a; can be determined, for example, by the principle given earlier (see 18.2.5.1, 
p. 931). 


18.2.5.3 Conjugate Gradient Methods 


Two vectors d', d? € R” are called conjugate vectors with respect to a symmetric, positive definite 
matrix C, if 


αἱ cd? = o. (18.81 


If d! , d?,... , d" are pairwise conjugate vectors with respect to a matrix C, then the convex quadratic 


problem g(x) = x’Cx + p'x, x € R”, can be solved in n steps if a sequence x**! = x* + αμα” 


starting from x! is constructed, where a, is the optimal step size. Under the assumption that f(x 
is approximately quadratic in the neighborhood of x*, i.e., C ~ gH’), the method developed for 


quadratic objective functions can also be applied for more general functions f(x), without the explici 
use of the matrix H(x*). 
The conjugate gradient method has the following steps: 


a) x'e R”, d'=-Vf(x’), (18.82 
where x! is an appropriate initial approximation for x*. 


b) xt = x" + apd" (k=1,...,n) with a, > 0 so that f(x" + ad^) will be minimized. (18.83a 


d^ = -V f(x***) + upd? (k —1,...,n — 1) with (18.83b 
k+1)T k+1 

ih V f(x ) Vile ) and d"*!— —vf(x""), (18.83c 
V f(x") V f(x") 


n+1 


c) Repeating steps b) with x"*! and d"*! instead of x! and d!. 


18.2.5.4 Method of Davidon, Fletcher and Powell (DFP) 
With the DFP method, a sequence of points starting from x! € R” is determined according to the 
formula 

x" -x*—oaM,Vf(x* (k-1,2,..) (18.84) 
Here, M; is a symmetric, positive definite matrix. The idea of the method is a stepwise approximation 
of the inverse Hessian matrix by matrices ΜΙ. in the case when f(x) is a quadratic function. Starting 
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with a symmetric, positive definite matrix Mj, e.g., ΜΙ = I (Lis the unit matrix), the matrix My is 
determined from M;_; by adding a correction matrix of rank two 


vhv5T (Μι ιν) (Mi. w^)? 
T 


M; = M;i 4 νε νε w'TM,.wi ( 8.85) 
with v* = x* — x*-! and w" = V f(x") — V f(x*-!) (k = 2,3,...). The step size ap is obtained from 
f(x* —aM,Vf(x*)) =min!, a> 0. (18.86) 


If f e is a quadratic function, then the DFP method becomes the conjugate gradient method with 
p 


18.2.6 Evolution Strategies 
18.2.6.1 Evolution Principles 


Evolution strategies are examples of stochastic optimization processes imitating natural evolution. 
They are based on the principles of mutation, recombination and selection. 

1. Mutation 

From a parent point x p a offspring (descendant) xo = x p-- dis formed by applying a random variation 
d. The components of d are (0, 07) normally distributed random variables Z(0, o?) determined newly 
at every mutation: 


αι Z(0, 01) Z(0,1)-01 
d; Z(0, 03 Z(0,1) - σα 

deem i a ο ος (18.87) 
dn Z(0,02) Z(0, 1) ‘On 


With a normally distributed d small changes have high probabilities while large changes occur very 
rarely. The changes are controlled by the standard deviation σι. 

2. Recombination 

From the population of j; parents offspring can be obtained by mixing the information from two or more 
parents, which are randomly selected. The recombination can follow two types of changes. 

At intermediate recombination a offspring becomes as weighted average of o randomly chosen parents: 


ΧοΟΞΣ mxm J m=1, 2<oK<u. (18.88) 


At a discrete recombination of ϱ parents the i-th component of a offspring χο is determined by the i-th 
component of a randomly chosen parent: 


to = tipo j€lh.-eq0s $212 (18.89) 


3. Selection 

By using mutation and recombination, a set of offspring is formed randomly. In a subsequent selection 
process the objective function f(x) serves as a measure to compare the fitness of the individuals. The 
fittest individuals are selected for the next generation. At certain strategies only the offspring take part 
in the selection. Other strategies consider also the parents (see also [18.12]). 


18.2.6.2 Evolution Algorithms 

Every evolution strategy is based on the following algorithm: 

a) Determination of an appropriate starting population consisting of jj individuals. These are the 
first generation of parents. Xp = {Xp,,---,Xp,}- 

b) In the k-th step the creation of A offspring X $ = {x},,...,x6, } by mutation and recombination 


of parents of the actual generation ΧΡ = {x},,...,.x δι}. 
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c) Application of selection to get the best p individuals for the next parent generation X 5"! 


Ix Es x. 

d) Repeating steps b) and c) until stopping rule is satisfied. It can be fulfilling an optimal criterium of 
theoptimization problem, or to reach a given number of generations, or exceeding a given computer 
time, etc. 


18.2.6.3 Classification of Evolution Strategies 


Every evolution strategy is characterized by a sequence of parameters. Essential parameters are the size 
ofthe population ju, the number of the offspring A, the number of parents o taking part in recombination 
and rules of making mutation, recombination and selection. To distinguish different types of strategies 
a special notation is commonly used. For the strategies using only mutation in producing offspring 
the (u + A), or (u, A) strategy notation is used. Strategies (ji + A) and (u, A) differ from each other in 
the type of selection. At strategy (ji À) the selection of the new generation is made only among the 
offspring, while at strategy (u + À) the parents are also involved. 

For strategies using recombination the number 9 of the parents, which are involved, is seen in the 
notation (μ/ο + A)- and (μ/ρ, A)-strategy. 


18.2.6.4 Generating Random Numbers 


For the numerical evaluation of evolution procedures uniformly and normally distributed random vari- 
ables are needed. Values of uniformly distributed variables can be got by the methods given in subchap- 
ter 16.3.5.2, p. 843. Normally distributed random variables can be produced from uniform variables in 
the following way: 

Box-Muller Method: If G; and G are uniformly distributed random numbers in the interval [0, 1], 
then the following two equations give two statistically independent normally distributed (0, 0?) random 
numbers Ζι (0, o?) and Z4(0, o°): 


Z,(0,07) = o4 —21n Gi cos(2rG2) and Z2(0, 0°) = e —21n Gi sin(27G»).. (18.90) 


18.2.6.5 Application of Evolution Strategies 


In the practice optimization problems have usually high complexity. Here the conventional optimiza- 
tion processes described in 18.2.5, p. 931 are often not appropriate. Evolution strategies belong to the 
differentiation-free solution methods, which are based on comparisons of the values of the objective 
function. They have simple conditions on the structure of the objective function. The objective func- 
tion does not need to be differentiable or continuous. So the evolution strategies are appropriate for a 
wide spectrum of optimization problems. 

The application of evolution strategies is not restricted to unconstrained continuous optimization prob- 
lems. Optimization problems with constraints can also be handled, where the constraints are enforced 
by penalty terms in the objective function (see Penalty and Barrier Methods in 18.2.8, p. 940). 
Another field of application is the discrete optimization, where some or all components of x can take 
their values from a discrete set. One possible mutation mechanism is to replace the value of a discrete 
component by one of its neighboring values with the same probability. 


18.2.6.6 (1 + 1)-Mutation-Selection Strategy 
This method is similar to the gradient method discussed in 18.2.5, p. 931 with the difference that the 


direction d* is a normally distributed random vector. The population consists of a single individual 
which produces one offspring at every generation. 


1. Mutation Step 
In generation k a offspring is obtained from a parent by adding a normally distributed random vector: 


xb — x5 ad*. (18.91) 
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The factor a is a parameter by which the speed of the convergence can be affected. α is considered as 
the step size of the mutation. 


2. Selection Step 
The new parent of the next generation (k + 1) is selected by comparing the objective function values 
of both individuals, i.e., from the parent with the formula: 


bes if f(x5) < (x5). (18.92) 


x 
Ap ἃ 
xh otherwise. 


The procedure stops if no better offspring arrives over a given number of generations. The step size œ 
can be increased if the mutation results mostly in improved offspring. At small improvements the value 
of a should be decreased. 

3. Step Size Control 

'The choice of the mutation step size α is of important influence to the convergence properties of the 
evolution method. While large step sizes are recommended in order to have fast convergence, small step 
size is required in the close neighborhood of the optimum or in regions of fast changing or oscillating 
of the objective function. The optimal step size depends on the problem. Too small steps lead to 
stagnation, too large steps may result in overshooting of the evolution process. 

1. 1/5-Success rule: The ratio of the number of successful mutations and the total number of 
mutations in the last step defines the rate of success q. If q > 1/5, then the step size can be increased. 
For smaller success rate, a should be decreased: 


il 
CO, α στι 
αμα — 11 i with c=0,8...0,85. (18.93) 


στα (>77 
ο 5 


2. Mutative Step Size Determination: The rule of 1/5 is a rough choice, and considering any 
concrete problem it is not always satisfactory. In an extended model the step size α and the standard 
deviations σι, i = 1,2...,n are in correlation. Here a and σι are multiplied with equal probability by 
one of the factors c, 1, 1/c, where c — 1,1...1,5. Further information see [18.12]. 


18.2.6.7 Population Strategies 


The (14-1) strategy presented in the preceding paragraph reflects the principles of the natural evolu- 
tion only in a very simplified form. With the extension to population models further properties of the 
evolution process can be considered. A large number of individuals in an evolution process assures that 
different regions of the solution space will be searched. 

1. (w+A)-Evolution Strategy 


The (u + A) strategy is a generalization of the (1 + 1) strategy. From the u parents of the cur- 
rent generation ΧΡ = {x},,...,X} P, } a set of A parents is chosen randomly with equal probability. 


Repeated choices are allowed me even necessary in the case if y < A. By mutation, A offspring 
X6 = [xb,.....x5,) are produced. From the selection set Χὁ U ΧΡ the best y individuals are 
chosen to take over into the next generation. 

Since the parents are also taken in the selection, the quality of the population from a generation to 
the next one cannot be worse. The (u + A) strategy has the property that it keeps an already found 
local optimum, since large mutation steps, which are required to leave the optimal point, have very 
small probability. It means, that an individual can have an infinite life. This behavior can be avoided 
by adding penalty terms to the objective function values of parents that increase from generation to 
generation. In this way the aging of individuals can be simulated. 

2. (p,A)-Evolution Strategy 

In contrary to the (u + A) strategy the selection step is now made among the A offspring, to choose the 
u individuals for the next generation, i.e., in this strategy the parents do not survive. Therefore À > ju 
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must hold. The values of the objective function for the offspring can be larger than that for the parents. 
This procedure can depart from local optima. 

Selection Pressure: The ratio of the individuals taking part in the selection and the size of the pop- 
ulation defines the selection pressure S: 


λ 
Poe for (u + A)-strategy , 
ea with 1<S<oo. (18.94) 
= for (p, A)-strategy 
n 


If the selection pressure is close to 1, then the selection step has almost no impact. A large number of 
offspring À > p results in a strong selection pressure, since from the set of the present individuals only 
few will survive into the next generation. 

3. (u/g + λ)- and (µ/ο. A)-Evolution Strategies with Recombination 

With the concept of recombination some relations are built between the individuals of a population, so 
the information of several parents are mixed in a offspring. 
In order to get a offspring, o parents are chosen from the set of parents X 5 with the same probability. 
It is assumed that for every member of the A offspring a separate choice of o parents is taken. The 
offspring is a discrete or an intermediate recombination of the chosen parents. The offspring produced 
in this way will be mutated and enters the selection process. 

In the previously described (u + A) and (pi, A) strategies each individual is the result of a series of mu- 
tations applied to one individual of the first generation of parents. So, a wider evolution step is possible 
only through many generations. Wider evolution steps are possible with recombination. Especially, 
when the parents have large distances among each other, the offspring are formed with completely new 
properties. 
4. Evolution Strategies with more Populations 
The principles of evolution can be expanded formally to the dimension of populations. Instead of indi- 
viduals now populations are in competition. So, the evolution process has two steps. It is shown in the 
expanded notation: [u3/0» t λοίµι/οι t λι)]. 

From a set of jj parent populations a set of As offspring populations is created by recombination of 95 
populations that are chosen randomly for each offspring population. In these A» offspring populations 
the optimization is performed using a (μι/οι + A1) or (µι/θι, A) strategy. After a given number of 
generations the best populations are chosen based on an appropriate criterium. The comparison of 
populations can be done by considering the values of the objective function of the best individual or by 
the population mean. 


18.2.7 Gradient Method for Problems with Inequality 


Type Constraints 
If the problem 
f(x)- min! subject to the constraints g;(x) <0 (i=1,...,m) (18.95) 
has to be solved by an iteration method of the type 
xt =x" + ard" (k=1,2,...) (18.96) 


then two additional rules must be considered because of the bounded feasible set: 

1. The direction d^ must be a feasible descent direction at x*. 

2. The step size a; must be determined so that x^^! is in M. 

The different methods based on the formula (18.96) differ from each other only in the construction of 
the direction d”. To ensure the feasibility of the sequence (x^) C M, af, and a% are determined in the 
following way: 


o4 from f(x" + ad^) 2 min!, a> 0 
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all = max(a € R : xë +ad* € M}. (18.97) 
Then 

oj, = min(o 5, o ). (18.98) 
If there is no feasible descent direction d* in a certain step k, then x* is a stationary point. 
18.2.7.1 Method of Feasible Directions 
1. Direction Search Program 


A feasible descent direction d* at point x* can be determined by the solution of the following optimiza- 
tion problem: 


c = min, (18.99) 
σοκ) d < o, ie p(x"), (18.100a) Vf(x*)'d«c,(18100b) [la|] <1.  (18.100c) 


Ifo < 0 for the result d = d* of this direction search program, then (18.100a) ensures feasibility and 
(18.100b) ensures the descending property of d*. The feasible set for the direction search program is 
bounded by the normalizing condition (18.100c). If σ = 0, then x* is a stationary point, since there is 
no feasible descent direction at x^. 

A direction search program, defined by (18.100a,b,c), can result in a zig-zag behavior of the sequence 
x^, which can be avoided if the index set To (x*) is replaced by the index set 


I(x") = {i € {1,...,m}: -e < g(x") x0), & 0 (18.101) 


which are the so-called εκ active constraints in x^. Thus, local directions of descent are excluded 
which are going from x* and lead closer to the boundaries of M consisting of the ες active constraints 
(Fig. 18.7). 


81(x)=0 


i» 


g(x)-0 


Figure 18.7 


If ø = 0 is a solution of (18.100a,b,c) after these modifications, then x* is a stationary point only if 
Io(x^) = L, (x^). Otherwise £j; must be decreased and the direction search program must be repeated. 


2. Special Case of Linear Constraints 
If the functions g;(x) are linear, i.e., g(x) = 


a; x — bi, then a simpler direction search method 

" " can be established: 
VK) Lag Vix) Ls c =Vf(x*)'d=min! with (18.102) 
VAN Kat ad <0, ie p(x") or i € Ie, (x^), (18.103a) 
ilal] < 1. (18.103b) 
a) b) The effect of the choice of different norms ||d|| = 


max{|d;|} € 1 or ||d|| = /dTd < 1 is shown in 
Figure 18.8 Fig. 18.8a,b. 


In a certain sense, the best choice is the norm ||d|| = ||d||; = «Τά, since by the direction search 


program the direction d* is obtained, which forms the smallest angle with —V f (x^). In this case the 
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direction search program is not linear and requires higher computational costs. With the choice ||d|| = 
\|d||.. = max{|d;|} € 1 a system of linear constraints —1 < d; € 1, (i = 1,...,n) is obtained, so the 
direction search program can be solved, e.g., by the simplex method. 
In order to ensure that the method of feasible directions for a quadratic optimization problem f(x) = 
xTCx + pix = min! with Ax < b results in a solution in finitely many steps, the direction search 
program is completed by the following conjugate requirements: If e. = o, , holds in a step, i.e., x* 
is an "interior" point, then the condition 
LT 

d'' Cd-0 (18.104) 
is added to the direction search program. Furthermore the corresponding conditions are kept from the 
previous steps. If in a later step αι = a then the condition (18.104) is removed. 
E f(x) =a} + 403 — 10r, — 32%. = min! gi(x)-— --ᾱι <0, go(x) = —a2 < 0, 
g3(X) = 214-2253 — 7 € 0, ga(x) = 2a, 4-23 — 8 < 0. 
Step 1: Starting with x! = (3,0)7, V f(x!) = (—4,—32)", I(x!) = {2}. 
—4d, — 32d, = min! 
=d <0, ||d||;, <1 


kT k . 
l d VI) cui o={5 a 


Minimizing constant: aj, 


Direction search program: { } = d=(1,1)". 


2d* σα! 0 4 
Maximal feasible step size: αἴ = min { -A0 for i such that a;7d* > ο Q0 = = aj = : => 
. f18 2) 2 5 ΠΝ ΣΝ 
αἱ π᾿ το] τν 2 = (595) - 
Step 2: Vf(x2) = (-$ em. I(x?) = {4}. 


δ) 80 Pond . 
Direction search program: 4 3/41 3 2 5 hm = ἃ) = (- 
ad; +d, $0, [ᾶ].. «1 
4 ; 
πα x= (3, 2)". 


Step 3: V f(x?) = (—4,-16)", Ip = (x?) = {3,4}. 
Direction search program: 


—4d, — 16d5 = min! " 
dy + 2d; <0, ο ae 
137 5\ T 
(-15) , 0% = 1, af =3 o) = 1, x! (2. >) , 
The next direction search program results ino = 0. Here 
the minimum point is x* — x* (Fig. 18.9). Figure 18.9 


18.2.7.2 Gradient Projection Method 


1. Formulation of the Problem and Solution Principle 

Suppose the convex optimization problem 

f(x)- min! with a,’x € b, (18.105) 
fori — l,...,m is given. A feasible descent direction d^ at the point x* € M is determined in the 
following way: 

If — V f (x^) is a feasible direction, then d” = —V f (x^) is selected. Otherwise x" is on the boundary 
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of M and —V f (x^) points outward from M. The vector — V f (x^) is projected by a linear mapping 
P; into a linear submanifold of the boundary of M defined by a subset of active constraints of x^. 
Fig. 18.10a shows a projection into an edge, Fig. 18.10b shows a projection into a face. Supposing 
non-degeneracy, i.e., if the vectors a;, i € Jo(x) are linearly independent for every x € R”, such a 
projection is given by 


d* = -P,Vf(x^) = - (1- AL (AALT) Ας) V f(x"). (18.106) 


Here, A, consists of all vectors a;T, whose corresponding constraints form the sub-manifold, into which 
—V f (x,.) should be projected. 


Figure 18.10 
2. Algorithm 
The gradient projection method consists of the following steps, starting with x' € M and substituting 
k — 1 and proceeding in accordance to the following scheme: 
I: If — V f (x^) is a feasible direction, then d* = —V f (x^) is substituted, and continued with III. 
Otherwise A, is constructed from the vectors a; with i € Jo(x*) and continued with II. 
I: d* = — (1- A (AALT) Αι) V f(x) is sunstituted. If d" ¢ 0, then continued with III. If 


d = 0 and u = (ARAT) A.V f(x^) > 0, then x* is a minimum point. The local Kuhn-Tucker 


conditions —V f(x*) = uia; = Αμα are obviously satisfied. 
i€Io(x*) 


If u Z 0, then an i with u; < 0 is chosen, the i-th row from A, is deleted and II is repeated. 


III: Calculation of ap and x**! = x* + apd" and returning to I with k = k + 1. 
3. Remarks on the Algorithm 
If — V f (x) is not feasible, then this vector is mapped onto the sub-manifold of the smallest dimension 


which contains x*. If d^ = 0, then — V f (x^) is perpendicular to this sub-manifold. If u > 0 does 
not hold, then the dimension of the sub-manifold is increased by one by omitting one of the active 
constraints, so maybe d” 4 0 can occur (Fig. 18.10b) (with projection onto a (lateral) face). Since Aj 
is often obtained from A, ι by adding or erasing a row, the calculation of (A, AT) | can be simplified 
by the use of (Ay ΙΑ. a7)... 

W Solution of the problem of the previous example on p. 938. 

Step 1: x! = (3,0), 

I: Vf (x!) = (—4,—-32)", —V f (x!) is feasible, d! = (4, 32)". 


NE , j . 1; 10 
III: The step size is determined as in the previous example: o4 = —, x? = (= 
5 


0° * 
Step 2: 
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III: αν = Ξ x? = (3,2). 
Step 3: 
I: Vf (x3) = (—4, —16) (not feasible), Jo(x) = {3,4}, Ag = e 3 
0 0 28 8T 
II: P3 = (a 7 pod = (0,0)",u = (.-3) ug <0: As = (12). 
1/4 -2 " ( 16 T 
IL P=- ,d-(-—-2l). 
? 8 (À 1 ) j 5'5 
ENT 
lii. x= (2, ;) 
16 2 
Step 4 


I: Vf(x!) = (—6, —12)7 (not feasible), Ij(x*) = {3}, Aa = Ag. 
I: P= P5, dê = (0.0): u=6>0. 


It follows that x* is a minimum point. 


18.2.8 Penalty Function and Barrier Methods 

The basic principle of these methods is that a constrained optimization problem is transformed into 
a sequence of optimization problems without constraints by modifying the objective function. The 
modified problem can be solved, e.g., by the methods given in Section 18.2.5. With an appropriate 
construction of the modified objective functions, every accumulation point of the sequence of the solu- 
tion points of this modified problem is a solution of the original problem. 


18.2.8.1 Penalty Function Method 
The problem 


f(x) = min! subject to g;(x) <0 (i=1,2,...,m) (18.107 
is replaced by the sequence of unconstrained problems 

A(x, pe) = f(x) + pyS(x) = min! with x € R”, pp > 0 (kK =1,2,...). (18.108 
Here, p; is a positive parameter, and for S(x) 


=0 κε, 

SG)-[20 Xai ( 
holds, i.e., leaving the feasible set M is punished with a “penalty” term ιο (κ). The problem (18.108 
is solved with a sequence of penalty parameters p; tending to oo. Then 

Jim H(x, pk) = f(x), xe M. (18.110 

06 


8.109 


If x^ is the solution of the k-th penalty problem, then: 
H(x", pr) > He pra), fa fx, (18.111 

and every accumulation point x* of the sequence {x*} is a solution of (18.107). Ifx* € M, then x* is 

a solution of the original problem. 

For instance, the following functions are appropriate realizations of S(x): 


S(x) = max' (0, φι(κ),..., θμι(κ)} (r =1,2,...) or (18.112a 
S(x) = ο ge) ος). (18.112) 


If functions f(x) and g;(x) are differentiable, then in the case r > 1, the penalty function H (x, pp 
is also differentiable on the boundary of M, so analytic solutions can be used to solve the auxiliary 
problem (18.108). 

Fig. 18.11 shows a representation of the penalty function method. 
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f(x) 


Figure 18.11 Figure 18.12 


E f(x) 
The necessary optimality condition is: 

_ (2x, — 2p, max{0, 1 — xı — 29) V (0 
VH(x, Pi) = ‘ee —2p,max(0,1—24— 29) / | NO. 
The gradient of H is evaluated here with respect to x. By subtracting the equations we have 11 = πο. 


The equation 2, — 2p, max(0,1 — 24) = 0 has a unique solution xf = rk = 1 . . We get the 
Dk 


x? +22 = min! for x; +2 > 1, H(X, py) = 2? + 22 + py max?(0,1 — αι — T2}. 


VEMM" Di 
solution rj = 25 = jim is 
o0 Pk 


18.2.8.2 Barrier Method 
A sequence of modified problems is considered in the form 
H(x.q,) = f(x) + αι (κ) 2 min!, qj» 0. (18.113) 
The term q; B(x) prevents the solution leaving the feasible set M, since the objective function increases 
unboundedly on approaching the boundary of M. The regularity condition 
M°={xeM : g(x) «0 (i=1,2,...,m)} ZÜ and M» — M (18.114) 
must be satisfied, i.e., the interior of M must be non-empty and it is possible to get to any boundary 
point by approaching it from the interior, i.e., the closure of M? is M. 
'The function B(x) is defined to be continuous on M?. It increases to oo at the boundary of M. The 


modified problem (18.113) is solved by a sequence of barrier parameters qx tending to zero. For the 
solution x* of the k-th problem (18.113) 


FE) < pen). (18.115) 
holds and every accumulation point x* of the sequence (x^) is a solution of (18.107). 
Fig. 18.12 shows a representation of the barrier method. 
The functions, e.g., 


1 
- by letting k > oo. 


m 


B(x) = -X -In(-gi(x)), x€ M? or (18.116a) 
i=l 
m 1 Ü 
B(x)- (r212,..), XEM (18.116b) 


Σ [—gi(x)]" 


are appropriate realizations of B(x). 
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E f(x) = r? + αὖ = min! subject to zi + zo > 1, H(x,q) z2 + z + αε[-- ]η(αι + za — 1)), 


221 — dg — — —— 
πι — T2 — 1 : -- 
αι +x > 1, VA(x, qk) = i = (5) , απ r2 1. The gradient of H is given 
222 — qi ———— —— 
ae πι Ἔσο-- 1 
with respect to x. 
1 c A 
Subtracting the equations results in σι = 20, 22; — qk 0,%>-,—> -- ee 
2πι-- 1 2 4 
Lo . 1 1 1 * 4 ο 
0, EE x5 1 I 16 t 10» Κ » o0, qk > 0: v= 23 => 


The solutions of problems (18.108) and (18.113) at the k-th step do not depend on the solutions of the 
previous steps. The application of higher penalty or smaller barrier parameters often leads to conver- 
gence problems with numerical solutions of (18.108) and (18.113), e.g., in the method of (18.2.4), in 
particular, if no good initial approximation is available. Using the result of the k-th problem as the 
initial solution for the (k + 1)-th problem the convergence behavior can be improved. 


18.2.9 Cutting Plane Methods 


1. Formulation of the Problem and Principle of Solution 
Let consider the problem 


f(x) =c'x=min!, cem" (18.117) 
over the bounded region M C R” given by convex functions g;(x) (i = 1,2,...,m) in the form gi(x) € 
0. A problem with a non-linear but convex objective function f(x) is transformed into this form, if 

f(x)—zxaa <0, «πμ ER (18.118) 
is considered as a further constraint and 

T(E) = £n} = min! forall X= (X, z,,,) € R” (18.119) 
is solved with g;(x) = g;(x) < 0. 

The basic idea of the method is the iterative linear approximation of M by a convex polyhedron in the 


neighborhood of the minimum point x*, and therefore 
the original program is reduced to a sequence of linear 
programming problems. 

First, a polyhedron is determined: 


P,={xeR": ax< b; i=1,...,s}. (18.120) 
From the linear program 
f(x) =min! with xe P (18.121) 


an optimal extreme point x! of P, is determined with re- 
spect to f(x). If x! € M holds, then the optimal solu- 
Figure 18.13 tion of the original problem is found. Otherwise, a hy- 
perplane is determined: 

Ay = {κ: a, 47x = b, Aggy 72! > 0,4), which separates the point x! from M, so the new polyhe- 
dron contains 

Py ={xE Pi: ax Eb). (18.122) 
Fig. 18.13 shows a schematic representation of the cutting plane method. 
2. Kelley Method 
The different methods differ from each other in the choice of the separating planes H;. In the case of 
the Kelley method Hj is chosen in the following way: A jp is chosen so that 
gj, (X) = max{g;(x*) (i =1,...,m)}. (18.123) 
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At the point x = x^, the function gj,(x) has the tangent plane 
T(x) = gj, (x") + (x κ’) Vgj, (x^). (18.124) 


The hyperplane H; = {x € R” : T(x) = 0} separates the point x* from all points x with gj, (x) < 0. 
So, for the (k + 1)-th linear program, T(x) € 0 is added as a further constraint. Every accumulation 
point x* of the sequence {x*} is a minimum point of the original problem. 

In practical applications this method shows a low speed of convergence. Furthermore, the number of 
constraints is always increasing. 


18.3 Discrete Dynamic Programming 


18.3.1 Discrete Dynamic Decision Models 


A wide class of optimization problems can be solved by the methods of dynamic programming. The 
problem is considered as a process proceeding naturally or formally in time, and it is controlled by 
time-dependent decisions. If the process can be decomposed into a finite or countably infinite number 
of steps, then it is called discrete dynamic programming, otherwise it is a continuous dynamic program- 
ming. In this section, only n-stage discrete processes are discussed. 

18.3.1.1 n-Stage Decision Processes 

An n-stage process P starts at stage 0 with an initial state x, = Χρ and proceeds through the interme- 
diate states Χι, Χρ, ..., X, ; into a final state x, = x, € Xe C R”. The state vectors x; are in the 
state space X; C R”. To drive a state x; , into the state x;, a decision u; is required. All possible 
decision vectors u; in the state x; | form the decision space U;j(x; ,) C R*. From x; , the consecutive 
state x; can be obtained by the transformation (Fig. 18.14) 


xj;-gjí(Xx;,u) j= 1(1)n. (18.125) 
X,-X x x Xs. Xn = X 
πηρα) | 7 gu) | τν .“"ἠβιίκινα) R$ 

ue U; (xp) ue Ux) use UO) 


Figure 18.14 
18.3.1.2 Dynamic Programming Problem 


Our goal is to determine a policy (u,,...,u,) which drives the process from the initial state x, into 
the state x, considering all constraints so that it minimizes an objective function or cost function 
f (f(x, αι), ena J^ (X, αν ἄν). The functions Τσε, ιν uj) are called stage costs. The standard form 
of the dynamic programming problem is 
OF:  f(hi(xoyu)..... fa(x, 4, u,)) — min! (18.126a) 
CT: X; = gj (Xj ιν, Wy), aS 1(1}η, 
Xo = Xa, Xn = Xe E€ Xe, Xj c X; CR", 1-1(1}η, (18.126b) 
u; € Uj(x;-1) C R^, j= Yn. 


The first type of constraints x; are called dynamic and the others Χρ, u; are called static. Similarly to 
(18.126a), a maximum problem can also be considered. A policy (u,,...,u,,) satisfying all constraints 
is called feasible. The methods of dynamic programming can be applied if the objective function satis- 
fies certain additional requirements (see 18.3.3, p. 944). 
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18.3.2 Examples of Discrete Decision Models 
18.3.2.1 Purchasing Problem 


In the j-th period of a time interval which can be divided into n periods, a workshop needs v; units 
of a certain primary material. The available amount of this material at the beginning of period j is 
denoted by z; ,, in particular, rg = x, is given. The amounts uj are to be determined for a unit price 
cj, Which should be purchased by the workshop at the end of each period. The given storage capacity 


K must not be exceeded, i.e., xj-1 + uj € K. A purchase policy (u1,..., Un) should be determined, 

which minimizes the total cost. This problem leads to the following dynamic programming problem: 
n n 

OF:  f(u,...,u,) = do filu) = So eju; — min! (18.1272) 


j=l 


CT: 


i j=1(1)n, 
Uj(xj— i) dus: : max (0, vj — zj} < uj < K — rja}, j —1(1)n. 
In (18. 127b) itisensured that demands are satisfied and the storage capacity is not exceeded. If there is 
also a storage cost l per unit per period, then intermediate cost in the j-th period is (z;j.. + uj —v;/2)I, 
and the modified cost function is 


j — 1(1)n, 
(18.1275) 


n 


f (2o, αι... En-1; Un) = S (eju + (£j-1 + uy — vj/2) - D). (18.128) 


j=l 


18.3.2.2 Knapsack Problem 


One has to select some of the items A,..., A, with weights w1,..., Wn and with values c,...,c, so 
that the total weight does not exceed a given bound W, and the total value is maximal. This problem 
does not depend on time. It will be reformulated in the following way: At every stage a decision uj 
about the selection of item A; is made. Here, uj = 1 holds if A; is selected, otherwise u; = 0. The 
capacity still available at the beginning of a stage is denoted by xj—1, so the following dynamic problem 
arises: 

n 


OF: νι ἴδῃ} Cju; — max! 18.129a 
714} 


j=l 
CT: 2) = $2 —UWju, j — 1(1)n, 
a=W, 0<2;<W, j= 1(1)n, 
uj € {0,1}, falls zx; ιο}. 11) (18.129b) 
iy =0; falls xj- < w;, J= 1n: 


18.3.3 Bellman Functional Equations 
18.3.3.1 Properties of the Cost Function 


In order to state the Bellman functional equations, the cost function must satisfy two requirements: 


1. Separability 
The function f (fi (Xo, u1), --- , fa(Xn-1; u,)) is called separable, if it can be given by binary functions 
poo H4 and by functions Fi... Fn in the following way: 
FG xo)... fu o Un)) = Fifi (xoi αι)... fa a Un), 
Fifi Xo; uj: fn a uu) = Hi xo. αι), ρε Wa), μμ μα. 


Εν afa; Uu, 1), faa u,)) = Hya (fno; U, 4), Επ ης. Xs u,))) , 
Fifa as u,)) = fna αρ) 
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2. Minimum Interchangeability 
A function H (f(a), F(b)) is called minimum interchangeable, if: 

(in. , (Fla), F(b)) = min H (Κω. min Ρίο) (18.131) 
This property is satisfied, for example, if H is monotone increasing with respect to its second argument 
for every a € A, i.e., if for every a €. A, 

H (f(a), F(b))) < H (fla), F(bj)) for F(bi) € F(b»). (18.132) 


Now, for the cost function of the dynamic programming problem, the separability of f and the minimum 
interchangeability of all functions H;, j = 1(1)n — 1 are required. The following often occurring type 
of cost function satisfies both requirements: 


fe" — V f(x; 4, uj) or f" = E fix, Wj). (18.133) 
j=l 
The functions H; are 
Hy" βίο να) + Y. felo) and (18.134) 
k=j+1 
ας = ad fj Xj quj η); την ΠΠ ( δ. 35) 


18.3.3.2 Formulation of the Functional Equations 


The following functions are defined: 


p(x) = min H(G- w) -s fr(Xn-1,Un)), J =10)n, (18.136) 
a, EUR (X,_1) 
k=j(1)n 
Ónai(X,) = 0. (18.137) 
If there is no policy (u;....,u,) driving the state x; , into a final state x. € Xe, then we substitute 


@;(X;-1) = oo. Using the separability and minimum interchangeability conditions and the dynamic 
constraints for 7 = 1(1)n we get: 


O(a) μα uj) min PFja(fia(xj ua). fati ui). 
u;€Uj;(x; .,) us €Ux (xy. |) 
k=j+1(1)n 


= Ν᾽ (f; 35) 6) G5)) 


ΠΕ 


d&a) = min H (κ) ινα) $j41(9)(%)-1,0,))) - (18.138) 


η, εἴης, 4) 
Equations (18.138) and (18.137) are called the Bellman functional equations. φι(Χῃ) is the optimal 
value of the cost function f. 

18.3.4 Bellman Optimality Principle 


The evaluation of the functional equation 


éjG;a)- min H; (Hau), dl) (18.139) 


ujeUj(x, 4) 
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corresponds to the determination of an optimal policy (uj,..., τας) for which the subprocess P; starting 
at state x;_, and consisting of the last n—j +1 stages of the otal, process P minimizes the cost Fonction, 
i.e., 


Fj(fj(xj 4, λεν νι fn, 5 u,)) — min!. (18.140) 
'The optimal policy of the process P; with the initial state x; , is independent of the decisions u;, .. . 
uj of the first j — 1 stages of P which have driven P to the state x; ,. To determine $;(x; 1) ilie 
value ġj+ı(x;) is needed to know. Now, if (uj,..., už 


τας) is an optimal policy for Pj, then, obviously, 
(uj, .., u*) is an optimal policy for the anim Pj, starting at x; = g;(x; 4, uj). This state- 
ment is generalized i in the Bellman optimality principle. 

Bellman Principle: If (uj, ..., u% ) is an optimal policy of the process P and (Κον... x7) is the corre- 
sponding sequence of states, then for every subprocess P;, 7 = 1(1)n, with initial state Xj , the policy 


(uj,..., 


18.3.5 Bellman Functional Equation Method 


18.3.5.1 Determination of Minimal Costs 


With the functional equations (18.137), (18.138) and starting with $,,4(x,) = 0 every value ó;(x; 4) 
with x; € Xj- is determined in decreasing order of j. It requires the solution of an optimum problem 


už) is also optimal. 


over the decision space U;(x; ,)forevery x; , € Xj-1. For every x;_, there isa minimum point u; € Uj 
as an optimal decision in the first stage of a subprocess P; starting at x; 4. If the sets X; are noi finite 
or they are too large, then the values @; can be calculated for a set of selected nodes Xj € Xj. 
The intermediate values can be calculated by a certain interpolation method. φι(Χρ) is the optimal 
value of the cost function of process P. The optimal policy (uj, ..., už) and the corresponding states 


1n 


(x5. ..., X) can be determined by one of the following two methods. 


18.3.5.2 Determination of the Optimal Policy 

1. Variant 1: During the evaluation of the functional equations, the computed u; is also saved 
for every Xj, € πι. After the calculation of 9; (xo), an optimal policy is got if x} = gi(xj, uj) is 
determined from Χρ = xj; and the saved ται = uj. From xj and the saved decision uj follows x5, etc. 
2. Variant 2: For every x; , € X;., only ¢;(x;_;) is saved. After every g(x Xj |) is known, a 


forward calculation is made. Starting with j — 1 and Xo = Xp one determines uj in increasing order of 
j by the evaluation of the functional equation 


Φις 4) = min H; (fy X vu) Φος] j-» Uj )))- (18.141) 


ujcU;(xi. 

xj = g;(xj ,,uj)isobtained. During the forward calculation, an optimization problem must be solved 
again at every stage. 

3. Comparison of the two Variants: The computation costs of variant 1 are less than variant 2 
requires because of the forward calculations. However decision uj is saved for every state x; ;, which 
may require very large memory in the case of a higher dimensional decision space U;(x; ,), while in 
the case of variant 2, only the values o;(x; ,) must be saved. Therefore, sometimes variant 2 is used 
on computers. 


18.3.6 Examples for Applications of the Functional Equation 
Method 


18.3.6.1 Optimal Purchasing Policy 


1. Formulation of the Problem 
The problem from 18.3.2.1, p. 944, to determine an optimal purchasing policy 
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OF [(ωι...., Un) = Σ cju; — min! (18.142a 
1-1 
CT £j=2j1tuj—v, j=1(1)n, 
t= %,0<a,< Kh, j=1(1)n, (18.142b 


U;(aj-1) = (uj : max{0, v; --αγ-η} < uj K — tj}, j=1(1)n 

leads to the functional equations 
n+1 (Ln) = 0, (18.143 
$;(rj31) = min {σι + jlt- +u; —¥;)), j-1(1)n. (18.144 


uj EU; (α]- 1) 


2. Numerical Example 


€ —4, @=3, 455, G=3, 065 —4, 06 —2, 
vi —6, v=7, v4 —4, v,—2, vg —4, ug —3. 
Backward Calculation: The function values ó;(v;.,) will be determined for the states αι = 


mem. dU. duc. 


0,1,...,10. Now, it is enough to make the minimum search only for integer values of uj. 
j—6: ¢6(%5) = min  cgug = ce max(0, vg — 2ο} = 2max(0,3 — σε}. 
ug €U6 (xs) 


According to variant 2 of the Bellman functional equation method, only the values of (x5) are written 
in the last row. For example, $4(0) is determined. 


$4(0) = a + θείο = 2) 
= min(28, 27, 26, 25, 24, 25, 26, 27, 30) = 24. 


zP0|[1[2[3|]4[5[6|7]8]T9? [310] 
jei 75 

2| 59 | 56] 53] 50] 47 441 41 | 38 | 35 | 32 | 29 
3| 44 | 39) 34] 29] 24] 21) 18) 15} 12] 9 | 6 
4| 24 |21|18]15 12 9| 6| 4| 2 00 
5| 22 |18|14]10, 6|4|2[|0]|0]|0 0 
6| 6 |4|2|0/0/0/0[|0/0.0]|0 


Forward Calculation: 
φι(2) = 75 = „min (4u + ó»(u, — 4)). 
One gets uj = 4 as the minimum point, therefore a} = xj + uj — vı = 0. This method is repeated for 
(0) and for all later stages. The optimal policy is: 
(uj, u5, u5, uj, us, ug) = (4, 10, 1, 6,0, 3). 
18.3.6.2 Knapsack Problem 


1. Formulation of the Problem 
Consider the problem given in 18.3.2.2, p. 944 


OF :f(ui,..., Un) = M cju; — max! (18.1452) 
ja 
CT: zj = 2-1 — wt, j=1(1)n, 
t=W, 0<a;<W, j=1(1)n, : 
w é au κ ας μαμα. (18.145b) 
= if rj 4 < wy, j — Yn. 


Since this i is a maximum problem, the Bellman functional equations are now 
On+1 (απ) =0, 
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$;(rj31) = max (cjuj+ $ju(rji-—wj;uj), 7 =1(1)n. 
uj ο 


The decisions can be only 0 and 1, so it is practical to apply variant 1 of the functional equation method. 
For j =n,n—-1,...,1: 
slaja) = SY toral = w) if gja > wy and cj + jyt = w) > pilej), 
ὌΝ = losa (ma) otherwise, 
lif aj. > w; and cj + ġj+1(£j-1 — wj) > Qj (£j-1), 
uj(£;1) = 0 otherwise. l l 
2. Numerical Example 


€j—1, &=2, co =3, ο], o=5, ος 4, 
w =2, we=4, w=6, w=3, ws=7, ωςςθ. 


W=10, n=6. 


Since the weights w; are integers, the possible values for x; are z; € {0,1,...,10}, j = 1(1)n, and zo = 
10. The table contains the function values $;(x;..,) and the actual decision u;(x;..,) for every stage and 
for every state z;.,. For example, the values of (x5). 3(2), $3(6), and $3(8) are calculated: 


(ας) = 0, if z5 < wg = 4, (ο = 0, if r5 < 4, 

P6\"5) = 1 ce = 6, otherwise, “6\"5) = 3 0. otherwise. 

$3(2): vq =2<wz3=3: $32) φι(2) = 3, us(2) = 0. 

$3(6): πο ws and c3 + óa(z» — wa) = 6 +3 < da(x2) — 10: Φι(6) = 10, υα(6) = 0. 
$3(8): αυ» ws and c3 + ós(xa — w3) = 6 + 9 φι(ωυ) 210:  ó3(8) = 15, ux(8) = 1. 


The optimal policy is 


(uj, u$, u3, uj, u$, ug) = (0,1,1,1,0,1), Φι(10) = 19. 
z;—0 1 2 3 4 5 6 7 8 9 10 | 
j-1 19; 0 
2 0:0) 3:0 41 |) 71 |) 9:0) 10:1 1311 13; 1| 15:0) 16; 0 | 15:1 
3 0:0,330/30/6:1/91| 9;0 | 10;0 | 12;1 | 15;1 | 161| 16.0 
4 0;0 | 3;1 | 3;1 1911601 9; 1 | 10;1 | 10;1 | 10; 1 19:01 16; 1 
5 0:0 1:1 ας οὐ σοι Fl) Τ1, 1) 7;1|13;1|13;1 
6 0;0|0;0]0;0}0;0)6;1] 61] 61. 61| 61| 6;1] 61 


19 Numerical Analysis 


The most important principles of numerical analysis will be the subject of this chapter. The solution 
of practical problems usually requires the application of a professional numerical library of numerical 
methods, developed for computers. Some of them will be introduced at the end of Section 19.8.3. The 
special computer algebra system Mathematica will be discussed with its numerical programs in Chapter 
20, p. 1023 and in Section 19.8.4.2, p. 1016. Error propagation and computation errors will be examined 
in Section 19.8.2, p. 1004. 


19.1 Numerical Solution of Non-Linear Equations in a Single 


Unknown 
Every equation with one unknown can be transformed into one of the normal forms: 
Zero form: f(a) =0. (19.1) 
Fixed point form: x = φ(α). (19.2) 


Suppose equations (19.1) and (19.2) can be solved. The solutions are denoted by z*. To get a first 
approximation of z*, we can try to transfom the equation into the form fi(x) = fo(r), where the 
curves of the functions y = fi(x) and y = f(x) are more or less simple to sketch. 

E /(α) — 22 —sinz = 0. From the shapes of the curves y = x? and y = sin z it can be seen that 2} = 0 
and 1ο Ξε 0.87 are roots (Fig. 19.1). 


M A 
y=) 


7 
y=ọ(x) 


-2 -1 70 1 2 3x 
x*=0 x*,=0.87 


Xy 


Figure 19.1 Figure 19.2 


19.1.1 Iteration Method 


The general idea of iterative methods is that starting with known initial approximations x, (k = 
0, 1,...,n) a sequence of further and better approximations is formed, step by step, hence the solu- 
tion of the given equation is approached by iteration, by a convergent sequence. A sequence is tried to 
be created with convergence as fast as possible. 


19.1.1.1 Ordinary Iteration Method 
To solve an equation given in or transformed into the fixed point form x = y(x), the iteration rule 
Inti = P(n) (n =0,1,2,...; zo given) (19.3) 


is used which is called the ordinary iteration method. It converges to a solution zx* if there is a neigh- 
borhood of α” (Fig. 19.2) such that 


ole) 9}! ς κ «1 (K const) (19.4) 
T-T 
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holds, and the initial approximation vo is in this neighborhood. If y(x) is differentiable, then the cor- 
responding condition is 


le'(x)| € K « 1. (19.5) 
The convergence of the ordinary iteration method becomes faster with smaller values of K. 
B z’ = sin x, i.e., n 0 1 2 3 4 5 
X444 = vVSin Lp. Ta 0.87 | 0.8742 | 0.8758 | 0.8764 | 0.8766 | 0.8767 


sin £n | 0.7643 | 0.7670 | 0.7681 | 0.7684 | 0.7686 | 0.7686 


Remark 1: In the case of complex solutions, x = u + iv is substituted. Separating the real and the 
imaginary part, a system of two equations is obtained for the real unknowns u and v. 


Remark 2: The iterative solution of non-linear equation systems can be found in 19.2.2, p. 961. 
19.1.1.2 Newton's Method 


1. Formula of the Newton Method 
To solve an equation given in the form f(x) = 0, mostly the Newton method is used which has the 
formula 

(0η) 


Tug] = Ln — (αι) 
n 


(n — 0,1,2,... ; xo is given), (19.6) 


i.e., to get a new approximation £n+1, the values of the function f(a) and its first derivative f'(x) at x, 
are needed. 

2. Convergence of the Newton Method 

The condition 


f(x) £0 (19.7a) 
is necessary for convergence of the Newton method, and the condition 
y ία 
ος <K<1 (K const) (19.7b) 


is sufficient. The conditions (19.7a,b) must be fulfilled in a neighborhood of ο” such that it contains all 
the points x, and z* itself. If the Newton method is convergent, it converges very fast. It has quadratic 
convergence, which means that the error of the (n+ 1)-st approximation is less than a constant multiple 
of the square of the error of the n-th approximation. In the decimal system, this means that after a 
while the number of exact digits will double step by step. 

E The solution of the equation f(x) = x? — a = 0, i.e., the calculation of x = ya (a > 0 is given), 
with the Newton method results in the iteration formula 


1 
Tn+ = 2 ο + 2) . (19.8) 
n | 0 | 1 | 2 | 3 

15 | 14166666 | 14142157 | 1.4142136 


We get for a = 2: 


Zn 


3. Geometric Interpretation 
'The geometric interpretation of the Newton method is represented in Fig. 19.3. The basic idea of the 
Newton method is the local approximation of the curve y = f(x) by its tangent line. 
4. Modified Newton Method 
If the values of f'(x») barely change during the iteration, it can be kept as constant for a while, and the 
so-called modified Newton method can be used: 

fs) 


Ὅπη = Ln — f(a ) 
m 


(m fixed, m « n). (19.9) 
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ya 


KA 


xY 


Figure 19.3 Figure 19.4 


The goodness of the convergence is hardly modified by this simplification. 
5. Differentiable Functions with Complex Argument 


The Newton method also works for differentiable functions with complex arguments. 


19.1.1.3 Regula Falsi 


1. Formula for Regula Falsi 
To solve the equation f(x) = 0, the regula falsi method has the rule: 
z = Tn — Lm 
Tny = d 
” " f (xn) <= f) 
Only the function values are calculated. The method follows from the Newton method (19.6) by approx- 
imating the derivative f'(xn) by the finite difference of f(a) between £n and a previous approximation 
Lm (m <n). 
2. Geometric Interpretation 
The geometric interpretation of the regula falsi method is represented in Fig. 19.4. The basic idea of 
the regula falsi method is the local approximation of the curve y = f(x) by a secant line. 
3. Convergence 
The method (19.10) is convergent if m is chosen so that f(r,,) and f(v,) always have different signs. 
If the convergence already seems to be fast enough during the process, it will speed up if one ignores 
the change of sign, and substitutes £m = c, 4. 


E f(z) =2?-sinz=0. 


f(x) (n=1,2,...; m «τι; £o, z, are given). (19.10) 


n | Az, = $4 — Za Gi | Flan) | Ayn = f (£n) — f (£n-1) Atn 
Ayn 

0 0.9 0.0267 

1 —0.3 0.87 —0.0074 —0.0341 0.8798 

2 0.0065 0.8765 —0.000252 0.007148 0.9093 

3 0.000229 0.876729 | 0.000003 0.000255 0.8980 

4 — 0.000003 0.876726 


If during the process the value of Az, /Ay, only barely changes, one does not need to recalculate it 
again and again. 

4. Steffensen Method 

Applying the regula falsi method with £m = £n—1 for the equation f(x) = x— y(x) = 0 the convergence 
often can be sped up, especially in the case g'(x) < —1. This algorithm is known as the Steffensen 
method. 

E To solve the equation x? = sin z with the Steffensen method, the form f(x) = x— vsin x = 0 should 
be used. 
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n | Δι = Tn — Tn- Bin | f(x) | Ay = f (£n) — [απ ι) ve 
Un 

0 0.9 0.014942 

1 —0.03 0.87 —0.004259 —0.019201 1.562419 

2 0.006654 0.876654 | —0.000046 0.004213 1.579397 

3 0.876727 | 0.000001 


19.1.2 Solution of Polynomial Equations 
Polynomial equations of n-th degree have the form 
f(x) = pal) = αμα" + ay 12" + +++ + aya + ag = 0. (19.11) 


For their effective solution efficient methods are needed to calculate the function values and the deriva- 
tive values of the function p, (x) and an initial estimate of the positions of the roots. 


19.1.2.1 Horner's Scheme 


1. Real Arguments 
To determine the value of a polynomial p, (x) of n-th degree at the point x = xo from its coefficients, 
first the decomposition 


Ῥπ(α) = αμα" + Anz") 4 -- + aaa? + az + ag = (x — xo)pa-a (x) + Pn(Xo) (19.12) 
is considered where p, (1) is a polynomial of (n — 1)-st degree: 

Daa(z) =al 2771 al, γα" ὃ... alm d αμ. (19.13) 
The recursion formula 

dj ,— Coa, tax, (k— n,n—1,...,0; af, =0, a^, = pp(xo)) (19.14) 


is obtained by coefficient comparison in (19.12) with respect to z^. (Note that a/,_; = αμ.) In this 
way, the coefficients a}, of p, 1(x) and the value p, (xo) are determined from the coefficients a, of p, (x). 
Furthermore fewer multiplications are required than by the "traditional" way. By repeating this pro- 
cedure, a decomposition of the polynomial p, 4(v) with the polynomial p, ο) is obtained, 


Pn—1() = (x — zo)pa-2(x) + Pn-1 (0) (19.15) 
etc., and a sequence of polynomials p, (x), p, 1(x) ... , pi(@), po(x) is resulted. The calculations of the 


coefficients and the values of the polynomial is represented in (19.16). 


On ün—1 ü5—2 +++ 03 a2 αι αρ 
P" "s PA misil? ο ον ον ΣΙ 
Xo αρα ι. X004 5..- T0903 Toal LOA, Tolo 
/ / f / / / n 
an-ı 05-2 ᾱπ- 8 κ. lg αι a Pn(2o) 
η ES ill ως ρα) 
20 Lang X004 3... LOA, LOA, T0d9 
n" “ “ n n 3 
πο An_3 Ωρ 4 vee Gy αρ Pn-1 To) (19 16) 
; Sq 
To Tag 


af” = po(2o) 


From scheme (19.16) the value p, (zo), and derivatives p“) (ao) are as: 


p. (xo) = Mpa-i(xo), pl! (το) = 2p, 2(xo), ..., p™ (zo) = nlpo(z). (19.17) 
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α pa(x) = α΄ + 22? — 32? — T. 2 —3 0 -T We see: 
The substitution value and 2 2 8 10 20 pa(2) = 13, 
derivatives of p4() are cal- 4 5 10118 pi(2) = 44, 
culated at πο = 2 according ϱ 2 12 34 — ΡΊ(3) — 66. 
to (19.16). s irla mya sen 
p AQ) = 60, 
4 
τες pP) = 24. 
2| 2 
10 
2 = 


Remarks: 
1. The polynomial p, (x) can be rearranged with respect to the powers of 2; — xo, e.g., in the example 
above there is pa(x) = (x — 2)* + 10(x — 2)? + 33(x — 2)? + A4(x — 2) + 13. 

2. The Horner scheme can also be used for complex coefficients αχ. In this case for every coefficient we 
have to compute a real and an imaginary column according to (19.16). 


2. Complex Arguments 
If the coefficients αχ in (19.11) are real, then the calculation of p, (xo) for complex values xo = uo + ivo 
can be made real. In order to show this, p, (x) is decomposed as follows: 


Ῥπ(α) = αμα" + a, 12" | +++» + a,x + αρ 
= (a? — pr — q)(a, ou"? +++» +a) +rız+ro with (19.18a) 
x? — px — q = (x — zo)(z — To), ie, p —2uo, q = — (u0? + vo’). (19.18b) 
Then, 
Pn(@o) = τια + ro = (riuo + ro) + irivo. (19.18c) 
To find (19.18a) the so-called two-row Horner scheme introduced by Collatz can be constructed: 
an απ-ι Gn-2 ...03 02 A, ao 
q ; qa, pes qaz qaz σαι (αρ 
p Dd, 2 pd, 3 ... pa, pa, pao (19.18d) 
αγ ο dos Moa s. A, do fri Το 
= an 
Βρι(ι) = x + 22? — 3x? — 7. Calculate the value of p, at ro = 2 — i, i.e., for p = 4 and q = —5. 


1 2 -3 0 --τ It results in: 
το -5 —30 —80 pa(zo) = 3410 — 87 = —19 — 34i. 
4 4 24 64 
[16 16| 34 -87 
19.1.2.2 Positions of the Roots κ 


1. Real Roots, Sturm Sequence 
The Cartesian rule of signs gives a first idea of whether the polynomial equation (19.11) has a real root, 
or not. 
a) The number of positive roots is equal to the number of sign changes in the sequence of the cooeffi- 
cients 

Gn, An-1,+++; Q1, Q0 (19.19a) 


or it is less by an even number. 
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b) The number of negative roots is equal to the number of sign changes in the coefficient sequence 
αρ, --ᾱι, Gg,..., (1) αν (19.195) 


or it is less by an even number. 

Bl p;(x) = x? — 6x5 + 102? + 132? — 15x — 16 has 1 or 3 positive roots and 0 or 2 negative roots. 
To determine the number of real roots in any given interval (a, b), Sturm sequences are used (see 1.6.3.2, 
2., p. 44). 
After πο the function values y, = p,(x,) at a uniformly distributed set of nodes x, = zo +v- h 
(h constant, v = 0,1,...) (which can be easily performed by using the Horner scheme) a good guess 
of the graph of the function and the locations of roots are obtained. If p,(c) and p,(d) have different 
signs, there is at least one real root between c and d. 
2. Complex Roots 

In order to localize the real or complex roots into a bounded region of the complex plane the following 
polynomial equation is considered which is a simple consequence of (19.11): 

f*(x) ο ^ + |a, 2|r"? 4 ---  [ai|r + [ao| = |a; |r" (19.20) 
and an upper bound ro is determined for the positive roots of (19.20), e.g., by systematic repeated trial 
and error. Then, for all roots zz, (k = 1,2,...,n) of (19.11), 

|az| € ro. (19.21) 
α f(x) = p(x) = z1--4.42?—20.012?—50.1224-29.45 = 0, f*(x) = 4.4r?--20.01r?--50.12r4-29.45 = 
r^. Some trials are 

r=6: f*(6) = 2000.93 > 1296 = rt, 

r=T7: f*(7) = 2869.98 > 2401 = rf, 

r=8: f*(8) = 3963.85 < 4096 = r*. 


From this it follows that |x} | < 8 (k = 1, 2,3,4). Actually, for the root x} with maximal absolute value 
—7 < xj < —6 holds. 

Remark: A special method has been developed in electrotechnics in the so-called root locus theory 
for the determination of the number of complex roots with negative real parts. It is used to examine 
stability (see [19.11], [19.31]). 


19.1.2.3 Numerical Methods 
1. General Methods 


The methods discussed in Section 19.1.1, p. 949, can be used to find real roots of polynomial equations. 
The Newton method is well suited for polynomial equations because of its fast convergence, and the 
fact that the values of f(x) and f'(x,) can be easily computed by using Horner’s rule. By assuming 
that an approximation £n of the root z* of a polynomial equation f(x) = 0 is sufficiently good, then 
the correction term ὁ = z* — x, can be iteratively improved by using the fixed-point equation 
1 1 : x 

ὃ -- Fa) Fa) + 3 f" (z,)9 +++} = φ(δ). (19.22) 
2. Special Methods 
The Bairstow method is well applicable to find root pairs, especially complex conjugate pairs of roots. 
It starts with finding a quadratic factor of the given polynomial like the Horner scheme (19.18a-d) by 
determinung the coefficients p and q which make the coefficients of the linear remainder ro and rı equal 
to zero (see [19.30], [19.11], [19.31]). 
If the computation of the root with largest or smallest absolute value is required, then the Bernoulli 
method is the choice (see [19.19/). 
The Graeffe method has some historical importance. It gives all roots simultaneously including complex 
conjugate roots; however the computation costs are tremendous (see [19.11], (19.31]). 
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19.2 Numerical Solution of Systems of Equations 


In several practical problems, there are m conditions for the n unknown quantities x; (i = 1,2,...,n) 
in the form of equations: 


Fi(«1,22,---;2n) =0, 


Fo(zi, £2 2 =0, 
^ (19.23) 
Fu(zxi,23,...,*24) = 0. 
The unknowns z; are to be determined so that they form a solution of the system of equations (19.23). 
Mostly m Ξ n holds, i.e., the number of unknowns and the number of equations are equal to each 


other. In the case of m > n, (19.23) is called an over-determined system; in the case of m < n it is an 
under-determined system. 

Over-determined systems usually have no solutions. Then one looks for the *best" solution of (19.23), 
in the Euclidean metric with the least squares method 


m 


XO FP (a1, 29, ..., En) = min! (19.24) 
i=l 


or in other metrics as another extreme value problem. Usually, the values of n — m variables of an under- 
determined problem can be chosen freely, so the solution of (19.23) depends on n — m parameters. It 
is called an (n — m)-dimensional manifold of solutions. 

Linear and non-linear systems of equations are distinguished, depending on whether the equations are 
only linear or also non-linear in the unknowns. 


19.2.1 Systems of Linear Equations 


Consider the linear system of equations 


ayti + απο + +++ + ainin = bi, 
atı + Ag2%2 + +++ + Amn = bz, 
: : (19.25 
απιᾶι + Angle + +++ + Annin = bn. 
The system (19.25) can be written in matrix form 
Ax=b (19.26a 
with 
@11 Q12 *** Gin bi, Ti, 
Q21 Q22 *** Gan be, T2, 
A=]. ; b= : x= ca vif (19.26b 
Gn] An2°"* Ann Dn Tn 
Suppose the quadratic matrix A = (aix) (i,k = 1,2,...,n) is regular, so system (19.25) has a unique 


solution (see 4.5.2.1, 2., p. 309). In the practical solution of (19.25) two types of solution methods are 
distinguished: 

1. Direct Methods are based on elementary transformations, from which the solution can be obtained 
immediately. These are the pivoting techniques (see 4.5.1.2, p. 307) and the methods given in 19.2.1.1— 
19.2.1.3. 

2. Iteration methods start with a known initial approximation of the solution, and form a sequence 
of approximations that converges to the solution of (19.25) (see 19.2.1.4, p. 960). 


19.2.1.1 Triangular Decomposition of a Matrix 
1. Principle ofthe Gauss Elimination Method 


By elementary transformations 
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1. interchanging rows, 

2. multiplying a row by a non-zero number and 

3. adding a multiple of a row to another row, 

the system A x = b is transformed into the so-called row echelon form 


Tu ΤΊ» Τι ... Tin 
T22 Το ... Tn 
Rx=c with R= T33 +++ T3n |. (19.27) 
0 ο 
Tnn 


Since only equivalent transformations were made, the system of equations R x = c has the same solu- 
tions as A x = b. From (19.27) it follows: 


» 1 n . 
Zn i ου... (« 5 run] (i2n—1,n—2,...,1). (19.28) 


Tnn Tii k=i+1 


The rule given in (19.28) is called backward substitution, since the equations of (19.27) are used in the 
opposite order as they follow each other. 


The transition from A to R is made by n — 1 so-called elimination steps, whose procedure is shown by 
the first step. This step transforms matrix A into matrix Aj: 
a) Ὁ (1) 
αι 1ο --- Qin 
αι Q12 ..: Gin 0 ΠΣ 
üo| Q2 ... Gan 0 | agg ... Gon 
= | a: a32 ... Qg -- 6) (1) 
A=] %1 03 3n |, Aj—]| 0 αμ... αν]. (19.29) 
Ani (πα +--+ Ann : (1) 1 
0 απο... at 


Then: 


1. Ana,; 4 0 is chosen (according to (19.33)). If there is none, stop: A is singular. Otherwise a;4 is 
called the pivot. 


2. The first and the r-th row of A are interchanged. The result is A. 
3. The l; (i = 2,3,..., n) multiple of the first row is subtracted from the i-th row of the matrix A. 
The result is the matrix A; and analogously the new right-hand side b, with the elements 


al = Gz — (Ty, with la = B 
11 
WP =b; -lab (i,k =2,3,...,n). (19.30) 


The framed submatrix in A, (see (19.29)) is of type (n — 1,n — 1) and it will be handled analogously 
to A, etc. This method is called the Gaussian elimination method or the Gauss algorithm (see 4.5.2.4, 
p. 312). 

2. Triangular Decomposition 

The result of the Gauss elimination method can be formulated as follows: To every regular matrix A 
there exists a so-called triangular decomposition or LU factorization of the form 


PA-LR (19.31) 
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with 
Tij. ΤΊ» Τι :.. Tin 1 
T22 T23 ... Tn lı 1 0 
R= Ta e T |, = | {αι i» 1 . (19.32) 
0 ` à : : i 
Tnn lui Ino tee leni 1 


Here R is called an upper triangular matrix, L is a lower triangular matrix and P is a so-called permu- 
tation matriz. A permutation matrix is a quadratic matrix which has exactly one 1 in every row and 
every column, and the other elements are zeros. The multiplication P A results in row interchanges in 
A, which comes from the choices of the pivot elements during the elimination procedure. 


2$ 1 6 Ti 2 
W The Gauss climination method should be used for the system (2 1 s) (s) = ῃ jn 
1 1 1/ Ws 4 
schematic form, where the coefficient matrix and the vector from the right-hand side are written next 
to each other (into the so-called extended coefficient matriz), the calculations are: 


3 16 |2 31 6 |2 


3 1 6 |12 
(Α.8 τσ 1 3/7] > | 2/3 | 1/9 -1]|17/3 | 5 [1/32/3_-1 10/3 | ie, 
1 11/4/4 1/3 | [2/3] -1 |} 10/3 2/31/2|-1/2]| 4 


100 316 1 0 0 3 1 6 
P-|0 1) 2 PA=/111),L=[1/3 1 o|,R-[o 2/3 -1 J. 
010 2 1.3 2/3 1/2 1 0 0 -1/2 


In the extended coefficient matrices, the matrices A, A, and A», and also the pivots are shown in boxes. 
Solution: 73 = —8, 22 = —7, πι = 19. 

3. Application of Triangular Decomposition 

With the help of triangular decomposition, the solution of the linear system of equations Ax = b can 
be described in three steps: 

1. PA = LR: Determination of the triangular decomposition and substitution R x = c. 

2. Lc - P b: Determination of the auxiliary vector c by forward substitution. 


3. Rx = c: Determination of the solution x by backward substitution. 
If the solution of a system of linear equations is handled by the expanded coefficient matrix (A, b), 
as in the above example, by the Gauss elimination method, then the lower triangular matrix L is not 
needed explicitly. This can be especially useful if several systems of linear equations are to be solved 
after each other with the same coefficient matrix, with different right-hand sides. 


4. Choice ofthe Pivot Elements 


] k-l : 
Theoretically, every non-zero element αἱ 1 ) of the first column of the matrix A,_ could be used as a 
pivot element at the k-th elimination step. In order to improve the accuracy of solution (to decrease 
the accumulated rounding errors of the operations), the following strategies are recommended. 


1. Diagonal Strategy The diagonal elements are chosen successively as pivot elements if possible, 
i.e., there is no row interchange. This kind of choice of the pivot element makes sense if the absolute 
value of the elements of the main diagonal are fairly large compared to the others in the same row. 


2. Column Pivoting To perform the k-th elimination step, such row index r is chosen for which: 


afk). (19.33) 


k-1) 
laik | = max 
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Ifr σε k, then the r-th and the k-th rows will be interchanged. It can be proven that this strategy makes 
the accumulated rounding errors smaller. 


19.2.1.2 Cholesky's Method for a Symmetric Coefficient Matrix 
In several cases, the coefficient matrix A in (19.26a) is not only symmetric, but also positive definite, 
i.e., for the corresponding quadratic form Q(x) holds 

n n 


Q(x) = xTAx 2 Y Y onrar > 0 (19.34) 


i-lk-l 
for every x € R”, x Æ 0. Since for every symmetric positive definite matrix A there exists a unique 
triangular decomposition 


Α - ΤΙ (19.35) 
with 
lu 
li» 0 
L= la days dua , (19.364) 
li [12 Ing lan 
at) 
lek = ly = F (i=k,k+1,...,n); (19.36b) 
kk 
αἰ = aG — lade (7 =h+1,k+2,...,n), (19.566) 


the solution of the corresponding linear system of equations A x = b can be determined by the Cholesky 
method by the following steps: 

1. A = LLT: Determination of the so-called Cholesky decomposition and substitution LTx = c. 

2. Lc — b: Determination of the auxiliary vector c by forward substitution. 

3. LTx = c: Determination of the solution x by backward substitution. 

For large values of n the computation cost of the Cholesky method is approximately half of that of the 
LU decomposition given in (19.31), p. 956. 


19.2.1.3 Orthogonalization Method 
1. Linear Fitting Problem 


Suppose an over-determined linear system of equations 
n 
» ana, b; (6—1,2,..., m; m >n), (19.37 
k=1 


is given in matrix form 

Ax=b. (19.38 
Suppose the coefficient matrix A = (αι) with size (m x n) has full rank n, i.e., its columns are linearly 
independent. Since an over-determined linear system of equations usually has no solution, instead o 
(19.37) the so-called error equations are considered 


Τι Ξ > Ant, — bi (i—1,2,...,m; m >n) (19.39 
k=1 


with residues τι, and the sum of their squares should be minimalized: 


m m 


n 2 
5 τὸ = 5 | p» Qikčk — J = F(ri,25,..., 24) = min! (19.40 
i-l k=l 


19.2 Numerical Solution of Systems of Equations 959 


The problem (19.40) is called a linear fitting problem or a linear least squares problem (see also 6.2.5.5, 


p. 456). The necessary condition for the relative minimum of the sum of residual squares F'(a1,X2,..., 
Ln) is 

OF 

— =0 (E-242,42n 19.41 

ο ο. (19.41) 
and it leads to the linear system of equations 

ATAx = ΑΤΡ. (19.42) 


The transition from (19.38) to (19.42) is called a Gauss transformation, since the system (19.42) arises 
by applying the Gaussian least squares method (see 6.2.5.5, p. 456) for (19.38). Since A is supposed 
to be of full rank, ATA is a positive definite matrix of size (n x n), and the so-called normal equations 
(19.42) can be solved numerically by the Cholesky method (see 19.2.1.2, p. 958). 

One can have numerical difficulties with the solution of the normal equations (19.42) if the condition 
number (see [19.24]) of the matrix ATA is too large. The solution x can then have a large relative 
error. Because of this problem, it is better to use the orthogonalization method for solving numerically 
linear fitting problems. 


2. Orthogonalization Method 
'The following facts are the basis of the following orthogonalization method for solving a linear least 
squares problem (19.40): 
1. Thelength of a vector does not change during an orthogonal transformation, i.e., the vectors x and 
X = Qox with 

QQ =E (19.43) 
have the same length. 
2. For every matrix A of size (m,n) with maximal rank n (n < m) there exists an orthogonal matrix 
Q of size (m, m) such that 


ΤῊ Tig -.. Tin 
Τρο ... Tan 
^ ; T ^ R ee Y 
A-QR (19.44) with Q Q—E and R= ο) = ο. (19.45) 
Tnn 
(0) 


Here R is an upper triangular matrix of size (n, n), and O is a zero matrix of size (m — n,n). 
The factored form (19.44) of matrix A is called the QR decomposition. So, the error equations (19.39) 
can be transformed into the equivalent system 


ΤΊ131 + Tj229 +... + Tinkn --θι fi, 
Tata 4 + Tontn --ὂο f, 
Tnntn —bn = fa; (19.46) 


—bn+1 = Prt, 


= bm = Tm 


without changing the sum of the squares of the residuals. From (19.46) it follows that the sum of the 
squares is minimal for 7, = Το = +++ = f, = 0 and the minimum value is equal to the sum of the squares 
of f441 to Pm. The required solution x can be got by backward substitution 


Rx = by; (19.47) 
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where bo is the vector with components δι, bo, eee bn obtained from (19.46). 
There are two methods most often used for a stepwise transition of (19.39) into (19.46): 


1. Givens transformation, 

2. Householder transformation. 

The first one results in the QR decomposition of matrix A by rotations, the other one by reflections. 
The numerical implementations can be found in [19.33]. 


Practical problems in linear least squares approximations are solved mostly by the Householder trans- 
formation, where the frequently occurring special band structure of the coefficient matrix A can be 
used. 


19.2.1.4 Iteration Methods 
1. Jacobi Method 


Suppose in the coefficient matrix of the linear system of equations (19.25) every diagonal element aj; 
(i = 1,2,...,n) is different from zero. Then the i-th row can be solved for the unknown ση, and it 
immediately results the following iteration rule, where p is the iteration index: 


, b T Qi (p) j 
ο. x 5 =a σι ο μα ο) (19.48) 
Qii kı απ 
(i#i) 
(1:0,1,2, 5 αἴ), πο, T" ES) are given initial values). 


Formula (19.48) is called the Jacobi method. Every component of the new vector xt» is calculated 
from the components of x). If at least one of the conditions 


n 
Qik ae 
max Ne Sk ex column sum criterion (19.49) 
a a 
i=1 bid 
(“Ὁ 
or 
n. e 
max 5 —|«1 row sum criterion (19.50) 
i α 
k-i |i 
“πὴ 


holds, then the Jacobi method is convergent for any initial vector x (9). 


2. Gauss-Seidel Method 


If the first component α{μτ) is calculated by the Jacobi method, then this value can be used in the 
calculation of at +) While proceeding similarly in the calculation of the further components, the 
following iteration formula is obtained: 
b i—l Qin n αν, 
it) ----Σ = ot) Σ ik ple) (19.51) 
Qii pay Vii kai C 
(421,2, 495 πο, a. vis 0) given initial value; jj = 0,1,2,...). 


Formula (19.51) is called the Gauss-Seidel method. The Gauss-Seidel method usually converges faster 
than the Jacobi method, but its convergence criterion is more complicated. 


W 102, 3r» 493+ 2z4 = 14, 


921 + 26% + 523 Tä 22; 
4πι + ὅπο + 1623 + ὅσα πρ 
2%, + 3v 41η — 1224 20. 


The corresponding iteration formula according to (19.51) is: 
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(μι) l Aa αγ) a MONDO Some approximations and the solution 
si 10 (14 , DL δι ) are given here: 
1 (0) (1) (4) (5) 
ait) 26 (22 t 30h) bat) I ap, z | = | EE | = | x 
1 0 |14 1.5053 | 1.5012 | 1.5 
aft) (17 + 445}. 5a — 5a) , 0 | 1.0077 | 0.9946 | 0.9989 | 1 
16 0 | 1.0976 | 0.5059 | 0.5014 | 0.5 
1 f 
oer) πί 90 4 gt) 31 0*9 Ax. 0 | 1.7861 | 1.9976 | 1.9995 | 2 


3. Relaxation Method 
The iteration formula of the Gauss-Seidel method (19.51) can be written in the so-called correction 
form 


b; lag ” Qik 
it) a") 4 ( : μα. - a) , ie, 


i 


απ kzı Qi kzi Ci 
aD = pM 4d (6 =1,2,...,n; w= 0,1,2,...). (19.52) 
By an appropriate choice of a relaxation parameter w and rewriting (19.52) in the form 
αντ} 20 Lud (i =1,2,...,n; p= 0,1,2,...), (19.53) 
one can try to improve the speed of convergence. It can be shown that convergence is possible only for 
0<w<2. (19.54) 


For w = 1 we retrieve the Gauss-Seidel method. In the case of w > 1, which is called over-relaxation, 
the corresponding iteration method is called the SOR method (successive overrelaxation). The deter- 
mination of an optimal relaxation parameter is possible only for some special types of matrices. 
Iterative methods are applied to solve linear systems of equations in the first place when the main diag- 
onal elements a;; of the coefficient matrix have an absolute value much larger than the other elements 
aig (à 7 k) (in the same row or column), or when the rows of the system of equations can be rearranged 
in a certain way to get such a form. 


19.2.2 System of Non-Linear Equations 
Suppose the system of n non-linear equations 

Pii $53, 2.,254,)—0 ($21,2,...,5) (19.55) 
for the n unknowns αι. £2, ..., r, has a solution. Usually, a numerical solution can be given only by 
an iteration method. 
19.2.2.1 Ordinary Iteration Method 


The ordinary iteration method can be used if the equations (19.55) can be transformed into a fixed- 
point form 


i = fí(m1zxa...,94) (151,2,...,n). (19.56) 
Then, starting from estimated approximations a. a ) .., ©, the improved values are obtained 
either by 
1. iteration with simultaneous steps 

it) a f, (al? αν, "T al?) (i =1,2,...,n; w=0,1,2,...) (19.57) 


or by 
2. iteration with sequential steps 


it) = fi (αὖτ ενα ct) a), ol), m ol) (21,2,...,5; w= 0,1,2,...). (19.58) 
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It is of crucial importance for the convergence of this method that in the neighborhood of the solution 
the functions f; should depend only weakly on the unknowns, i.e., if f; are differentiable, the absolute 
values of the partial derivatives must be rather small. We get as a convergence condition 


7 4). (19.59) 


With this quantity A’, the error estimation is the following: 


n 


K <1 with K= max (x max 
k=1 


è α{ τ. ης 


K 
K max |= rn x xu (19.60) 


(u) 


i 


+1) 


Here, x; is the component of the required solution, x; and x; (n 


(u + 1)-th approximations. 


19.2.2.2 Newton's Method 


The Newton method is used for the problem given in the form (19.55). After finding the initial ap- 


n " 0 0 t " x 
proximation values ας ) D bas , a9. the functions F; are expanded in Taylor form as functions of n 


independent variables £1, 15,...,c, (see p. 471). Terminating the expansion after the linear terms, 
from (19.55) a linear system of equations is obtained, and iterative improvements can be got by the 
following formula: 


Fi (οὐ), 2$... 209) + D ae (21^... 209) (af — a) =0 (19.61) 


are the corresponding p-th and 


(i — 1,2: πμ SO AD) 


The coefficient matrix of the linear system of equations (19.61), which should be solved in every iteration 
step, is 
j OF; (u) (n 
F(x) = (= (υ χμ) ας. ἰὴ) (i,k =1,2,...,n) (19.62) 
and it is called the Jacobian matriz. Ifthe Jacobian matrix is invertible in the neighborhood of the so- 
lution, the Newton method is locally quadratically convergent, i.e., its convergence essentially depends 


on how good the initial approximations are. If x; DH. a) = di? are substituted in (19.61), then the 
Newton method can be written in the corr belio form 


aU =o) Ld (i212, n w= 0,1,2,...). (19.63) 


To reduce the sensitivity to the initial values, analogously to the relaxation method, a so-called damping 
or step length parameter y can be introduced (damping method): 


wl) =o) pod) (i212, m p =0,1,2,...; y> 0). (19.64) 


Methods to determine y can be found in. [19.24]. 

19.2.2.3 Derivative-Free Gauss-Newton Method 

Το solve the least squares problem (19.24), one proceeds iteratively in the non-linear case as follows: 

1. rn τ a suitable initial approximation aO af, ..., 0), the non-linear functions F;(21, x», 
Zn) (i = 1,2,...,m;m > n) are λα α as in the Newton method (see (19.61)) by linear 


n F, (£1, £2,- - , &n), which are calculated in every iteration step according to 
F(a, 244) 8 (ai? ae, ΜῊ alt) + P» Oti (αὐ es , ait) (αι — ap") 
SU, ; pees Ep 2 δει ΠΟΙ k 


@=1,2,...,m; μΞ0,1,2,...).. (19.65) 
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2. di ) = tp — ait ) are substituted in (19.65) and the corrections di" ) are determined by using the 
Gaussian least squares method, i.e., by the solution of the linear least squares problem 


m 


YO F? (a1,..-,2n) = min, (19.66) 
i=1 


e.g., with the help of the normal equations (see (19.42)), or the Householder method (see 19.6.2.2, 
p. 985). 
3. The approximations for the required solution are given by the formulas 


et) = a) + d? or (19.67a) 
aU — ol) Ld? (k= 1,2,...,n), (19.67b) 
where y (y > 0) is a step length parameter similar to the Newton method. 


(u+1) 


By repeating steps 2 and 3 with z; ' ' instead of ot) one gets the Gauss-Newton method. It results in 


a sequence of approximation ‘alii whose convergence strongly depends on the accuracy of the initial 
approximation. The sum of the error squares can be reduced by introducing a length parameter y. 
If the evaluation of the partial derivatives TA ον AD a) (i = 1,2,...,m; k =1,2,...,n) requires 
Tk 

too much work, the partial derivatives can be approximated by difference quotients : 

OF; ( qu) W) 1 W) ων i) , 

Bn. JU ον sna) mo Wi cu + aU pone ad) 

k 
_ pm (>) (Ht) MO rM — 
Bi eerste saan ee (1591,25 ms k= 15,2, i ft 0,152, c). (19.68) 


The so-called discretization step sizes no may depend on the iteration steps and the values of the vari- 
ables. 
If the approximations (19.68) are used, then only function values F; are to be calculated while perform- 
ing the Gauss-Newton method, i.e., the method is derivative free. 


19.3 Numerical Integration 


19.3.1 General Quadrature Formulas 


The numerical evaluation of the definite integral 


= / f(x) dz (19.69) 


must be done only approximately if the integrand f(x) cannot be integrated by elementary calculus, 
or it is too complicated, or when the function is known only at certain points αν, at the so-called inter- 
polation nodes from the integration interval [a,b]. The so-called quadrature formulas are used for the 
approximate calculation of (19.69). They have the general form 


-Σ CovYv + 5 ey +e +S ey? (19.70) 


v=0 v=0 
with y? = fH (αν) (u = 1,2,...,p; v = 1,2,...,n), y, = (αν). and constant values of c,,. Obvi- 
ously, 
I(f) 2 Q(f) - R, (19.71) 


where R is the error of the quadrature formula. In the application of quadrature formulas it is supposed 
that the required values of the integrand f (x) and its derivatives at the interpolation nodes are known 
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as numerical values. Formulas using only the values of the function are called mean value formulas; 
formulas using also the derivatives are called Hermite quadrature formulas. 


19.3.2 Interpolation Quadratures 


The following formulas represent so-called interpolation quadratures. Here, the integrand f(x) is in- 
terpolated at certain interpolation nodes (possibly the least number of them) by a polynomial p(x) of 
corresponding degree, and the integral of f(a) is replaced by that of p(z z). The formula for the integral 
over the entire interval is given by summation. Here the for mulas are given for the most practical cases. 
The interpolation nodes are equidistant: 


Zy = r9 vh (v=0,1,2,...,n), to =a, zx, =b, h= 2 (19.72) 
n 


An upper bound for the magnitude of the error |R] is given for every quadrature formula. Here, M, 


means an upper bound of |f (x)| on the entire domain. 


19.3.2.1 Rectangular Formula 


In the interval [zo, zo +h], f(x) is replaced by the constant function y = yo = f (xo), which interpolates 
f(a) at the interpolation node xo, which is the left endpoint of the integration interval. In this way the 
simple rectangular formula is obtained: 


ao+h h2 
J f()dreh:w, |R| < ZM. (19.73a) 
zo 


The left-sided rectangular formula by summation is 


(b — a)h 
2 


[roe (od yi +yz + ya |Β| τ Μι. (19.73b) 


M, denotes an upper bound of | f’(2)| on the entire domain of integration. 

One gets analogously the right-sided rectangular sum, if one replaces yo by y; in (19.73a). The formula 
is 

a)h 


[ree Alyy + yo ++ + Yn); inp e C=O, (19.74) 


19.3.2.2 Trapezoidal Formula 

f(x) is replaced by a polynomial of first degree in the interval [ro, xo + h], which interpolates f(a) at 
the interpolation nodes xp and zı = zo + h. The approximation is 

worn h h? 

J f(x)dze Slotu) [R] € το. (19.75) 
do 


The so-called trapezoidal formula can be obtained by summation: 


(b — a)? 


jp Me (19.76) 


fre ος 


My denotes an upper bound of | f”(2)| on the entire integration domain. The error of the trapezoidal 
formula is proportional to h?, i.e., the trapezoidal sum has an error of order 2. It follows that it converges 
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to the definite integral for  — 0 (hence, n — oc), if rounding errors are not considered. 


19.3.2.3 Simpson's Formula 
f(x) is replaced by a polynomial of second degree in the interval [xo, zo + 2h], which interpolates f(z) 
at the interpolation nodes £o, πι = % + hand πο = 20 + 2h: 
ro43-2h h h 5 
fle) de = gn +4y +y), |R|< gg Ma (19.77) 
αρ 


n must be an even number for a complete Simpson formula. The approximation is 


b 
h 
| κω dz = z (yo + 4yi 2o + 4ys ca Apa + Yn) (19.78) 


nd 
CTA 
180 


Mi is an upper bound for | / ? (x)| on the entire integration domain. The Simpson formula has an error 


of order 4 and it is exact for polynomials up to third degree. 


19.3.2.4 Hermite's Trapezoidal Formula 
f(x) is replaced by a polynomial of third degree in the interval [29, xo +h], which interpolates f(a) and 
f'(x) at the interplation nodes x and αι = 20 + h: 


|R| € 


roh 


h TOP P he 
Λίο) ἀν ο tn) ο σος. (19.79) 
ας 


The Hermite trapezoidal formula is obtained by summation: 


7 Yo Un hes, ; (b—a)h4 
πο n (5 Egi ya + sa d ) Γσσίθο- Yn) [RI < παρ M (19.80) 


2 12 


Ma denotes an upper bound for | f? (x)| on the entire integration domain. The Hermite trapezoidal 
formula has an error of order 4 and it is exact for polynomials up to third degree. 


19.3.3 Quadrature Formulas of Gauss 
Quadrature formulas of Gauss have the general form 


b n 
fro dx ~ X` cyy, with y, = (αν) (19.81) 
a v=0 

where not only the coefficients c, are considered as parameters but also the interpolation nodes z,. 
These parameters are determined in order to make the formula (19.81) exact for polynomials of the 
highest possible degree. 

The quadrature formulas of Gauss result in very accurate approximations, but the interpolation nodes 
must be chosen in a very special way. 


19.3.3.1 Gauss Quadrature Formulas 

If the integration interval in (19.81) is chosen as [a, b] = [—1, 1], and the interpolation nodes are chosen 
as the roots of the Legendre polynomials (see 9.1.2.6, 3., p. 566, 21.12, p. 1108), then the coefficients 
c, can be determined so that the formula (19.81) gives the exact value for polynomials up to degree 
2n +1. The roots of the Legendre polynomials are symmetric with respect to the origin. For the cases 
n = 1,2 and 3 they are: 
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m=]: zo9-—2au, Co = 
1 
ti = —= = 0.577 350 269..., εξ]. 
178 > G : 
n2: To= --ᾱο, Co = 9° 
7 HS 
σι = e 9 , (19.82) 
T2 = 'H = 0.774596669..., ο; = cy. 
n=3: To= —Zs, Co = 0.347 854 854... , 
πι = —Z2, c = 0.652 145 154... , 
= 0.339 981043... ose 
z3 = 0.861136311..., eg: 
Remark: A general integration interval [α, b] can be transformed into [—1, 1] by the transformation 
b- b 
t= 2 7:45 m (t € [a,b], x € [-1,1]). Then 
; b-ag i εὖ 
EO atx" y ef [ +H (19.83) 
2 2 LL 2 2 


with the values x, and c, given above for the interval [—1, 1]. 


19.3.3.2 Lobatto's Quadrature Formulas 

In some cases it is reasonable also to choose the endpoints of the subintervals as interpolation nodes. 
Then, there are 2n more free parameters in (19.81). These values can be determined so that polynomials 
up to degree 2n — 1 can be integrated exactly. In the cases n — 2 and n — 3: 


m= 2: qf om 

- DIT 1 D = "x == 1 
πο δν αρ —1, Co = 6 

7 M es 4 (19.84a) MN "TS 5 

1 7 = 3: 1 T2, απ $6 (19.84b) 
1-1, C= Cp. ty = — = 0.447213595..., co =e, 


The case n = 2 represents the Simpson formula. 


19.3.4 Method of Romberg 

To increase the accuracy of numerical integration the method of Romberg can be recommended, where 
one starts with a sequence of trapezoid sums, which is obtained by repeated halving of the integration 
step size. 

19.3.4.1 Algorithm of the Romberg Method 

The method consists of the following steps: 

1. Trapezoid sums determination 

The trapezoid sum T'(h;) according to (19.76) in 19.3.2.2, p. 964 is determined as an approximation of 


b 
the integral / f(x) dx with the step sizes 
Ja 


(055 071,2: 4. m). (19.85) 
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Here, the recursive relation 


hia hi4|l hia i | 3 
T(hi) τ - ) ; EQ ! s(a - ) -flat hia) 4 f(a Shia) 


2n — 1 1 
+fla+ 2h) +d ΠΟ; - hia) 240) (19.86) 
n—1l 
-$T (hi1) 4 τε oC H E : LS (i21,2,...,m; n = 2*1) 
j=0 


is considered. Recursion formula (19.86) tells that for the calculation of T(h;) from T (h;—1) the function 
values must be calculated only at the new interpolation nodes. 
2. Triangular Scheme 
To; = T(h;) (i = 0,1, 2,...) is substituted and the values 
ο. --- , 

Ta = Teac E^ i LL (k=1,2,...,m; i=k,k+1,...) (19.87) 
are calculated recursively. The arrangement of the values calculated according to (19.87) is most prac- 
tical in a triangular scheme, whose elements are calculated in a column-wise manner: 


T (ho) = Too 


T(hy)=To Tu 
Ths) Το Dis T (19.88) 
T(hs)=Tos Tis Tos T33 


The scheme will be continued downwards (with a fixed number of columns) until the lower values at 
the right are almost the same. The values Tı; (i = 1,2,...) of the second column correspond to those 
calculated by the Simpson formula. 


19.3.4.2 Extrapolation Principle 

The Romberg method represents an application of the so-called extrapolation principle. This will be 
demonstrated by deriving the formula (19.87) for the case k = 1. The required integral is denoted by 7, 
the corresponding trapezoid sum (19.76) by T(h). If the integrand of I is (2m + 2) times continuously 
differentiable in the integration interval, then it can be shown that an asymptotical expansion with 
respect to h is valid for the error R of the quadrature formula, and it has the form 


R(h) = I — T(h) = ah? + agh* +--+ + amh?” + Ο(μ” 13) (19.89a) 


or 
T(h) = I — αι) — agh — ++» — a, B?" + Ol): (19.89b) 


The coefficients αι, a2,. . . ‚ím are constants and independent of h. 


h 
T(h) and T (5) are formed according to (19.89b) and the linear combination 


Ti(h) = o4T(h) + aT (5) (αι + αὐ) — αι (οι | 230 az (αι | τ) μή... (19.90) 


: : ας A ; 
is considered. If a, + ag = 1 anda, 4 4 0 are substituted, then ΤΠ (A) has an error of order 4, while 


T(h) and T(h/2) both have errors of order only 2. The formula is 
h 


T|z|-T(h) 
T,(h) = 3T) | (5) στίς) TR L— (19.91) 
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This is the formula (19.87) for k = 1. Repeated application of the above procedure results in the 
approximation Tk; according to (19.87) and 


Ty = I + O(h? 12). (19.92) 


1 sing 


dx (integral sine, see 8.2.5, 1., p. 513) cannot be obtained in an 


Β The definite integral J = li 


20 
elementary way. Calculate the approximate values of this integral (calculating for 8 digits). 


k=0 | k-1 k-2 | k-3 
0.92073549 
1. Romberg method: 0.93979328 | 0.94614588 


0.94451352 | 0.94608693 | 0.94608300 
0.94569086 | 0.94608331 | 0.94608307 | 0.94608307. 


The Romberg method results in the approximation value 0.94608307. The value calculated for 10 digits 
is 0.9460830704. The order O ((1/8)*) æ 6 - 107* of the error according to (19.92) is verified. 

2. Trapezoidal and Simpson Formulas: From the scheme of the Romberg method it can be got 
directly that for hg = 1/8 the trapezoid formula has the approximation value 0.94569086 and the 


Simpson formula gives the value 0.94608331. 
The correction of the trapezoidal formula by Hermite according to (19.79) results in the value | ~ 


0.30116868 
.945 —— — — = 0.94608301. 
0.94569086 + 61-12 0.9460830 
3. Gauss Formula: By the formula (19.83) we get for 
if 1 il 1 1 
nel fee 5 [cof (5% + 5 taf (5 + >) = 0.94604113; 
1] 1 1 1 T 1 1 
m=2: IA 5 [cof (5% + 5 taf (5 } 5) + cof (5v: - 3] = 0.94608313; 
1f 1 1 1 1 
n=3 1x 5 [oof Ga +5) ef e " 5)] = 0.94608307. 
It can be observed that the Gauss formula results in an 8-digit exact approximation value for n = 3, 
ie., with only four function values. With the trapezoidal rule this accuracy would need a very large 
number (> 1000) of function values. 


Remarks: 

1. Fourier analysis has an important role in integrating periodic functions (see 7.4.1.1, 1., p. 474). The 
details of numerical realizations can be found under the title of harmonic analysis (see 19.6.4, p. 992). 
The actual computations are based on the so-called Fast Fourier Transformation FFT (see 19.6.4.2, 
p. 993). 

2. In many applications it is useful to take the special properties of the integrands under consideration. 
Further integration routines can be developed for such special cases. A large variety of convergence 
properties, error analysis, and optimal integration formulas is discussed in the literature (see, e.g., 


[19.4]). 


3. Numerical methods to find the values of multiple integrals are discussed in the literature (see, e.g., 
(19.26]). 


19.4 Approximate Integration of Ordinary Differential Equations 969 


19.4 Approximate Integration of Ordinary Differential 
Equations 


In many cases, the solution of an ordinary differential equation cannot be given in closed form as an 
expression of known elementary functions. The solution, which still exists under rather general circum- 
stances (see 9.1.1.1, p. 540), must be determined by numerical methods. These result only in particular 
solutions, but it is possible to reach high accuracy. Since differential equations of higher order than one 
can be either initial value problems or boundary value problems, numerical methods were developed 
for both types of problems. 


19.4.1 Initial Value Problems 


The principle of the methods presented in the following discussion to solve initial value problems 


y — f(x,y), vro) = vo (19.93) 
is to give approximate values y; for the unknown function y(x) at a chosen set of interpolation points 
xi. Usually, equidistant interpolation nodes are considered with a previously given step size h: 


zi =zoọ+ih (i=0,1,2,...). (19.94) 
19.4.1.1 Euler Polygonal Method 


An integral representation of the initial value problem (19.93) is given by integration 


v(e) = w+ f fæ yle) dz. (19.95) 
zo 
This is the starting point for the approximation 
woth 
γαι) = yo + / f(x, ν()) dx & yo + hf (xo, yo) = νι, (19.96) 
αρ 


which is generalized as the Euler broken line method or Euler polygonal method: 


Vii — yi hf(xi yi) (6 —0,1,2,... 5 yo) = wo). (19.97) 
For a geometric interpretation see Fig. 19.5. Comparison of (19.96) with the Taylor expansion 


yA 


y(x) ναι) = νίαο +h) 


y" (£) 2 
9 h (19.98) 


yo + f (xo. yo)h + 
with zo < € < xo + h shows that the approx- 
imation y, has an error of order h?. The ac- 
curacy can be improved by reducing the step 
Ys ». Size h. Practical calculations show that halv- 
0 Xo X X2 X3 x ing the step size h results in halving the error 
of the approximations y;. 
h h h Α quick overview of the approximate shape of 
the solution curve can be got by using the Eu- 
Figure 19.5 ler method. 


19.4.1.2 Runge-Kutta Methods 


1. Calculation Scheme 

The equation y'(x) = f (v, y) determines at every point (xo, yo) a direction, the direction of the tangent 
line of the solution curve passing through the point (xo, yo). The Euler method follows this direction 
until the next interpolation node. The Runge-Kutta methods consider more points “between” (xo, yo) 
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and the possible next point (a9+h, yı) of the curve, and depending on the appropriate choice of these ad- 
ditional points more accurate values are obtained for yı. There exist Runge-Kutta methods of different 
orders depending on the number and the arrangements of these "auxiliary" points. Here a fourth-order 
method (see 19.4.1.5, 1., p. 972) is shown. (The Euler method is a first-order Runge-Kutta method.) 


The calculation scheme of fourth order 


for the step τ πρ tor, = zo ch A y k=h- f(x,y) 
to get an approximate value for yı of 
(19.93) is given in (19.99). The further To Yo kı 
steps follow the same scheme. zo + h/2 yo + kı/2 ka 
The error of this Runge-Kutta method zo +h/2 Yo + k2/2 kg (19.99) 
has order A? (at every step) accord- 20 ch Yo + Fa ka 
ing to (19.99), so with an appropriate 1 
choice of the step size high accuracy zı zo h | yi — yod ge + 262 + 2ka + ka) 
can be obtained. 

1, ς : 1 é 
B -= i y?) with y(0) = 0. y(0.5) is determined x y k= gt y?) 
in one step, i.e. h = 0.5 (see the table on the right). The 0 0 0 
exact value for 8 digits is 0.01041860. 0.25 | 0 0.00781250 
2. Remarks 0.25 | 0.00390625 | 0.00781441 
1. For the special differential equation y’ = f(x), this 0.5 | 0.00781441 | 0.03125763 
Runge-Kutta method becomes the Simpson formula (see 
19.3.2.3, p. 965). 0.5 0.01041858 


2. For a large number of integration steps, a change of step size is possible or sometimes necessary. The 
change of step size can be decided by checking the accuracy so that the step is repeated with a double 
step size 2h. If, e.g., the approximate value is y2(h) for y(a9+2h) (calculated by the single step size) and 
y2(2h) (calculated by the doubled step size), then the estimation for the error Ho(h) = y(xo4-2h) — ο (11) 
is 
1 

Πο (1) ~ geh — y2(2h)]. (19.100) 
Information about the implementation of the step size changes can be found in the literature (see 
[19.24]). 
3. Runge-Kutta methods can easily be used also for higher-order differential equations, see [19.24]. 
Higher-order differential equations can be rewritten in a first-order differential equation system (see 
p. 550). Then, the approximation methods are performed as parallel calculations according to (19.99), 
as the differential equations are connected to each other. 


19.4.1.3 Multi-Step Methods 


The Euler method (19.97) and the Runge-Kutta method (19.99) are so-called single-step methods, since 
we start only from y; in the calculation of y;,4. In general, linear multi-step methods have the form 
Vick + Ok-iUipk-i  Qk-2Uick-2 + 77 + 04i + Qi 
= h( Br fire + θε τν a t Bia + Bofi) (19.101) 
with appropriately chosen constants o; and 8; (j = 0,1,...,k; a, = 1). The formula (19.101) is 
called a k-step method if |ag| + |Bo| 4 0. It is called explicit, if 8; = 0, since in this case the values 
fis; = f (£i+j, Yi4;) on the right-hand side of (19.101) only contain the already known approximation 
values yi, Yiti,--- 5 Yitr-1- If y A 0 holds, the method is called implicit, since then the required new 
value yi+x occurs on both sides of (19.101). 
The k initial values yo, y1, ..., yy-1 must be known in the application of a k-step method. These initial 
values can be got, e.g., by one-step methods. 
A special multi-step method to solve the initial value problem (19.93) can be derived if the derivative 


19.4 Approximate Integration of Ordinary Differential Equations 971 


y' (xi) in (19.93) is replaced by a difference formula (see 9.1.1.5, 1., p. 549) or if the integral in (19.95) 
is approximated by a quadrature formula (see 19.3.1, p. 963). 

Examples of special multi-step methods are: 

1. Midpoint Rule The derivative y'(x;+1) in (19.93) is replaced by the slope of the secant line between 
the interpolation nodes x; and 249, i.e.: 


Yi+2 — Yi σα (19.102) 
2. Rule of Milne The integral in (19.95) is approximated by the Simpson formula: 
h 
Yi+2 7 Vi = πο +4 fini + fia). (19.103) 


3. Rule of Adams and Bashforth The integrand in (19.95) is replaced by the interpolation poly- 
nomial of Lagrange (see 19.6.1.2, p. 983) based on the k interpolation nodes £i, z;,1,... , x; 41. Inte- 
grating between z;,,.; and x; results in 

k-1| “tk k-1 


Yitk — Yitk-1 = X / L;(2) da: Ταν, visi) =h M Bif (tity; Vij): (19.104) 


j=0 j=0 


Ti+k-1 


The method (19.104) is explicit for y;,,,. For the calculation of the coefficients 3; see [19.3]. 
19.4.1.4 Predictor-Corrector Method 


In practice, implicit multi-step methods have a great advantage compared to explicit ones in that they 
allow much larger step sizes with the same accuracy. But, an implicit multi-step method usually re- 
quires the solution of a non-linear equation to get the approximation value y;,;. This follows from 
(19.101) and has the form 


k k—1 
Yi+k = hY By fits = Du QjYi+j = F (yi+k). (19.105) 
j-0 1-0 


The solution of (19.105) is an iterative one. The procedure is the following: An initial value y is 
determined by an explicit formula, the so-called predictor. Then it will be corrected by an iteration 
rule 


uu PU) (uc ES (19.106 
which is called the corrector coming from the implicit method. Special predictor-corrector formulas 
are: 

poO _ tg 
- Vier = Vid 12 9 i-2 16fi1 + 2311), (19.107a 
i h E 
WT? = wt qs Cf SARI) (nm 01...) (19.107b 
2. y = yi2 + 9yia — Iyi + OR( fia + f), (19.108a 
h 
vat = wat giat4ht+ fh) (4=0,1,...). (19.108b 


The Simpson formula as the corrector in (19.108b) is numerically unstable and it can be replaced, e.g., 
by 


à / ! 
yt? = 0.9, 4 + 0.1y; + 5 01a 4 6.7f; a +30.7f; + 8.19). (19.109) 


972 19. Numerical Analysis 


19.4.1.5 Convergence, Consistency, Stability 


1. Global Discretization Error and Convergence 
Single-step methods can be written generally in the form: 

Uii = yi thF (ai, yi h) (i —0,1,2,...; yo given). (19.110) 
Here F(x,y, h) is called the increment function or progressive direction of the single-step method. The 
approximating solution obtained by (19.110) depends on the step size and it should be denoted by 
y(x, h). Its difference from the exact solution y(x) of the initial value problem (19.93) is called the global 
discretization error g(x, h) (see (19.111)), and we say: The single-step method (19.110) is convergent 
with order p if p is the largest natural number such that 

g(a, h) = y(a,h) — y(x) = O(h?) (19.111) 
holds. Formula (19.111) says that the approximation y(x, h) determined with the step size h = BM 


n 
converges to the exact solution y(x) for every x from the domain of the initial value problem if  — 0. 
Β The Euler method (19.97) has order of convergence p = 1. For the Runge-Kutta method (19.99) 
p — 4 holds. 

2. Local Discretization Error and Consistency 

The order of convergence according to (19.111) shows how well the approximating solution y(x, h) ap- 
proximates the exact solution y(x). Beside this, it is an interesting question of how well the increment 
function F(z, y, h) approximates the derivative y’ = f(x,y). For this purpose the so-called local dis- 
cretization error I(x, h) (see (19.112)) is introduced. The single-step method (19.110) is consistent with 
order p, if pis the largest natural number with 


y(x +h) — y(x) 


I(x, h) = n — F(v,y, h) = O(h?). (19.112) 
7 
It follows directly from (19.112) that for a consistent single-step method 
lim F(x,y, h) = f(x,y). (19.113) 
ὃς 


Β The Euler method has order of consistency p = 1, the Runge-Kutta method in (19.99) has order of 
consistency p — 4. 
3. Stability with Respect to Perturbation of the Initial Values 
In the practical performance of a single-step method, a rounding error O(1/h) adds to the global dis- 
cretization error O(h”). Consequently, we have to select a not too small, finite step size h > 0. It is 
also an important question of how the numerical solution y; behaves under perturbations of the initial 
values or in the case £x; — oo. 
In the theory of ordinary differential equations, an initial value problem (19.93) is called stable with 
respect to perturbations of its initial values if: 

(ία) — ν(υ)| < [io — νο]. (19.114) 
Here g(x) is the solution of (19.93) with the perturbed initial value (xo) = jo instead of yg. Estimation 
(19.114) tells that the absolute value of the difference of the solutions is not larger than the perturbation 
of the initial values. 


In general, it is hard to check (19.114). Therefore the linear test problem 

y — Ay with y(xo) = yo (A constant, \ < 0) (19.115) 
is considered which is stable, and a single-step method is applied to this special initial value problem. 
A consistent method is called absolutely stable with step size h > 0 with respect to perturbed initial 
values if the approximating solution y; of the above linear test problem (19.115) obtained by using the 
method satisfies the condition 

[ui] < lyol- (19.116) 
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W Applying the Euler polygon method for equation (19.115) results in the solution yi41 = (1 + Ah) yi 
(i = 0,1,...). Obviously, (19.116) holds if |1 + A] € 1, and so the step size must satisfy —2 < AA < 0. 
4. Stiff Differential Equations 
Many application problems, including those in chemical kinetics, can be modeled by differential equa- 
tions whose solutions consist of terms converging to zero exponentially but in a high different kind of 
exponential decreasing. These equations are called stiff differential equations. For example: 

ία) = Cye™® + Cye®* (ΟἹ, C2, X1, Az const) (19.117) 
with λι < 0, Ax < 0 and |Ay| < [ào], e.g., λι = —1, A» = —1000. The term with As does not have a 
significant affect on the solution function, but it does in selecting the step size h for a numerical method. 
In such cases the choice of the most appropriate numerical method has special importance (see [19.23]). 


19.4.2 Boundary Value Problems 


The most important methods for solving boundary value problems of ordinary differential equations 
will be demonstrated on the following simple linear boundary value problem for a differential equation 
of the second order: 

y" (x) + pay (2) + α(α)υ(α) = f(x) (a € x <b) with y(a) =a, y(b) = β. (19.118) 
The functions p(x), q(x) and f(x) and also the constants a and f are given. 

The given method can also be adapted for boundary value problems of higher-order differential equa- 
tions. 


19.4.2.1 Difference Method 
The interval [a, b] is subdivided by equidistant interpolation points x, = xo + vh (v = 0,1,2,...,n; 


20 = a, r, = b) and the values of the derivatives are substituted into the differential equation at the 
interior interpolation points 


y" (av) + p(x,)y (αν) + α(αν)ν(αν) = flav) (v =1,2,...,n-1) (19.119) 
by so-called finite divided differences, e.g.: 


/ n Uv — Vv-1 
(jag 19.120a 
y (a y, 2 ; ( 9 0a) 
vi — 2g, + Yv- 
ite ehe ΣΙ; (19.120b) 
L 


In this way n — 1 linear equations are obtained for the n — 1 approximation values y, ~ y(x,) in the 
interior of the integration interval [a, b], considering the conditions yo = a and y, = β. If the boundary 
conditions also contain derivatives, they must also be replaced by finite expressions. 

Eigenvalue problems of differential equations (see 9.1.3.2, p. 569) are handled analogously. The ap- 
plication of the difference method, described by (19.119) and (19.120a,b), leads to a matrix eigenvalue 
problem (see 4.6, p. 314). 
H The solution of the homogeneous differential equation y" + λυ = 0 with boundary conditions 
y(0) = y(1) = 0 leads to a matrix eigenvalue problem. The difference method transforms the dif- 
ferential equation into the difference equation y,41 — 2y, + yp—1 + h?A7y, = 0. If three interior points 
are chosen, hence h = 1/4, then considering yo = y(0) = 0, y4 = y(1) = 0 the discrete system is 


X? 
(-2+ 5) uic UP = 0, 
2 
ο. ys = 0, 
X 
; ο a = 0. 
n+ ( +) ys = 0 


This homogeneous system of equations has a non-trivial solution only when the coefficient determinant 
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is zero. This condition results in the eigenvalues Aj? = 9.37, A3? = 32 and As? = 54.63. Among them 
only the smallest one is close to its corresponding true value 9.87. 

Remark: The accuracy of the difference method can be improved by 

1. decreasing the step size h, 

2. application of a derivative approximation of higher order (approximations as (19.120a,b) have an 
error of order O(h?)), 

3. application of multi-step methods (see 19.4.1.3, p. 970). 

If the problem is a non-linear boundary value problem, then the difference method leads to a system of 
non-linear equations of the unknown approximation values y, (see 19.2.2, p. 961). 


19.4.2.2 Approximation by Using Given Functions 

The approximate solution of the boundary value problem (19.118) is a linear combination of suitably 
chosen functions g;(z), which are linearly independent and each one satisfies the boundary value con- 
ditions: 


γα) ~ g(x) = » aigi(x). (19.121) 


Substituting g(x) into the differential equation (19.118) results in an error, the so-called defect 
£(2;a3,02,...,a4) = g" (x) + p(x)g (x) + g(x) g(x) — f(x). (19.122) 
Το determine the coefficients a; the following principles (see also p. 978) can be used: 
1. Collocation Method The defect has to be zero at n given points τν, the so-called collocation 
points. 'The conditions 
ε(σν;αι,α»,... ἀπ) 20 (v=1,2,...,n), a«mz«zs«...«z,«b (19.123) 
result in a linear system of equations for the unknown coefficients. 
2. Least Squares Method The integral 
b 
F(a1,05,...,a04) = EC 04, 02, ..., Q4) dT, (19.124) 
a 
depending on the coefficients, should be minimal. The necessary conditions 
OF — 
Oa; — 
give a linear system of equations for the coefficients a;. 
3. Galerkin Method The requirement is that the so-called error orthogonality is satisfied, i.e., 
b 
EGENT 2+, Qn)gi(z)de=0 (¢=1,2,...,n), (19.126) 


a 


0 (6=1,2,...,n) (19.125) 


and in this way a linear system of equations is obtained for the unknown coefficients. 
4. Ritz Method The solution y(x) often has the property that it minimizes the variational integral, 


b 
Il] = [ Heyy) dx (19.127) 
(see (10.4), p. 610). If the function H (x, y, y) is known, then y(x) is replaced by the approximation 
g(x) as in (19.121) and I[y] = I(ai, a2,... , an) is minimized. The necessary conditions 
J 
s =0 (i=1,2,...,n) (19.128) 


result in n equation for the coefficients a;. 
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W Under certain conditions on the functions p, q, f and y, the boundary value problem 
—[p(x)y' (x) + a(zx)y(x) = f(x) with y(a) =a, y(b) = 8 (19.129 


and the variational problem 


Πο = [ ο ο. = mint with y(a) =a, y(6)=8 — (9190 


are equivalent, so H (x, y, y') can be got immediately from (19.130) for the boundary value problem o 
the form (19.129). 
Instead of the approximation (19.121), one often considers 


g(x) = go(x) + Daslo) (19.131 
where go() satisfies ile boundary values and the functions g;(r) satisfy the conditions 
gi(a) = g(b) =0 (i—1,2,...,m). (19.132 
For the problem (19. nd an appropriate choice is, e.g., 
β- 
go(x) = (x — a). (19.133 
—a 


Remark: In a linear boundary value problem, the forms (19.121) and (19.131) result in a linear system 
of equations for the coefficients. In the case of non-linear boundary value problems non-linear systems 
of equations are obtained, which can be solved by the methods given in Section 19.2.2, p. 961. 


19.4.2.3 Shooting Method 


With the shooting method, the solution of a boundary value problem is reduced to the solution of an 
initial value problem. The basic idea of the method is described below as the single-target method. 
1. Single-Target Method 
The initial value problem 
y" + p(x)y 1- α(α)υ = f(x) with y(a) = a, y(a) = s (19.134) 
is associated to the boundary value problem (19.118). Here s is a parameter, from which the solution 
y of the initial-value problem (19.134) depends on, i.e., y = y(x, s) holds. The function y(x, s) satisfies 
the first boundary condition y(a, s) = α according to (19.134). The parameter s should be determined 
so that y(x, s) satisfies the second boundary condition y(b,s) = 8. Therefore, one has to solve the 
equation 
F(s) = y(b, s) — B, (19.135) 
and the regula falsi (or secant) method is an appropriate method to do this. It needs only the values of 
the function F(s), but the computation of every function value requires the solution of an initial value 
problem (19.134) until x = b for the special parameter value s with one of the methods given in 19.4.1. 


2. Multiple-Target Method 
In a so-called multiple-target method, the integration interval [a, b] is divided into subintervals, and we 
use the single-target method on every subinterval. Then, the required solution is composed from the 
solutions of the subintervals, where the continuous transition at the endpoints of the subintervals must 
be ensured. 

This requirement results in further conditions. For the numerical implementation of the multiple-target 
method, which is used mostly for non-linear boundary value problems, see. [19.24]. 


976 19. Numerical Analysis 


19.5 Approximate Integration of Partial Differential 
Equations 

In this section only the principles of numerical solutions of partial differential equations are discussed 

using the example of linear second-order partial differential equations with two independent variables 

with the corresponding boundary or/and initial conditions. 


19.5.1 Difference Method 
A regular grid is considered on the integration domain by the chosen points (x, y,). Usually, this grid 
is chosen to be rectangular and equally spaced: 

v, = To + uh, y, — yo vl. (u,v = 1,2....). (19.136) 
It results in squares for | = h. If the required solution is denoted by u(x, y), then the partial derivatives 
occurring in the differential equation and in the boundary or initial conditions are replaced by finite 
divided differences in the following way, where u,,, denotes an approximate value for the function value 
"(τμ Yv): 


Partial Derivative Finite Divided Difference Order of Error 
Qu 1 1 , 
ay tw ww) rasta = Upu) ΟΥ ap tiere = uy 1v) O(h) or O(h?) 
Ou 1 1 , 

By Cw We) qnn = upv) or aj tuc = Uraaa) O(l) or O(P) 
Pu 1 (19.137) 
andy (ts Yv) ani ene μίνι — Upi wt + Upi) | OCh) 

Pu 1 . 

δαν Yv) ga enu = 2u,, + αμ αν) O(h?) 

Pu 1 , 

a e) gno — Qty t Upy—1) O(P) 


The error order in (19.137) is given by using the Landau symbol O. 
In some cases, it is more practical to apply the approximation 


ὃ “(a y) 56 μεινει — 2p d Uca , ü yita — 2u,, F Up- 
gpa ies "5ο h? | s h? 


with a fixed parameter σ (0 < ø < 1). Formula (19.138) represents a convex linear combination of two 
finite expressions obtained from the corresponding formula (19.137) for the values y = y, and y = y41. 


(19.138) 


A partial differential equation can be rewritten as a difference equation at every interior point of the 
grid by the formulas (19.137), where the boundary and initial conditions are considered, as well. This 
system of equations for the approximation values u,, has a large dimension for small step sizes h and 
l, so usually, it is solved by an iteration method (see 19.2.1.4, p. 960). 


ytuy =0 W A: The function u(x,y) should be the solution of the differential 

equation Au = uz,+tyy = —1 for the points (x, y) with |x| < 1, |y] < 2, 

NI ie., in the interior of a rectangle, and it should satisfy the boundary 
ua, F0} [Uo W, E0 conditions u = 0 for |x| = 1 and |y| = 2. The difference equation corre- 

ua, =0 Ugo q Uy g0 sponding to the differential equation for a square grid with step size h 

-Ἡ 0 T x is: duy, = Uppy + αμνει + αμ ιν + Upy—1 + h?. The step size h = 1 

tug (Fig. 19.6) results in a first rough approximation for the function val- 

4 T ues at the three interior points: 4uo = 0 + 0 + 0 + uoo 1, 4u00 = 


0+ uo, +0 + uo, 1 +1, 4ug, 1 — O + uoo +04+0+1. 


The solution is uoo = : & 0.429, ug = uo, 1 = 2 54 0.357. 


Figure 19.6 14 
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W B: Thesystem of equations arising in the application of the difference method for partial differential 
equations has a very special structure. It is demonstrated by the following example which is a more 
general boundary value problem. The integration domain is the square G: 0 < x < 1,0€ y € I. 
A function u(x, y) should be determined for which Au = Uss + uy, = f(x,y) in the interior of G, 
u(x,y) = g(x,y) on the boundary of G. The functions f and g are given. The difference equation 
associated to this differential equation is, for h — | — 1/n: 


1 
Uni, + Uppt + Usi, + Upu- — 4p = sif (αμ. ge) (u,v —1,2,...,n— 1). 
In the case of n = 5, the left-hand side of this system of difference equations for the approximation 
values u,,,, in the 4 x 4 interior points has the form (19.139) 
—4 1 0 0 1 0 0 0 
1-4 1 0 1 0 ο Un 
0 1-4 1 0 0 uoi 
0.0 1-4 0 1 131 
1 0 0 ο]-ι 10 0| 1 0 0 0 τη 
0 1.0 0 1-4 0 100 Nd 
0.01 0 1-4 1 0 0 tis; 
0 0 0 1 0 —4 0 0 1 » 
= 19.139) 
1 0 0 O//-4 1 0 0| 1000 Uia 
10 0| 1-4 0| 0100 428 
0 0 1-4 1 0.01 0 u33 
0 0 1 0 —4 0 0 0 1 Jas 
U14 
1 0 0 O0O||—4 1 0 0 Ὅρα 
ο 100 1-4 1 0 U34 
0 0 0 1-4 1 Usa 
0 0 1 0 0 1—4 
if the grid is considered row-wise from left to right, and considering that the values of the function are 


given on the boundary. The coefficient matrix is symmetric and is a sparse matriz. This form is called 
block-tridiagonal. It is obvious that the form of the matrix depends on how the grid-points are selected. 
For different classes of partial differential equations of second order, such as elliptic, parabolic and 
hyperbolic differential equations, more effective methods have been developed, and also the convergence 
and stability conditions have been investigated. There is a huge number of books about this topic (see, 


e.g., [19.22], [19.24]). 


19.5.2 Approximation by Given Functions 


The solution u(x, y) is approximated by a function in the form 


u(a,y) = v(z, y) = volz, y) + x av; (x, y). (19.140) 


i-i 
Here, two cases are distinguished: = 


1. υρ(α, y) satisfies the given inhomogeneous differential equation, and the further functions v;(x, y) (i = 
1,2,...,n) satisfy the corresponding homogeneous differential equation (then the linear combination 
has to be found which approximates the given boundary conditions as well as possible). 

2. ουρία, y) satisfies the inhomogeneous boundary conditions and the other functions v;(z, y) (i = 
1,2,...,n) satisfy the homogeneous boundary conditions (then the linear combination has to be found 
which approximates the solution of the differential equation on the considered domain as well as pos- 
sible). 
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In both cases substituting the approximating function v(x, y) from (19.140) in the first case into the 
boundary conditions, in the second case into the differential equation results in an error term, the so- 
called defect: 
E = £(1,y; 03,02, ..., 04). (19.141) 
Το determine the unknown coefficients a; one of the following methods can be applied. 
1. Collocation Method 
The defect € should be zero in n reasonably distributed points, at the collocation points (αν, yp) (v = 
1,2552. 4n) 
E(Ly, Yo} 01,02,...,04) 20. (v — 1,2,...,n). (19.142) 

The collocation points in the first case are boundary points (it is called boundary collocation), in the 
second case they are interior points of the integration domain (it is called domain collocation). 
From (19.142) are obtained n equations for the coefficients. Boundary collocation is usually preferred 
to domain collocation. 
W This method is applied to the example solved in 19.5.1 by the difference method, with the functions 
satisfying the differential equation: 
v(x, y; a1, a2, a3) i? + y?) +a + a(z? — y?) + a3(24 — 6x?y? ἠ- y^). 
The coefficients are determined to satisfy the boundary conditions at the points (z1,y1) = (1,0.5), 
(x2, yo) = (1, 1.5) and (x3, y3) = (0.5, 2) (boundary collocation). The linear system of equations 

0.3125 +a, +0.75a2 0.4375a3 = 0, 

0.8125 +a, — 1.25a2 7.4375a3 = 0, 

1.0625 t ay 3.75a2 t 10.0625a3 = 0 
has the solution αι = 0.4562, ag = —0.2000, a3 = —0.0143. The approximate values of the solution can 
be calculated at arbitrary points with the approximating function. To compare the values with those 
obtained by the difference method: v(0, 1) = 0.3919 and v(0, 0) = 0.4562. 
2. Least Squares Method 
Depending on whether the approximation function (19.140) satisfies the differential equation or the 
boundary conditions, it is required 
1. either the line integral over the boundary C 


I / &(a(t), y(t); αι... an) dt = min, (19.143a) 
(C) 


2 


where the boundary curve C 15 given by a parametric representation 2; = x(t), y = y(t), 
2. or the double integral over the domain G 


f= ni εἶία, y; a1, ..., αὐ) dz dy = min. (19.143b) 
(G) 


From the necessary conditions, 


— 0 (i= 1,2,...,n), n equations are obtained for computing the 
αἱ 
parameters a1, d3,... , An- 


19.5.3 Finite Element Method (FEM) 


After the appearance of modern computers the finite element methods became the most important 
technique for solving partial differential equations. These powerful methods give results which are 
easy to interpret. 

Depending on the types of various applications, the FEM is implemented in very different ways, so 
here only the basic idea is given. It is similar to those used in the Ritz method (sce 19.4.2.2, p. 974) 
for numerical solution of boundary value problems for ordinary differential equations and is related to 
spline approximations (see 19.7, p. 996). 

'The finite element method has the following steps: 
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1. Defining a Variational Problem A variational problem is formulated to the given boundary 
value problem. The process is demonstrated on the following boundary value problem: 


Au = Uge + Uyy = f in the interior of G, u = 0 on the boundary of G. (19.144) 
The differential equation in (19.144) is multiplied by an appropriate smooth function v(x, y) vanishing 
on the boundary of G, and it is integrated over the entire G to get 


I (s * os) vdz dy = ni fv dz dy. (19.145) 
(G) (G) 


Applying the Gauss integral formula (see 13.3.3.1, 2., p. 725), where P(x, y) = —vu, and Q(r, y) = vu; 
are substituted in (13.121), the variational equation from (19.145) 
a(u, v) — b(v) (19.1462) 


is obtained with 


Oudv | ὃι ὃυ 
mes J & Or Oy 2.) dxdy, b(v)— | (19.146b) 
ya 
Guy 
SNSD 
0 x x 
Figure 19.7 Figure 19.8 


2. Triangularization The domain of integration G is decomposed into simple subdomains. Usually, 
a triangularization is used, where G is covered by triangles so that the neighboring triangles have a 
complete side or only asingle vertex in common. Every domain bounded by curves can be approximated 
quite well by a union of triangles (Fig. 19.7). 
Remark: To avoid numerical difficulties, the triangularization should not contain obtuse-angled tri- 
angles. 
W A triangularization of the unit square could be performed as shown in Fig. 19.8. Here one starts 
from the grid points with coordinates x, = uh, y, = vh (μιν = 0,1,2,..., N; h = 1/N). There are 
(N — 1)? interior points. Considering the choice of the solution functions, it is always useful to consider 
the surface elements G, composed of the six triangles having the common point (αμ. y,). (In other 
cases, the number of triangles may differ from six. These surface elements are obviously not mutually 
exclusive.) 
3. Solution A supposed approximating solution is defined for the required function u(x, y) in every 
triangle. A triangle with the corresponding supposed solution is called a finite element. Polynomials 
in 2; and y are the most suitable choices. In many cases, the linear approximation 

ü(x,y) = αι + asx + azy (19.147) 
is sufficient. The supposed approximating function must be continuous under the transition from one 
triangle to neighboring ones, so a continuous final solution arises. 
The coefficients a4, az and ag in (19.147) are uniquely defined by the values of the functions ui, u2 
and ua at the three vertices of the triangle. The continuous transition to the neighboring triangles is 
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ensured by this at the same time. The supposed solution (19.147) contains the approximating values 
u; of the required function as unknown parameters. For the supposed solution, which is applied as an 
approximation in the entire domain G for the required solution u(x, y), 


N-1N-1 


ü(v,y) = Y, Y αμνυμνία, y). 


με v=1 


(19.148) 


is chosen. The appropriate coefficients αμν are determined. The following must be valid for the func- 
tions uj, (v, y): They represent a linear function over every triangle of G, according to (19.147) with 


the following conditions: 


1 fork=p,l=v, 


1. αμνίαν gu) = { 0 at any other grid point of Gv- 


2. αμα.) Ξθ for (x,y) £ Guy. 


Figure 19.9 


Analogously, there is: 


πο)... 


+ 


+ 


h 


-u 


(19.149a) 
(19.149b) 


The representation of u,,(r,y) over Ομ. is shown in 
Fig. 19.9. 

The calculation of upy over Gi, i.e., over all triangles 1 
to 6 in Fig. 19.8 is shown here only for triangle 1: 
Up (®,Y) = αι + ασ + a3 with (19.150) 


1 for z = £p, Y =Y, 


uy(r,y) = 4 0 for z —z, ay — Yo (19.151) 
0 for £= αμ, = Yvi- 


From (19.151) αι = 1 — v,a = 0, a3 = 1/h, follow and 
so for triangle 1 


u(x, y) =1+ (s = v) l (19.152) 


or triangle 2, 
for triangle 3, 
or triangle 4, (19.153) 


for triangle 5, 


or triangle 6. 


4. Calculation of the Solution Coefficients The solution coefficients αμν are determined by the 
requirements that the solution (19.148) satisfies the variational problem (19.146a) for every solution 
is substituted for u(x, y) and u(x, y) for v(x, y) in (19.146a). In this way, a 
linear system of equations 


function uj, i.e., U(x, y 


N-1N-1 


Σ Σ Ora (up, Uki) = blur) 


p=1 νι 


is obtained for the unknown coefficients, where 


Ou, Oui 


(k,1—1,2,...,N — 1) (19.154) 


Ou, Oui 


(Uy, ὑπ) / ( 
Gu 


Ou 


Ox 


Oy Oy 


) dzdy, b(ug) = furi dz dy. (19.155) 


σκι 
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In the calculation of a(ujv, Ux) the fact must be kept in mind that integration is needed only in the 
cases of domains G, and Gy with non-empty intersection. These domains are denoted by shadowing 
in Table 19.1. 


Table 19.1 Auxiliary table for FEM 


Surface Graphical Triangle of | Ouu | ὃαμν > Oui Ou, 
region representation | Gr Gw Em δα Ox Ox 
Tox 0 0 
2 2 —1/h | —1/h 
1 n-k 3 3 —1/h | -1/h 4 
ν-ἰ 4 4 0 0 m 
5 5 /h 1/h 
6 6 /h l/h 
μξκ 15 0| I/h 
Pure 9 4 -Mh| ο Q 
3 =k+1 2 6 —1/h | l/h 2 
ν-ἰ 3 5 —1/h | l/h E 
1.“ k+1 3 1 —1/h 0 0 
ν-ἰηι 4 ϐ 0 | 1/h 
> W=k 4 2 0 | 1/h 0 
” y=l+1 5 1 —1/h 0 
6 μ-[-ι 5.3 l/h | —1/h 2 
ν-ἰ AA 6 2 l/h | —1/h EN 
7 b= k-1 6 4 l/h 0 0 
ν-ἰ-1 n 1 3 0 | —1/h 
The integration is always performed over a triangle with an area h?/2, so for the partial derivatives m 


with respect to x: 


h? 
— (4a — 2a 204 19.156a 
D (dag k+1,l ki) 2 ( ) 
is obtained. Analogously, for the partial derivatives with respect to y the corresponding term is 
1 h? - 
: (4a — 20,111 — 20,3) 2k (19.156b) 
à 
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The calculation of the right-hand side b(ugz) of (19.154) gives: 
μι) = / f(a, y)uii(m, y) ἀν dy πε μιν», (19.157a) 
Gu 


where Vp is the volume of the pyramid over Gy; with height 1, determined by uj; (x, y) (Fig. 19.9). 
Since 


1 Ts , 
Vp = 3 6- PLE the approximation is b(ujj) 54 fuh”. (19.157b) 
So, the variational equations (19.154) result in the linear system of equations 
Aog — O14 — Qk- — Oii — Oi = h° fa (k,l = 1,2,..., N -- 1) (19.158) 


for the determination of the solution coefficients. 


Remarks: 

1. If the solution coefficients are determined by (19.158), then a(x, y) from (19.148) represents an 
explicit approximating solution, whose values can be calculated for an arbitrary point (x, y) from G. 
2. Iftheintegration domain must be covered by an irregular triangular grid then it is useful to introduce 
triangular coordinates (also called barycentric coordinates). In this way, the position of a point can 
be easily determined with respect to the triangular grid, and the calculation of the multidimensional 
integral is made easier as in (19.155), because every triangle can be easily transformed into the unit 
triangle with vertices (0, 0), (0, 1), (1,0). 
3. If accuracy must be improved or also the differentiability of the solution is required, then piecewise 
quadratic or cubic functions must be applied to obtain the supposed approximation (see, e.g., [19.22]). 
4. In practical applications, usually systems of huge dimensions are obtained. This is the reason why 
so many special methods have been developed, e.g., for automatic triangularization and for practical 
enumeration of the elements (the structure of the system of equations depends on it). For detailed 
discussion of FEM see [19.13], [19.7], [19.22]. 


19.6 Approximation, Computation of Adjustment, Harmonic 
Analysis 
19.6.1 Polynomial Interpolation 


The basic problem of interpolation is to fit a curve through a sequence of points (ην. yy) (v = 0,1, 
.,n). This can happen graphically by any curve-fitting gadget, or numerically by a function g(x), 
which takes given values y, at the points x, , at the so-called interpolation points. That is g(x) satisfies 
the interpolation conditions 
σία) = y, (v—0,1,2,...,n). (19.159) 
In the first place, polynomials are used as interpolation functions, or for periodic functions so-called 
trigonometric polynomials. In this last case one talks about trigonometric interpolation (see 19.6.4.1, 
2., p. 992). There are n + 1 interpolation points, the order of the interpolation is n, and the highest 
degree of the interpolation polynomial is at most n. Since with increasing degree of the polynomials, 
strong oscillation may occur, which is usually not required, the interpolation interval can be decom- 
posed into subintervals and a spline interpolation (see 19.7, p. 996) can be performed. 


19.6.1.1 Newton's Interpolation Formula 
To solve the interpolation problem (19.159) a polynomial of degree n is considered in the following form: 
g(x) = p.(x) = ao + aa(z — αρ) + as(z — αρα — αι) +-+- 
+an(x — xo)(z — σι)... (2 — 24a). (19.160) 
'This is called the Newton interpolation formula, and it gives an easy calculation of the coefficients 
à; (i — 0,1,...,n), since the interpolation conditions (19.159) result in a linear system of equations 
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with a triangular matrix. 


W For n = 2 one gets the annexed sys- 

tem of equations from (19.159). The inter- ρο(ζο) = ao = Yo 
polation polynomial p,(x) is uniquely de- — po(z1) = ao + a4 (x1 — xo) —u 
termined by the interpolation conditions pə(£2) = ao + a1(1a — xo) + as(xa — xo)(xa — τι) = Yo 


(19.159). 
The calculation of the function values can be simplified by the Horner schema (see 19.1.2.1, p. 952). 


19.6.1.2 Lagrange's Interpolation Formula 


A polynomial of n-th degree can be fitted through n + 1 points (ην, y,) (v = 0,1,...,n), with the 
Lagrange formula: 
g(x) = »ε(α) = L yu Lux). (19.161) 
μ-ο 


Here L,(x) (u = 0,1,...,n) are the Lagrange interpolation polynomials. Equation (19.161) satisfies 
the interpolation conditions (19.159), since 


Ae _ Ji foras, 
Ly(ty) = ὃμν = ta for μάν, (19.162) 
Here ὃμν is the Kronecker symbol. The Lagrange interpolation polynomials are defined by the formula 
f (r-zo)r—-z:1)-:(y—z,a)z —z1,4): (25) _ qr 2i (19.163) 
(αμ = 20) (sp — 21): (zu — Ly-1) (Lp — Όμει)''' (Lp — Ln) 2 Ly By 
vee 


W A polynomial is fitted through the points given by the table 7 : : 2 : 


The Lagrange interpolation formula (19.161) is used: 
(r—1)2—3) 1 


Lo(x) = (- ΠῚ = git 1)(a — 3), 
π-- 2 -- 1 

Li(x) = (i9 mE = σαία 3) 
z—0)r—1 1 

ll) = gB) ο -U 


5 x d 
po(z) = 1: Lo(z) 4-3: Τη(α) +2: L(x) = — 32" + ria +1. 


The Lagrange interpolation formula depends explicitly and linearly on the given values y, of the func- 
tion. This is its theoretical importance (see, e.g., the rule of Adams-Bashforth, 19.4.1.3, 3., p. 971). 
For practical calculation the Lagrange interpolation formula is rarely reasonable. 


19.6.1.3 Aitken-Neville Interpolation 


In several practical cases, the explicit form of the polynomial p, (x) is not needed, but only its value at 
a given location x of the interpolation domain. These function values can be obtained in a recursive 
way developed by Aitken and Neville. The useful notation 


Pala) = Po,1,....n(), (19.164) 
is applied in which the interpolation points xo, z1,..., x, and the degree n of the polynomial are de- 
noted. Notice that 


Pointe) = (x — 20)pis...n (2) — (x — Ln)Po,1,2,... nim) (19.165) 


Ln — To 
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i.e., the function value po3,.,,(r) can be obtained by linear interpolation of the function values of 
P12, n(x) and poj 5... 1(x), two interpolation polynomials of degree < n — 1. Application of (19.165) 
leads to a scheme which is given here for the case of n — 4: 


To | Yo = Po 
Tı | Yı = Pı Po 
πο | Yo = po pi2 Doi2 (19.166) 


T3 | Y3 = P3 P23 P123 P0123 
X4 | Ya = pa paa posa P1234 Por234 = Pa (T). 


The elements of (19.166) are calculated column-wise. A new value in the scheme is obtained from its 
west and north-west neighbors 


(r— 13)p3 — (x — va)p» Tc— T3 


poa = Ps + — (pa — po), (19.167a 
X3 — 22 T3 — 22 
2 — £1)po3 — (£ — T3)pıs 2 — T: 
P123 = ( pas C pia pagi — = (pos — p12), (19.167b 
£3 — Tı 24 — Tı 
2 — T1)p234 — (£ η 2 -- T4 
Pi234 = ( 1)pasa - ( n 234 + (p234 — pios). (19.167c 
L4— £1 L4— Tı 


For performing the Aitken-Neville algorithm on a computer only a vector p with n+ 1 components (see 
(19.4]) is introduced, which takes the values of the columns in (19.166) after each other according to 
the rule that the value p; 4; iii, (i = k, k+1,...,n) of the k-th column will be the i-th componen 
pi of p. The columns of (19.166) must be calculated from down to the top, so p contains all necessary 
values. The algorithm has the following two steps: 

1. For i—0,1,...,n set p; = yi. (19.168a 


TTi (pi— pi1).(19.168b 


2. For k=1,2,...,n and for i=n,n—1,...,k compute p; = pi + - 


£i — πε κ 


After finishing (19.168b) we have the required function value p, (x) at x in element pn. 


19.6.2 Approximation in Mean 

The principle of approximation in mean is known as the Gauss least squares method. In calculations 
continuous and discrete cases are distinguished. 

19.6.2.1 Continuous Problems, Normal Equations 

The function f(x) is approximated by a function g(x) on the interval [a, b] so that the expression 


F = [ w(a)[f(c) -- gle)? dz, (19.169) 


depending on the parameters contained by g(x), should be minimal. w(x) denotes a given weight func- 
tion, such that w(x) > 0 in the integration interval. 
If the best approximation g(x) is supposed to have the form 


πο ΣΩ (19.170) 
i=0 
with suitable linearly independent functions go(x), gi(x), ... , g9a(x), then the necessary conditions 
OF 
=0 (i=0,1,...,n) (19.171) 
0a; 


for an extreme value of (19.169) result in the so-called normal system of equations 
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n 


Magi gy) = (fg) (k =0,1,... n) (19.172) 
i=0 
to determine the unknown coefficients a;. Here the brief notations 
b 


(gi. gi.) zx n oX(x)gi(x)gs (v) dz, (19.173a 


(f gi) - fuos x glede (i,k =0,1,...,n) (19.173b 


are used, which are considered as the scalar products of the two indicated functions. 

The system of normal equations can be solved uniquely, since the functions go(x), g(x), ..., Gn(x 
are linearly independent. The coefficient matrix of the system (19.172) is symmetric, so the Cholesky 
method (see 19.2.1.2, p. 958) can be applied. The coefficients a; can be determined directly, withou 
solving the system of equations, if the system of functions g;(x) is orthogonal, that is, if 


(gi, gx) — 0 for i Z k. (19.174 
It is called an orthonormal system, if 
(Gi, gr) = { ΝΤ (19.175 
With ii the normal equations (19.172) are reduced to 
Ξ (1,6) (¢=0,1,...,n). (19.176 
ΠΝ independent αρ —— can be orthogonalized. From the power functions g;(r) = 2" 
(i = 0, 1,..., n), depending on the weight function and on the interval, the orthogonal polynomials in 


'Table 19.2 can be obtained. 
Table 19.2 Orthogonal polynomials 


| ία, b] w(x) | Name of the polynomials | see p. | 
[-1, 1] 1 Legendre polynomial P,(x) | 566 
1 
—1,1 —— | Chebysev polynomial T; (x 989 (19.177) 
Pil | ae ysev poly: (x) 
[0, oc) e Laguerre polynomial L(x) | 568 
(—00,00) | e=?/2 | Hermite polynomial H,(x) | 568 


These polynomial systems can be used on arbitrary intervals: 
1. Finite approximation interval. 
2. Approximation interval infinite at one end, e.g., in time-dependent problems. 
3. Approximation interval infinite at both ends, e.g., in stream problems. 
Every finite interval [a, b] can be transformed by the substitution 
b+a 3: b—a 
2 2 
into the interval [—1, 1]. 
19.6.2.2 Discrete Problems, Normal Equations, Householder's Method 


Let N pairs of values (x,, yy) be given, e.g., by measured values. A function g(x) has to be determined, 
whose values g(x,) differ from the given values y, in such a way that the quadratic expression 


t= 


t (x € [a,b], t € [-1,1]) (19.178) 


F= ΣῈ y» — ο(αν)] (19.179) 


v=1 
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is minimal. The value of F depends on the parameters contained in the function g(x). Formula (19.179) 
represents the classical sum of residual squares. The minimization of the sum of residual squares is called 


OF 
the least squares method. From the assumption (19.170) and the necessary conditions aa = 0 (i= 


αι 
0,1,...,n) for a relative minimum of (19.179) for the coefficients the normal system of equations is 
obtained: 
» ailgign] = [yg] (kK =0,1,...,n). (19.180) 
i=0 


Here the Gaussean sum symbols are used in the following notation: 


N 
[sign] = $ mer) o), (19.181a) 
N 
[ugr] = 2. υνθκίων) (i,k =0,1,...,n). (19.181b) 


Usually, n «& N. 

E For the polynomial g(a) = ao + aiz +--+ + anx”, the normal equations are ao[z] + ai [ο Τη +--+ 
αμ[αντη] = [xy] (k = 0,1,...,n) with [z*] 2 DX a ανν, [x9] = N, [ety] = DX ay, [y] = 
Dd ys. The coefficient matrix of the normal system of equations (19.180) is symmetric, so for the 
numerical solution the Cholesky method can be applied. 

The normal equations (19.180) and the residue sum square (19.179) have the following compact form: 


GTGa-Gly, F-(y-Ga)'(y- Ga) with (19.1822) 
golt) gm) gsQu) -.. gm) yı ao 
ο.) οι) ορ) ... gn(T2) Y2 αι 

G= | θο(ζα) glzs)  ge(va) ... Gn (ws) y=|% a= | οὐ (19.182b) 
golan) alan) (aw) ... I(EN) YN an 

If, instead of the minimalization of the sum of residual squares, one wants to solve the interpolation 


problem for the N points (x,, y,), then the following system of equations should be solved: 

Ga=y. (19.183) 
This system of equations is over-determined in the case of n < N — 1, and usually it does not have any 
solution. The equations (19.180) or (19.182a) are obtained by multiplying (19.183) by GT. 
From a numerical viewpoint, the Householder method (see 4.5.3.2, 2., p. 314) is recommended to solve 
equation (19.183), and this solution results in the minimal sum of residual squares (19.179). 


19.6.2.3 Multidimensional Problems 
1. Computation of Adjustments 


Suppose that there is a function f (z1, x9, ..., 2,4) of n indepen- τι eh) 40 a) 
dent variables x1, £2, ... , x. Its explicit form is not known; M ) O TN) 
T3 


only N substitution values f, are given, which are, in general, 
measured values. These data can be written in a table (see : ae. : 

(19.184)). In | ας) ας)... ο) 
The formulation of the adjustment problem is clearer by intro- 
ducing the following vectors: J | h fa... fiw 


(19.184) 
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x -(αιιαο,...,σω)᾽ : Vector of n independent variables, 
x = (x 2”,...,2¢)" + Vector of the v-th interpolation node (v = 1,..., N), 
fo = (fis fopi fU : Vector of the N function values at the N interpolation nodes. 
f (11,22, ..., v5) = f(x) is approximated by a function of the form 
m 
g(21,92,..., 24) = M aigi(ni, ον En: (19.185) 
i=0 
Here, the m + 1 functions g;(z1, 2, ..., &n) = g;(X) are suitable, selected functions. 


ον) Ξαρ-αιτι + α921 ++++ + s. 
W B: Complete quadratic approximation with three variables: 
glz, T2, 3) = Qo + AX + 523 + A3X3 + 0421? + 5.02" + αρα + A721 LQ + ακτι η  αο Το. 


y 2 
The coefficients are chosen to minimize ΣΥ [ fu-g (ay, αἴ ) M a?)] | 


v=1 


ΒΑ: Linear approximation by n variables: g(1, 22, . 


2. Normal System of Equations 
Analogously to (19.182b) the matrix G is formed in which the interpolation nodes z, are replaced by 


vectorial interpolation nodes x) (v = 1,2,..., N). To determine the coefficients, the normal system 
of equations 

G7TGa = Gf (19.186) 
can be used or the over-determined system of equations 

Ga =f. (19.187) 


W For an example of multidimensional regression see 16.3.4.3, 3., p. 842. 


19.6.2.4 Non-Linear Least Squares Problems 

The main idea is discussed for a one-dimensional discrete case. The approximation function g(a) de- 
pends non-linearly on certain parameters. 

WB A: g(x) = age?* + age". This expression does not depend linearly on the parameters αι and aa. 
Β D: g(x) = ape" cos aor. This function does not depend linearly on the parameters αι and a. 


The fact that the approximation function g(x) depends on a parameter vector a = (ao, a4, ...,a,,) is 
indicated by the notation 
g= 9(x,a) = g(x; ao, a1, iiz απ). (19.188) 


Suppose, N pairs of values (αν. yy) (v = 1,2,..., N) are given. To minimize the sum of residual squares 


N 
Ml — g(zy; a9, Q1,...,a4)]? = F(ag,a1,..., an) (19.189) 
wok 


the necessary conditions — = 0 (i = 0,1,...,n) lead to a non-linear normal equation system which 


2 
must be solved by an iterative method, e.g., by the Newton method (see 19.2.2.2, p. 962). 
Another way to solve the problem, which is usually used in practical problems, is the application of 
the Gauss-Newton method (see 19.2.2.3, p. 962) given for the solution of the non-linear least squares 
problem (19.24). The following steps are needed to apply it for this non-linear approximation problem 
(19.189): 


1. Linearization of the approximating function g(x, a) with the help of the Taylor formula with respect 


to a;. To do this, the approximation values a (i — 0, 1,...,n) are needed: 


g(z, a) ~ δ(υ, α) = g(z, αὐ) + 5 2 ia, a(9)(a; — af? ). (19.190) 


i=0 i 
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2. Solution of the linear minimum problem 


» — j(r,,a)]? = min! (19.191 


with Tw help of the normal equation system 
G'GAa = GTAy (19.192 


or by the Householder method. In (19.192) the components of the vectors Aa and Ay are given as 


Aa; = aj — a (i =0,1,2,...,n) and (19.193a. 
Ay, = y, — g(x,, a9) (v —1,2,..., N). (19.193b 
The matrix G can be determined analogously to G in (19.182b), where g;(«,) are replaced by 
99 ας aO) (i=0,1,...,n; v2 12,..., N). 
Oa; 
3. Calculation of a new approximation 
al) = al + Na; or a® = al + Aa; (20,152) 225), (19.194) 


where 7 > 0 is astep length ως. 

E n 0 i n : 
By repeating steps 2 and 3 with al? instead of αὖ), ) etc. a sequence of approximation values is ob- 
tained for the required parameters, whose conver gence strongly depends on the accuracy of the initial 
approximations. The value of the sum of residual squares can be reduced with the introduction of the 
multiplier y. 


19.6.3 Chebyshev Approximation 


19.6.3.1 Problem Definition and the Alternating Point Theorem 
1. Principle of Chebyshev Approximation 


Chebyshev approximation or uniform approximation in the continuous case is the following: The func- 
tion f(x) has to be approximated in an interval a € x < b by the approximation function g(x) = 
g(x; do, 01, ..., an) so that the error defined by 


max |f (x) — g(x; ao, a1,...,a4)| = P(ao, a1,..., a4) (19.195) 
should be as small as possible for the appropriate choice of the unknown parameters a; (i = 0,1,...,m). 


If there exists such an approximating function for f(x), then the maximum of the absolute error value 
will be taken at least at n+ 2 points x, of the interval, at the so-called alternating points, with changing 
signs (Fig. 19.10). This is actually the meaning of the alternating point theorem for the characteriza- 
tion of the solution of a Chebyshev approximation problem. 


y 


xy 


a) b) 
Figure 19.10 


E Ifthe function f(x) = x” is approximated on the interval [—1, 1] by a polynomial of degree < n — 1 
in the Chebyshev sense, then the Chebyshev polynomial T; (x) is obtained as an error function whose 
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maximum is normed to one. The alternating points, being at the endpoints and at exactly n — 1 points 


in the interior of the interval, correspond to the extreme points of T; (x) (Fig. 19.11α 0. 
τὴ, T6) τὴ, 
x M x 
» » » 
1 0 1 -1 0 T -1 0 1 
-1 -1 -1 
a) b) c) 
ΤΟ Τ-(χ)} Το) 
x x x 
> > -p 
-1 0 1 -1 0 1 -1 0 1 
ΕἼ -1 -1 


d) e) f) 
Figure 19.11 
19.6.3.2 Properties of the Chebyshev Polynomials 


1. Representation 


T,(x) = cos(n arccos x), (19.196a. 
1 n n 
Emo EN El τα 
Τι(ω) = 5 [6 Ενα 1) (a 1 1) ] (19.196b 
4. fcosnt, x = cost fo|rx| «1, , - : 
qn) = ee nt, x = cosht for |α] > 1 . (19.196c 
2. Roots of T, (x) 
2u—1 
Ly, = cos ae (ncs EA (19.197 
2n 
3. Position of the extreme values of T, (x) for x € [—1, 1] 
νπ 
αν =cos— (v=0,1,2,...,n). (19.198 
n 


4. Recursion Formula 

Tri = 2x T4, (x) —Tp-i(t) (n=1,2,...; To(z) =1, Τι(α) = x). (19.199 
This recursion results in e.g. 

To(x) = 2a? — 1, Τη(α) = 4x? — 32, (19.200a 
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Τι(α) = 8x5 — 8r? +1, Τι(α) = 162? — 20x? + δα, (19.200b) 
Το(α) = 3215 — 48x* + 182? — 1, (19.200c) 
Tr(a) = 64r" — 112x? + 562? — Τα, (19.200d) 
Τε(α) = 128x5 — 256x + 160x* — 32x? + 1, (19.200€) 
Το(α) = 256x9 — 576x" + 432x? — 1202? + θα, (19.200f) 
Τιο(α) = 5125 — 1280a° + 112029 — 400x* + 502? — 1. (19.200g) 


19.6.3.3 Remes Algorithm 


1. Consequences ofthe Alternating Point Theorem 
The numerical solution of the continuous Chebyshev approximation problem originates from the alter- 
nating point theorem. The approximating function is chosen as 


= Y agla) (19.201) 


i=0 
with n + 1 linearly independent known functions, and the coefficients of the solution of the Chebyshev 
problem are denoted by a;* (i = 0,1,...,n) and the minimal deviation according to (19.195) by o = 
P(ao*,a1*,...,@n*). In the case when the functions f and g; (i = 0,1,...,n) are differentiable, from 
the alternating point theorem one has 
n n 

So αι α(αν) + (-1"o— f(x), Soai*gi(av) = ων) (v =1,2,...,n +2). (19.202) 

i=0 i=0 
The nodes x, are the alternating points with 

ασκπι«σος...« πιο € b. (19.203) 


The equations (19.202) give 2n + 4 conditions for the 2n + 4 unknown quantities of the Chebyshev 
approximation problem: n + 1 coefficients, n + 2 alternating points and the minimal deviation o. If 
the endpoints of the interval belong to the alternating points, then the conditions for the derivatives 
are not necessarily valid there. 
2. Determination of the Minimal Solution according to Remes 

According to Remes, one proceeds with the numerical determination of the minimal solution as follows: 
1. An approximation of the alternating points x, (v =1,2,..., n 4- 2) are determined according to 
(19.203), e.g., equidistant or as the positions of the extrema ‘of Trail |) (see 19.6.3.2, p. 988). 


2. The linear system of equations 


5 agila, ®) + (-1"o = f(a) (v 9 1,2,...,n2) 


i=0 
is solved and the solutions are the approximations a; (i = 0,1,...,n) and go. 
3. A new approximation of the alternating points αμ) (v = 1,2,...,n + 2) is determined, e.g., as 


n 
positions of the extrema of the error function f(r) — Y; ajg;(x). Now, it is sufficient to apply only 
i=0 


approximations of these points. 

By repeating steps 2 and 3 with αν) and a;“) instead of z, and a;, etc. a sequence of approx- 
imations is obtained for the coefficients and the alternating points, whose convergence is guaranteed 
under certain conditions, which can be given (see [19.25]). The calculations are stopped if, e.g., from 
a certain iteration index p 


lou] = max |f(x) -X ai gi(z) (19.204) 


i=0 
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holds with a sufficient accuracy. 


19.6.3.4 Discrete Chebyshev Approximation and Optimization 


From the continuous Chebyshev approximation problem 


H(z) - F agla) 


i=0 


= min! (19.205 


max 
a<r<b 


the corresponding discrete problem can be got, if requiring N nodes z, (v = 1,2,..., N; N 2 n42 
are chosen with the property a < x, «πο. < ry € b and requiring 


ax ον) — igi(x,)| = min! . ; 
ENS 2, agn) min (19.206 


The substitution 


y= max |f(r,) — Y, aigi(v,)|. (19.207 
ν-1.5,....Ν xum 
has obviously the consequence 
(ων) — ΣΣ αιφι(αν) κα (w=1,2,..:,N). (19.208 
i=0 


Eliminating the absolute values from (19.208) a linear system of inequalities is obtained for the coeffi- 
cients a; and y, so the problem (19.206) becomes a linear programming problem (see 18.1.1.1, p. 909): 


yt Sage.) > fŒ), 
αν (v =1,2,...,.N). (19.209) 


y= min! subject to 


A » aigi(xy) 2 —f (xy) 


Equation (19.209) has a minimal solution with y > 0. For a sufficiently large number N of nodes and 
with some further conditions the solution of the discrete problem can be considered as the solution of 
the continuous problem. 


n 
If instead of the linear approximation function g(x) = X a;g;(x) a non-linear approximation function 
i=0 


glx) = g(x; ap, 41,...,@,) is used, which does not depend linearly on the parameters ao, @1,..., an, 
then analogously a non-linear optimization problem is obtained. It is usually non-convex even in the 
cases of simple function forms. This essentially reduces the number of numerical solution methods for 
non-linear optimization problems (see 18.2.2.1, p. 926). 
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19.6.4 Harmonic Analysis 


A periodic function f(x) with period 27, which is given formally or empirically, should be approximated 
by a trigonometric polynomial or a Fourier sum of the form 
a 0 n 
g(x) = * + Σ (αι cos ka + by sin kx), (19.210) 
k=1 

where the coefficients αρ, ag and bg are unknown real numbers. The determination of the coefficients is 
the topic of harmonic analysis. 
19.6.4.1 Formulas for Trigonometric Interpolation 


1. Formulas for the Fourier Coefficients 


Since the function system 1, cos kx, sin kr. (k = 1,2,...,n) is orthogonal in the interval [0,27] with 
respect to the weight function w = 1, the formulas for the coefficients are obtained as 
1 2π 1 2π 
ak = — [| f(x) cos κα dr, by = = | f(a) sin ke de (k = 0,1,2,...,n) (19.211) 
π π 
0 0 


by applying the continuous least squares method according to (19.172). The coefficients a; and bp 
calculated by formulas (19.211) are called Fourier coefficients of the periodic function f(x) (see 7.4, 
p. 474). 

If the integrals in (19.211) are complicated or the function f(x) is known only at discrete points, then 
the Fourier coefficients can be determined only approximately by numerical integration. 


Using the trapezoidal formula (see 19.3.2.2, p. 964) with N + 1 equidistant nodes 


2 
z,—-vh (v=0,1,...,N), h= F (19.212) 
the approximation formula 
2 N ο Ν 
ak 54 ἂχ = N » f(a) coskay, bk e bk = N M f(a,)sinka, (k=0,1,2,...,n) (19.219) 
“Ἔ ws TU wes 


is obtained. The trapezoidal formula becomes the very simple rectangular formula in the case of peri- 
odic functions. It has higher accuracy here as a consequence of the following fact: If f(x) is periodic 
and (2m + 2) times differentiable, then the trapezoidal formula has an error of order O(h?"'*?), 


2. Trigonometric Interpolation 

Some special trigonometric polynomials formed with the approximation coefficients ἄμ and b, have 
important properties. Two of them are mentioned here: 

1. Interpolation Suppose N = 2n holds. The special trigonometric polynomial 


n—1l 


1 - 1 
q(x) = PL + ΣΩ. cos kx + by sin kx) + gan cos 13; (19.214) 
k=1 


with coefficients (19.213) satisfies the interpolation conditions 
σι(αν) = f(x.) (v= 2N) (19.215) 
at the interpolation nodes x, (19.212). Because of the perodicity of f(x) f(xo) = f(xy) holds. 
2. Approximation in Mean Suppose N = 2n. The special trigonometric polynomial 
1 m 


ζο(α) = 340 + Y (a& cos ka + b, sin Κα) (19.216) 
ki 
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for m < n and with the coefficients (19.213) approximates the function f(x) in discrete quadratic mean 
with respect to the N nodes x, (19.212), that is, the residual sum of squares 


N 
ΕΞ» [f(z») -- δο(ωω)]} (19.217) 
νε 
isminimal. The formulas (19.213) are the originating point for the different ways of effective calculation 
of Fourier coefficients. 


19.6.4.2 Fast Fourier Transformation (FFT) 


1. Computation costs of computing Fourier coefficients 
The sums in the formulas (19.213) also occur in connection with discrete Fourier transformation, e.g., 
in electrotechnics, in impulse and picture processing. Here N can be very large, so the occurring sums 
must be calculated in a rational way, since the calculation of the N approximating values (19.213) of 
the Fourier coefficients requires about N? additions and multiplications. 
For the special case of N = 2?, the number of multiplications can 

be largely reduced from N?(= 2°) to pN (= p2?) with the help of p 
the so-called fast Fourier transformation FFT. The magnitude of 
this reduction is demonstrated on the example on the right-hand 
side. 


N? pN 
~ 101 


~ 107 


(19.218) 


By this method, the computation costs and computation time are reduced so effectively that in some 
important application fields even a smaller computer is sufficient. 

The FFT uses the properties of the N-th unit roots, i.e., the solutions of equation z'" = 1 to a successive 
sum up in (19.213). 
2. Complex Representation of the Fourier Sum 
The principle of FFT can be described fairly easily if the Fourier sum (19.210) is rewritten with the 
formulas 


7 νο 1 ikr —ika ἃ E i —ikr ika 
cos ka = 7 (e +e ) . sinka = 5 (ο —e ) (19.219 
into the complex form 
il z 1 [αι — idk ike Ge dby ika 
g(x) = zao + Σ (ap cos ka + b; sin Κα) = 2ag + S> LL ον... Mk the (19.220 
2 k=1 2 kci 2 2 
By substitution 
ap — ibi. 1 zm —ika 
üt ae (19.221a) because of (19.311) cj = on f(aje"™ dz, (19.221b 
m Jo 
so (19.220) becomes the complex representation of the Fourier sum: 
g(x) = 3 cpet with cp = σι. (19.222 
k-—n 
If the complex coefficients cp are known, then the required real Fourier coefficients can be got in the 
following simple way: 
ao = 2c, aj —2Re(cy), by = —2Im(c) (k-—1,2,...,n). (19.223 


3. Numerical Calculation of the Complex Fourier Coefficients 
For the numerical determination of c; the trapezoidal formula can be applied for (19.221b) analogously 
to (19.212) and (19.213), and the discrete complex Fourier coefficients ὄν are obtained: 
]N : N-1 
& — M flee" = V fw (k=0,1,2,...,n) with (19.224a) 


v=0 v=0 
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Si. q 585 ο... Ig 19.2241 
fo = wf), ty = (v=0,1,2,...,N-—1), wn=e . (19. ») 
Relation (19.224a) with the quantities (19.224b) is called the discrete complex Fourier transformation 
of length N of the values f, (v = 0,1,2,..., N — 1). 
The powers why = z (v = 0,1,2,..., N — 1) satisfy equation 2N = 1. So, they are called the N -£h unit 
roots. Since e?" = 1, 
ωΝ --1, ωΝ ΤΙ awh, wk? -wh... (19.225) 


The effective calculation of the sum (19.224a) uses the fact that a discrete complex Fourier transfor- 
N 

mation of length N = 2n can be reduced to two transformations with length oed in the following 

way: 

a) For every coefficient ci with an even index, i.e., k = 21, 


2n-1 n—l n—l 


Cy = > ων’ = DL [FoR F five | = VE [fv + fas] UN" (19.226) 
v=0 v=0 v=0 
holds. Here the equality 2? = ww” = wl” is used. 
Substituting 
V» = fo + fav (v=0,1,2,...,n-1) (19.227) 
and considering that w, = wn, the sum 
n-i 
ča= J yw’ (v=0,1,2,...,n-1) (19.228) 
v=0 
is the discrete complex Fourier transformation of the values y, (v = 0,1,2,...,n — 1) with length 
= N 
n=>- 
b) For every coefficient &j with an odd index, i.e., with k = 21 + 1 
2n—1 n—1l 
Cy = 5D foy = Σ [60 = ινών] ων. (19.229) 
v=0 v=0 


is obtained analogously. Substituting 
Όπεν = (fo — Γπιν)ων (v =0,1,2,...,n— 1) (19.230) 


and considering that wł, = wn, the sum 


n—1l 
ὄρ = S ya! (v —0,1,2,...,n — 1) (19.231) 
v=0 
is the discrete complex Fourier transformation of the values y44, (v = 0,1,2,...,n — 1) with length 
N 
n = —. 
2 


The reduction according to a) and b), i.e., the reduction of a discrete complex Fourier transformation 
to two discrete complex Fourier transformations of half the length, can be continued if Ν is a power of 2, 
i.e., if N = 2? (pis a natural number). The application of the reduction after p times is called the FFT. 


Since every reduction step requires z complex multiplications because of (19.230), the computation 


cost of the FFT method is 


N N 
FP = 5 log N. (19.232) 
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4. Scheme for FFT 


For the special case N = 8 = 23, the three corresponding reduction steps of the FFT according to 
(19.227) and (19.230) are demonstrated in the following Scheme 1: 


Scheme 1: 
Step 1 Step 2 Step 3 
fo | Yo = fo + fa Yo :— Yo + y» Yo :— Yo + Yı = Co 
filmahtfs Vii M+ ys yi = (yo — Ψι)ω8 — 
fo ys fact fe Y2 = (Yo απ Yo = Y2 + Y3 = & 
fs | Y3 = fa + fr ys :— (yi — ys)wi Ys :— (Yo — ya) = ὃς 
fa | ya = (fo — Sajos Ya = Ya + Yo Ya = Ya + Ys =O 
fs | us = (fi — fs) Ys := Ys + Wr Ys := (Ya — Ys)ws = 65 
fs | Yo = (fa — ]ο)ωᾷ Ye = (Ya — Yow Yo = Yo + yr = 63 
fr | y= (fa — frog Y7 := (ys — Ότ)ωα yr := (Yo — yr)e = Cr 
N =8, n := 4, wg = es | N: 4, n:= 2, w = 02 | N := 2, n := 1, w = w? 


It can be observed how terms with even and odd indices appear. In Scheme 2 (19.233) the structure 


of the method is illustrated. 


Scheme 2: 
5 Čsk 
z EN ση zt { Čsk+4 
2s no 3 { Čsk+2 
AEN X Car (19.233) 
~ C8k+1 ` 
~ Más πε Čsk+5 
πα - το 
Οἱ. > { Car 
(k—0,1,...,7) (k=0,1,2,3) (k—0,1) (k — 0). 
Scheme 3: Index | Step 1 | Step 2 | Step3 Index 
If the coefficients cj are substi- z > " " 
tuted into Scheme 1 and one ro ΠΝ a a fo unm 
considers the binary forms of σι 001, Čo C4 C4 L00 
the indices before step 1 and 5 2 m 5 
after step 3, then it is easy to s ara S 2 - ma 
recognize that the order of the [ο OLL Če C6 Če LLO 
required coefficients can be ob- z 5 " > 
tained by simply reversing the C4 L00 e a a 001, 
order of the bits of the binary Čs LOL C3 65 C5 LOL 
form of their indices. This is 5 " - - 
shown in Scheme 3. 66 LLO (5 65 65 OLL 
Cr LLL Cy Cr Cr LLL 
27? for x = 0, 


Β In the case of the function f(x) = { 2 for0<2<2r 


crete Fourier transformation. N = 8 is chosen. With x, = 


27i a 
e 8 =0.707107(1 — i), ωξ = —i, ωὖ 


π 
το ο} f= 


with period 27, 
1 
8 8 


f (25) (9 —0,1,2..«. 


the FFT is used for the dis- 


0.707107(1 +i) Scheme 4 is got: 
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Scheme 4: Step 1 Step 2 Step 3 
fo = 2.467401 | yo = 3.701102 Yo = 6.785353 Yo = 13.262281 = Cy 
fi = 0.077106 | γι = 2.004763 yı = 6.476928 yı = 0.308425 = δι 
f» = 0.308425 | y2 = 3.084251 y2 = 0.616851 y» = 0.616851 + 2.467402i = č 
fa = 0.693957 | ya = 4.472165 ys = 2.467402i | y3 = 0.616851 — 2.467402i = če 
fa = 1.233701 | y4 = 1.233700 ya = 1.233700 ya = 2.106058 + 5.9568331 = č 
+2.467401i 
fs = 1.927657 | ys = —1.308537(1 — i) | ys = 0.872358 ys = 0.361342 — 1.022031i = 6; 
+3.489432 i 
fe = 2.775826 | ye = 2.467401 i ye = 1.233700 | ye = 0.361342 + 1.022031i = č 
—2.4674011 
fz = 3.778208 | y; = 2.180895(1 + i) y; = —0.872358 | y; = 2.106058 — 5.956833i = č 
+3.489432 i 
= [-4 
From the third (last) reduction step the required real t κ τ = 
: : . à a, = 4212116 bı = —11.913 666 
Fourier coefficients are obtained according to (19.223). e F i $ 
See the riehit-hand.sid az = 1.233 702 b= — 4.934 804 
( Ki ρε» i ye P M — a3 = 0.722684 by — — 2.044 062 
n this example, the general property a1 = 0.616850 b,— 0 
ἕν ες Čr (19.234) 


of the discrete complex Fourier coefficients can be observed. For k = 1, 2,3, it can be observed that 
ὅτ = C1, Cg = Co, C = C3. 


19.7 Representation of Curves and Surfaces with Splines 


19.7.1 Cubic Splines 


Since interpolation and approximation polynomials of higher degree usually have unwanted oscillations, 
it is useful to divide the approximation interval into subintervals by the so-called nodes and to consider 
a relatively simple approximation function on every subinterval. In practice, cubic polynomials are 
mostly used. A smooth transition is required at the nodes of this piecewise approximation. 


19.7.1.1 Interpolation Splines 


1. Definition of the Cubic Interpolation Splines, Properties 
Suppose there are given N interpolation points (x;, fi) (i = 1,2,..., N; xq < £2 < ... xw). The cubic 
interpolation spline S(x) is determined uniquely by the following properties: 
1. S(x) satisfies the interpolation conditions S(v;) = f; (i = 1,2,..., N). 
2. S(x) is a polynomial of degree < 3 in any subinterval [r;, 44] (i = 1,2,..., N — 1). 
3. S(x) is twice continuously differentiable in the entire approximation interval [21, x y]. 
4. S(x) satisfies the special boundary conditions: 
a) ο (αι) = S"(xy) = 0 (we call them natural splines) or 
b) S'(a1) = fi’, S'(xx) = fr’ (fi! and [ν΄ are given values) or 
c) 5(αι) = (αν), in the case of fı = fy, 5(αι) = S'(xy) and ο (σι) = (αν) (they are called 


periodic splines). 
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It follows from these properties that for all twice continuously differentiable functions g(x) satisfying 
the interpolation conditions (σι) = f; (i = 1,2,..., N) 
IN ΖΝ 
[ 8" OP ax < Jor dx (19.235) 
T1 ὧν 
is valid (Holladay's Theorem). Based on (19.235) one can say that S(x) has minimal total curvature, 
since for the curvature & of a given curve, in a first approximation, & ~ S" (see 3.6.1.2, 4., p. 246). 
It can be shown that if a thin elastic ruler (its name is spline) is led through the points (z;, fi) (i = 
1,2,..., N), its bending line follows the cubic spline S(x). 
2. Determination of the Spline Coefficients 
The cubic interpolation spline S(x) for x € [v;, i41] has the form: 


S(x) = Si(x) = aj + bi(x — αι) + ci(x — x)? + di(a zi (i—1,2,..., N — 1). (19.236) 


The length of the subinterval is denoted by h; = x4; — x;. The coefficients of the natural spline can 
be determined in the following way: 


1. From the interpolation conditions we get 

ai= fe (i21,2,...,N — 1). (19.237) 
It is reasonable to introduce the additional coefficient αν = fx, which does not occur in the polynomi- 
als. 
2. The continuity of 5”(α) at the interior nodes requires that 
Cj — Ci-i 

3hi-1 

The natural conditions result in c; = 0, and (19.238) still holds for i = N, if cy = 0 is introduced. 
3. The continuity of S(x) at the interior nodes results in the relation 


dii = (i—2,3,..., N — 1). (19.238) 


ü;—Gj.3  2cii-c Ci 
hia 3 


4. The continuity of ο) (1) at the interior nodes requires that 


bii 


hia (i=2,3,...,N). (19.239) 


Qi+1 — Qi Qi — A-1 
hy hia 


Because of (19.237), the right-hand side of the linear equation system (19.240) to determine the coef- 
ficients c; (i = 2,3,..., N — 1; cq = cy 0) is known. The left hand-side has the following form: 


Ciihia + 2(hii + hi)ci + cia hi = 3 ( 


2(hy + ha) ha C2 
ha 2(hə + ha) ha Oo C3 
h: 2(h3 + ha) hi ο, 
j i n al ο... i (19.241) 
ο hy-2 
hn—2 2(hw—2 + hy) CN-1 


The coefficient matrix is tridiagonal, so the system of equations (19.240) can be solved numerically 
very easily by an LR decomposition (see 19.2.1.1, 2., p. 956). Then all other coefficients in (19.239) 
and (19.238) can be determined with these values c;. 


19.7.1.2 Smoothing Splines 

The given function values f; are usually measured values in practical applications so they have some 
error. In this case, the interpolation requirement is not reasonable. This is the reason why cubic smooth- 
ing splines are introduced. This spline is obtained if in the cubic interpolation splines the interpolation 
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requirements are replaced by 


$ ea +A fror dx = minl. (19.242) 


i=1 i 


The requirements of continuity of S, S” and S" are kept, so the determination of the coefficients is a con- 
strained optimization problem with conditions given in equation form. The solution can be obtained 
by using a Lagrange function (see 6.2.5.6, p. 456). For details see [19.26]. 
In (19.242) A (A > 0) represents a smoothing parameter , which must be given previously. For \ = 0 
the result is the cubic interpolation spline, as a special case. For "large" A the result is a smooth ap- 
proximation curve, but it returns the measured values inaccurately, and for A = oo the result is the 
approximating regression line as another special case. A suitable choice of λ can be made, e.g., by 
computer-screen dialog. The parameter o; (c; > 0) in (19.242) represents the standard deviation (see 
16.4.1.3, 2., p. 851) of the measurement errors, of the values f; (i = 1,2,..., N). 
Until now, the abscissae of the interpolation points and the measurement points were the same as the 
nodes of the spline function. For large N this method results in a spline containing a large number of cu- 
bic functions (19.236). A possible solution is to choose the number and the position of the nodes freely, 
because in many practical applications only a few spline segments are satisfactory. It is reasonable also 
from a numerical viewpoint to replace (19.236) by a spline of the form 

r+2 


S(x) = 2 a; Nia(x). (19.243) 


Here r is the number of freely chosen nodes, and the functions N; 4(x) are the so-called normalized B- 
splines (basis splines) of order 4, i.e., polynomials of degree three, with respect to the i-th node. For 
details see [19.5]. 


19.7.2 Bicubic Splines 
19.7.2.1 Use of Bicubic Splines 


Bicubic splines are used for the following problem: A rectangle R of the x, y plane, given by a € x < b, 
c € y € d, is decomposed by the grid points (;, yj) (i =0,1,...,n; j =0,1,...,m) with 

a = To < T1 «νι < Tn =b, c-yo«y ck «yd (19.244) 
into subdomains Rij, where the subdomain Ri; contains the points (x, y) with x; < x X z;,y; y € 
yj+ı (i=0,1,...,n— 1; j =0,1,...,m — 1). The values of the function f(x,y) are given at the grid 
points 

Frnt) = fa @=0,1,...57 9 = hlm) (19.245) 
A possible simple, smooth surface over R is required which approximates the points (19.245). 


19.7.2.2 Bicubic Interpolation Splines 


1. Properties 
The bicubic interpolation spline S(x, y) is defined uniquely by the following properties: 
1. S(x,y) satisfies the interpolation conditions 
θα) = fg (-0,1,...,τ; 7 =0,1,...,m). (19.246) 
2. S(x,y) is identical to a bicubic polynomial on every Rj; of the rectangle R, that is, 
3 


δία, y) = SG y) = ΣΣ Y aule — αἱ) (y — yj) (19.247) 


k=0 1=0 
on Rij. So, S(x,y) is determined by 16 coefficients, and for the determination of S(x,y) 16-m-n 
coefficients are needed. 
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3. The derivatives 


0S OS OS 


CN 19.248 
Ox’ ὃν Oxdy ( ) 
are continuous on R. So, a certain smoothness is ensured for the entire surface. 
4. S(x,y) satisfies the special boundary conditions: 
os 
a; (tir Y;) = pg for i=0,n; 7 =0,1,...,m, 
ὃς 
gy E) = gy for Ὁ--0,1.....ῃ; j-0,m, (19.249) 
y j 5 
PS 


Irog Co 9) =r; for i—0,n; j =0,m. 


Here pij, qij and rj; are previously given values. 

The results of one-dimensional cubic spline interpolation can be used for the determination of the co- 
efficients αι. 

1. There is a very large number (2n + m + 3) of linear systems of equations but only with tridiagonal 
coefficient matrices. 

2. The linear systems of equations differ from each other only on their right-hand sides. 

In general, it can be said that bicubic interpolation splines are useful with respect to computation cost 
and accuracy, and so they are appropriate procedures for practical applications. For practical methods 
of computing the coefficients see the literature. 


2. Tensor Product Approach 
The bicubic spline approach (19.247) is an example of the so-called tensor product approach having the 
form 
n T 
S(x,y) = 35 Y S aggi(x)h;(v) (19.250) 
i-0 j=0 
and which is especially suitable for approximations over a rectangular grid. The functions g;(r) (i = 
0,1,...,n) and hj(y) (j = 0,1,...,m) form two linearly independent function systems. The tensor 
product approach has the big advantage, from numerical viewpoint, that, e.g., the solution of a two- 
dimensional interpolation problem (19.246) can be reduced to a one-dimensional one. Furthermore, 
the two-dimensional interpolation problem (19.246) is uniquely solvable with the approach (19.250) if 
1. the one-dimensional interpolation problem with functions g;(r) with respect to the interpolation 
nodes zo, z1,..., 2, and 
2. the one-dimensional interpolation problem with functions h;(y) with respect to the interpolation 
nodes Yo, Y1, -- - , Ym 
are uniquely solvable. 
An important tensor product approach is that with the cubic B-splines: 


r+2p+2 


S(2,y) = Y Ð ay Niala) Njalu). (19.251) 


i=1 j=1 


Here, the functions N;4(x) and Nj4(y) are normalized B-splines of order four. Here r denotes the 
number of nodes with respect to x, p denotes the number of nodes with respect to y. The nodes can be 
chosen freely but their positions must satisfy certain conditions for the solvability of the interpolation 
problem. 

The B-spline approach results in a system of equations with a band structured coefficient matrix, which 
is a numerically useful structure. 
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For solutions of different interpolation problems using bicubic B-splines see the literature. 


19.7.2.3 Bicubic Smoothing Splines 

The one-dimensional cubic approximation spline is mainly characterized by the optimality condition 
(19.242). For the two-dimensional case there could be determined a whole sequence of corresponding 
optimality conditions, however only a few special cases make the existence of a unique solution possible. 
For appropriate optimality conditions and algorithms for solution of the approximation problem with 
bicubic B-splines see the literature. 


19.7.3 Bernstein—Bézier Representation of Curves and Surfaces 


1. Bernstein Basis Polynomials 
The Bernstein—Bézier representation (briefly B-B representation) of curves and surfaces applies the 
Bernstein polynomials 


Bip (t) = (1)6a —t)" (i =0,1,...,n) (19.252) 
and uses the following fundamental properties: 
1. O€ Bint) <1 for 0<t<1, (19.253) 
2. Y Bin(t)=1. (19.254) 
i=0 


Formula (19.254) follows directly from the binomial theorem (see 1.1.6.4, p. 12). 
E A: Bat) 21—1, Βιι() = t (Fig. 19.12). 
E B: Bo;(t) = (1-0)? , Big(t) = 3t(1 —0?, Bos(t) = 3t (1 — t) Baa(t) = t (Fig. 19.13). 


3 
l B; (t) 
B; 9 Ί 
| 4 
9 
0 1 t 0 
Pigure:19.12 Figure 19.13 
2. Vector Representation 
Now, a space curve, whose parametric representation is x = x(t), y = y(t), z = z(t), will be denoted 
in vector form by 
T = F(t) = x(t) 6, + y(t) ἕν + z(t) &.. (19.255) 
Here t is the parameter of the curve. The corresponding representation of a surface is 
T = F(u, v) = z(u,v) ës + y(u, v) ἕν + z(u, v) 8.. (19.256) 


Here, u and v are the surface parameters. 

19.7.3.1 Principle of the B-B Curve Representation 

Suppose there are given n + 1 vertices P; (i = 0,1,...,n) of a three-dimensional polygon with the 
position vectors P,. Introducing the vector-valued function 


O) - Y; B. (E, (19.257) 


i=0 
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a space curve is assigned to these points, which is called the B-B curve. Because of (19.254) for- 
mula (19.257) can be considered as a “variable convex combination” of the given points. The three- 
dimensional curve (19.257) has the following important properties: 


A 


1. The points J Py and P, are interpolated. 

2. Vectors PoP; and P, 4 P, are tangents to r(t) at points Po 
and F,,. 

The relation between a polygon and a B-B curve is shown in 
Fig. 19.14. 

The B-B representation is considered as a design of the curve, 
since it is easy to influence the shape of the curve by changing 
the polygon vertices. 

Often normalized B-splines are used instead of Bernstein poly- 
Figure 19.14 nomials. 


The corresponding space curves are called the B-spline curves. Their shape corresponds basically to 
the B-B curves with the following advantages: 

1. The polygon is better approximated. 

2. The B-spline curve changes only locally if the polygon vertices are changed. 

3. In addition to the local changes of the shape of the curve the differentiability can also be influenced. 
So, it is possible to produce break points and line segments for example. 


19.7.3.2 B-B Surface Representation 


Suppose there are given the points P;; (i = 0,1,...,n; j — 0, 1,..., m) with the position vectors P; 
which can be considered as the nodes of a grid along the parameter curves of a surface. Analogously to 
the B-B curves (19.257), a surface is assigned to the grid points by 
n m 
r(u, v) =X Y Bi, (u) Bj, (v)P 5. (19.258) 
i=0 j=0 
Representation (19.258) is useful for surface design, since by changing the grid points the surface can be 
changed. Anyway, the influence of every grid point is global, so one should change from the Bernstein 
polynomials to the B-splines in (19.258). 


19.8 Using the Computer 
19.8.1 Internal Symbol Representation 


Computers are machines that work with symbols. The interpretation and processing of these symbols is 
determined and controlled by the software. The external symbols, letters, cyphers and special symbols 
are internally represented in binary code by a form of bit sequence. A bit (binary digit) is the smallest 
representable information unit with values 0 and 1. Eight bits form the next unit, the byte. In a byte 
one can distinguish between 2° bit combinations, so 256 symbols can be assigned to them. Such an 
assignment is called a code. There are different codes; one of the most widespread is ASCII (American 
Standard Code for Information Interchange). 


19.8.1.1 Number Systems 


1. Law of Representation 

Numbers are represented in computers in a sequence of consecutive bytes. The basis for the internal 
representation is the binary system, which belongs to the polyadic systems, similarly to the decimal 
system. 
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The law of representation for a polyadic number system is 


a= M, zB' (m>0, n> 0; m,n integer) (19.259) 


i=—m 
with B as basis and z; (0 € z; < B) as a digit of the number system. The positions i > 0 form the 
integers, those with i « 0 the fractional part of the number. 
W The decimal number representation, i.e., B. — 10, of the decimal number 139.8125 has the form 
139.8125 = 1- 10? + 3: 10! +9- 10° - 8-101 + 1-10? - 2:10? +5. 107. 
The number systems occurring most often in computers are shown in Table 19.3. 


Table 19.3 Number systems 


Number system Basis | Corresponding digits 
Binary system 2 0.1 
Octal system 8 0,1,2,3,4,5,6,7 


Hexadecimal system 16 0,1,2,3,4,5,6,7,8,9, A,B,C,D,E,F 
(The letters A-F are for the values 10-15.) 


Decimal system 10 0,1,2,3,4, 5,0, 7,8,9 


2. Conversion 

The transition from one number system to another is called conversion. If different number systems 
are used in the same time, in order to avoid confusion the basis is denoted as an index. 

W The decimal number 139.8125 is in different systems: 139.812519 = 10001011.1101s = 215.64. = 
8B.Dis. 
1. Conversion of Binary Numbers into Octal or Hexadecimal Numbers The conversion of 
binary numbers into octal or hexadecimal numbers is simple. Groups of three or four bits are formed 
starting at the binary point to the left and to the right, and their values are determined. These values 
are the digits of the octal or hexadecimal systems. 

2. Conversion of Decimal Numbers into Binary, Octal or Hexadecimal Numbers For the 
conversion of a decimal numbers into another system, the following rules are applied for the integer and 
for the fractional part separately: 


a) Integer Part: If G is an integer in the decimal system, then for the number system with basis B 
the law of formation (19.259) is: 


G-M zB’! (n>0). (19.260) 
i=0 
If G is divided by B, then an integer part (the sum) is obtained and a residue: 
G n " 20 
== D m. 19.261 
πι πας (19.261) 


Here, zp can have the values 0,1,..., B — 1, and it is the lowest valued digit of the required number. If 
this method is repeated for the quotients, further digits can be got. 
b) Fractional Part: If g is a proper fraction, then the method to convert it into the number system 
with basis B is 

m 


gB=24+> B, (19.262) 
i=2 
i.e., the next digit is obtained as the integer part of the product gB. The values z_2, z_3, ... can be 


obtained in the same way. 
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W A: Conversion of the decimal number W B: Conversion of a decimal fraction 
139 into a binary number. 0.8125 into a binary fraction. 

139:2— 69 residue 1 (1= z) 0.8125.2 = 1.625 (1=z_1) 
69:2=34 residue 1 (1-21) 0.625 .221.25 {155 9) 
34:2=17 residue 0 (0-3) 0.25 -2=0.5 (0 = z_3) 
17:2= 8 residue 1 $ 05 «21.0 (1 = 24) 

8:2= 4 residue 0 0.0 -2=0.0 
ο... 05125 = 010 
1:2= 0 residue 1 (1= 27) 


13010 = 100010115 


3. Conversion of Binary, Octal, and Hexadecimal Numbers into a Decimal Number The 
algorithm for the conversion of a value from the binary, octal, or hexadecimal system into the decimal 
system is the following, where the decimal point is after 20: 


a= y zB' (πι 0, n> 0,integer). (19.263) 
The calculation is convenient with the Horner rule (see 19.1.2.1, p. 952). 
W LLLOL=1- 211: P41 P402 = 29. 


111 0 1 
The corresponding Horner scheme is shown on the right. 2 6 14 98 
19.8.1.2 Internal Number Representation INR 1 3 7 14 |29 


Binary numbers are represented in computers in one or more bytes. Two types of form of representation 
are distinguished, the fixed-point numbers and the floating-point numbers. In the first case, the decima. 
point is at a fixed place, in the second case it is “floating” with the change of the exponent. 


1. Fixed-Point Numbers 4—————— binary number (t bits) ————> 
The range for fixed-point numbers with the 
given parameters is 


0<ļaļ<ž-1. (19.264) 
Fixed-point numbers can be represented in the i 
form of Fig. 19.15. sign v of the fixed-point number 
2. Floating-Point Numbers 
Basically, two different forms are in Figure 19.15 


use for the representation of floating-point numbers, where the internal implementation can vary in 
detail. 


exponent E mantissa M ' g . . 
(p bits) (t bits) 1. Normalized Semilogarithmic Form 
P » |« » n the first form, the signs of the exponent 
E and the mantissa M of the number a are 
stored separately 
a = XMB*F. (19.2652) 
ii ^ Here the exponent F is chosen so that for the 
sign vr sign Vm mantissa 
of the exponent of the mantissa l/BEM«1 (19.265b) 
olds. It is called the normalized semiloga- 


Figure 19.16 rithmic form (Fig. 19.16). 
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The range of the absolute value of the floating-point numbers with the given parameters is: 
zr? < |a| < (1-27) 20, (19.266) 


2. IEEE Standard The second (nowadays used) form of floating-point numbers corresponds to the 
TEEE (Institute of Electrical and Electronics Engineers) standard accepted in 1985. It deals with the 
requirements of computer arithmetic, roundoff behavior, arithmetical operators, conversion of num- 
bers, comparison operators and handling of exceptional cases such as over- and underflow. 


The floating-point number representations are | characteristic C » mantissa M 


shown in Fig. 19.17. 

'The characteristic C comes from the exponent 

E by addition of a suitable constant K. This is 

chosen so that only positive numbers occur in the 

characteristic. The representable number is i . . 
a = (51 35 d bas. ea sign v of the floating point number 


with E-C- K. (19.267) 


Figure 19.17 


Here: Cmin = 1, Cmax = 254, since C = 0and C = 255 are reserved. 

The standard gives two basic forms of representation (single-precision and double-precision floating- 
point numbers), but other representations are also possible. Table 19.4 contains the parameters for 
the basic forms. 


Table 19.4 Parameters for the basic forms 
Parameter Single precision | Double precision 
Word length in bits 32 64 
Maximal exponent Emax +127 +1023 
Minimal exponent Emin —126 —1022 
Constant K +127 +1023 
Number of bits in exponent 8 11 
Number of bits in mantissa 24 53 


19.8.2 Numerical Problems in Calculations with Computers 
19.8.2.1 Introduction, Error Types 


The general properties of calculations with a computer are basically the same as those of calculations 
done by hand, however some of them need special attention, because the accuracy comes from the rep- 
resentation of the numbers, and from the missing judgement with respect to the errors of the computer. 
Furthermore, computers perform many more calculation steps than human can do manually. 

So, there is the problem of how to influence and control the errors, e.g., by choosing the most appropriate 
numerical method among the mathematically equivalent methods. 

In further discussions, the following notation is used, where x denotes the exact value of a quantity, 
which is mostly unknown, and is an approximation value of x: 


" Δα 
Absolute error: |Ax| = |x — i|. (19.308) Relative error: |——| = (19.269) 
x 
The notations 
ε(ᾳ) 2 z— £ and E(x) = - (19.270) 


£ 
are also often used. 
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19.8.2.2 Normalized Decimal Numbers and Round-Off 


1. Normalized Decimal Numbers 
Every real number 2 Æ 0 can be expressed as a decimal number in the form 

x = X0.b,b,...-107 (b, £0). (19.271) 
Here 0, 0,55 . . . is called the mantissa formed with the digits b; € {0,1,2,...,9}. The number E is an 
integer, the so-called exponent with respect to the base 10. Since b, 4 0, (19.271) is called a normalized 
decimal number. 
Since only finitely many digits can be handled by a real computer, one has to restrict himself to a fixed 
number t of mantissa digits and to a fixed range of the exponent E. So, from the number x given in 
(19.271) the number 


+0.b, by ++ - b, - 10” for bı <5 (round-down), 


ε(0.διὀο:::ἐι + 1070108 for bı > 5 (round-up), (19.272) 


Ts] 


is obtained by round-off (as it is usual in practical calculations). The absolute error caused by round-off 
[Δα] = |x — ž| < 0.5 - 10 ^10. (19.273) 

2. Basic Operations and Numerical Calculations 

Every numerical process is a sequence of basic calculation operations. Problems arise especially with 

the finite number of positions in the floating-point representation. Here a short overview is given. It is 

supposed that x and y are normalized error-free floating-point numbers with the same sign and with a 

non-zero value: 


ο y=m,B® with (19.274a 
t s 
m=) a) B, aC) #0, and (19.274b 
k=1 
a= 0orlor... or B-1 for k>1 (i=1,2). (19.274c 


1. Addition If E, > E, then the common exponent becomes £i, since normalization allows us to 
make only a left-shift. The mantissas are then added. 


If B^ €| m, 4 m Bo!) |< 9 (19.275a) and [πη +m,B οι λα, (19.275b 


then shifting the decimal point by one position to the left results in an increase of the exponent by one. 
WI 0.9604 - 10? + 0.5873 - 10? = 0.9604 - 10? + 0.05873 - 10? = 1.01913 - 10? = 0.1019 - 104. 

2. Subtraction The exponents are equalized as in the case of addition, the mantissas are then sub- 
tracted. If 


| m, — m3B- οι Έλι 1 — Bo (19.276a) and |m πως αι Έλις B, (19.276b) 


shifting the decimal point to the right by a maximum off positions results in the corresponding decrease 
of the exponent. 
BI 0.1004. 10? — 0.9988 - 10? = 0.1004- 10? — 0.09988 - 103 = 0.00052 - 107 = 0.5200- 10°. This example 
shows the critical case of subtractive cancellation. Because of the limited number of positions (here 
four), zeros are carried in from the right instead of the correct characters. 
3. Multiplication The exponents are added and the mantissas are multiplied. If 

mim < Bt, (19.277) 
then the decimal point is shifted to the right by one position, and the exponent is decreased by one. 
Wi (0.3176 - 10°) - (0.2504 - 10°) = 0.07952704 - 108 = 0.7953 - 107. 
4. Division The exponents are subtracted and the mantissas are divided. If 

mi 


— muB (19.278) 


msg 
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then the decimal point is shifted to the left by one position, and the exponent is increased by one. 
α (0.3176 - 10%) /(0.2504 - 10°) = 1.2683706 . . . 10^? = 0.1268 - 107. 
5. Error of the Result The error of the result in the four basic operations with terms that are 
supposed to be error-free is a consequence of round-off. For the relative error with number of positions 
t and the base B, the limit is 

B 
$3487. (19.279) 
6. Subtractive cancelation As it was mentioned above, the critical operation is the subtraction 
of nearly equal floating-point numbers. If it is possible, one should avoid this by changing the order of 
operations, or by using certain identities. 

= V1985 — V/1984 = 0.4455 : 10? — 0.4454 107 = 0.1- 107! or z = V1985 — ν1954 = 

1985 — 1984 
= 0.1122 107! 


ν 1085 + V/1984 


19.8.2.3 Accuracy in Numerical Calculations 


1. Types of Errors 
Numerical methods have errors. There are several types of errors, from which the total error of the final 
result is accumulated (Fig. 19.18). 


Total error 


Input error Error of method Round-off error 


Truncation error Discretization error 


Figure 19.18 

2. Input Error 
1. Notion of Input Error Input error is the error of the result caused by inaccurate input data. 
Slight inaccuracies of input data are also called perturbations. The determination of the error of the 
input data is called the direct problem of error calculus. 'The inverse problem is the following: How 
large an error the input data may have such that the final input error does not exceed an acceptable 
tolerance value. The estimation of the input error in rather complex problems is very difficult and is 
usually hardly possible. In general, for a real-valued function y = f(x) with x = (v1, 25,..., £n)" the 
absolute value of the input error is 

|Ay| = |f(zi, 22, ..., 24) — f (ži, $2, ..., 
n Of " n Of 
=I δα, Go 2o ἔπ)(αι = &)| S X (vs ος 091 J IAzil, (19.280) 

i 


ici οτι i=1 


if the Taylor formula (see 7.3.3.3, p. 471) is used for y = f(x) = f (£1, £2,..., αι) with a linear residue. 

£1, £2, ..., En denote the intermediate values, Τι, Ζο,..., 2, denote the approximating values of x1, £2, 
ΕΒ `. The appr oximating values are the perturbed input data. Here, also the Gauss error propagation 

law (see 16.4.2.1, p. 855) is considered. 

2. Input Error of Simple Arithmetic Operations The input error is known for simple arith- 

metical operations. With the notation of (19.268)-(19.270) for the four basic operations: 
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e(a + y) = e(x) + e(y), (19.281) e(vy) = ye(x) + xe(y) + e(x)e(y), (19.282 
£ 1 Hx : : R 
€ ( ) e(x) =é€(y) + terms of higher order ine, (19.283 
y y y 
eur ey) = SDE veal (19284) cir) = ea) ea) + eDesa), 19:285 
Erel (2) = ére (£) — Era (y) + terms of higher order ine. (19.286 
y 


The formulas show: Small relative errors of the input data result in small relative errors of the result 
on multiplication and division. For addition and subtraction, the relative error can be very large i 
jn yl < |x| + |]. 

3. Error of the Method 


1. Notion of the Error of the Method The error of the method comes from the fact that theoret- 
ically continuous phenomena are numerically approximated in many different ways as limits. Hence, 
there are truncation errors in limiting processes (as, e.g., in iteration methods) and discretization er- 
rors in the approximation of continuous phenomena by a finite discrete system (as, e.g., in numerical 
integration). Errors of methods exist independently of the input and round-off errors; consequently, 
they can be investigated only in connection with the applied solution methodology. 

2. Applying Iteration Methods If an iteration method is used, then both cases may occur: A 
correct solution or also a false solution of the problem can be obtained. It is also possible that no 
solution is obtained by an iteration method although there exists one. 

Το make an iteration method clearer and safer, the following advices should be considered: 


a) To avoid “infinite” iterations, count the number of steps and stop the process if this number exceeds 
a previously given value (i.e., stop without reaching the required accuracy). 

b) The location of the intermediate result should be tracked on the screen by a numerical or a graphical 
representation of the intermediate results. 
c) All known properties of the solution should be used such as gradient, monotonicity, etc. 

d) The possibilities of scaling the variables and functions should be investigated. 

e) Several tests should be performed by varying the step size, truncation conditions, initial values, etc. 
4. Round-off Errors 
Round-off errors occur because the intermediate results should be rounded. So, they have an essential 
importance in judging mathematical methods with respect to the required accuracy. They determine 
together with the errors of input and the error of the method, whether a given numerical method is 
strongly stable, weakly stable or unstable. Strong stability, weak stability, or instability occur if the 
total error, at an increasing number of steps, decreases, has the same order, or increases, respectively. 
At the instability one distinguishes between the sensitivity with respect to round-off errors and dis- 
cretization errors (numerical instability) and with respect to the error in the initial data at a theo- 
retically exact calculation (natural instability). A calculation process is appropriate if the numerical 
instability is not greater than the natural instability. 


For the local error propagation of round-off errors, i.e., errors at the transition from a calculation step 
to the next one, the same estimation process can be used as the one applied at the input error. 

5. Examples of Numerical Calculations 

Some of the problems mentioned above are illustrated by numerical examples. 

W A: Roots of a Quadratic Equation: 

ax? + bx +c = 0 with real coefficients a, b, c and D = b? — Λας > 0 (real roots). Critical situations are 
the cases a) | 4ac |« b? and b) 4ac ~ b?. Recommended proceeding: 
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b + sign(b)/D E 
a) m + = ve » t= : (Vieta root theorem, see 1.6.3.1, 3., p. 44). 
a axı 
b) The vanishing of D cannot be avoided by a direct method. Subtractive cancellation occurs but the 
error in (b + sign(bv D) is not too large since |b| > vD holds. 
ΒΒ: Volume of a Thin Conical Shell for ^ < r 
Vd Cth m 


because of (r 4- h) z r there is a case of subtractive cancellation. However in the 


: 3r?h + 3rh? + h? : 
equation V — 4x there is no such problem. 


3 


a 1 
E C: Determining the Sum S = Σ᾽ Pal (S = 1.07667...) with an accuracy of three signif- 


icant digits. Performing the calculations with 8 digits, about 6000 terms should be added. After the 
1 1 1 
k? | 1 k2? 11) 
gom 


1 
S me "m Σ B gs FEY and S » FG) hold. By this transformation only 


eight terms are soa deed. 


identical transformation 


0 2 
B D: Avoiding the 0 Situation in the function z — (1 1 -- 224 y) x y for r — y =0. 
x? y? 


Multiplying the numerator and the denominator by (1 + /1 + x? + y?) one avoids this situation. 
W E: Example for an Unstable Recursive Process. Algorithms with the general form y,41 = 


2 
aYn + Όψη ι (n = 1,2,...) are stable if the condition 5 + 4 +b] < 1 is satisfied. The special 
case Yny1 = —3Yn + Ay, i (n = 1,2,...) is unstable. If yo and y; have errors € and —e, then for 


Y2, Y3, UA: Y5, ον... the errors are Τε, —25e, 103e, —409e, 1639e, .... The process is instable for the pa- 
rameters a — —3 and b — 4. 
ΒΕ: Numerical Integration of a Differential Equation. The numerical solution for the first- 
order ordinary differential equation 

y = f(x,y) with [ία y) = ay (19.287 
and the initial value y(xo) = yo will be represented. 
a) Natural Instability. Together with the exact solution y(x) for the exact initial values y(xo) = yo 
let u(x) be the solution for a perturbed initial value. Without loss of generality, it may be assumed tha 
the perturbed solution has the form 
u(x) = y(x) + εη(α), (19.288a 
where ε is a parameter with 0 < £ < 1 and ņ(x) is the so-called perturbation function. Considering 
that u/(x) = f(v, u) one gets from the Taylor expansion (sce 7.3.3.3, p. 471) 


u(x) = f(x.y(x) ἠ- eu(x)) = f(x,y) +en(z) fy (x. y) + terms of higher order (19.288b 
which implies the so-called differential variation equation 
nz) = fur. yn. (19.288c 


The solution of the problem with f(x,y) = ay is 


n(x) = No e πο) with no = η(αρ). (19.288d 
For a > 0 even a small initial perturbation no results in an unboundedly increasing perturbation n(x). 
So, there is a natural instability. 
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b) Investigation of the Error of the Method in the Trapezoidal Rule. With a = —1, the stable 


differential equation y'(x) = —y(a) has the exact solution 
ylz) = yoe 79), where yo = υ(αο). (19.289a) 
The trapezoidal rule is 
Tit) si 
f y(x)da = eh with h = fiy — vi. (19.289b) 
zı 


By using this formula for the given differential equation 


ΠΕΙ 


Wit Yi " 2—h. 
Uii = δι 4 ] (—y)dx = iji si 5 πμ or μι "EL or 
2—hV 
"m " 19.289c 


is valid. With 2; = zo + ih, i.e., with i = (£x; — xg/h for0 < h < 2 


2+h h 12 80 
(19.2894) 


1 G — i) 

(zi—zo)/h n 2 4 
2—h -- 2-Ε h h 

Ji = ( ) Yo = Foe i70) with c(h) cs 1-5 : 


is obtained. If J = yo, then Ji < y;, and so for h > 0, jj; also tends to the exact solution yoe 779. 


c) Input Error In b) it is supposed that the exact and the approximate initial values coincide. Now, 
the behavior is investigated when yo 7 Jo with | go — yo |< εο. 


i " —-h,. "Pe J= MH 7 
Since (ει — Yi+1) (9; — yi) there is (δει — yii) € ( ) (Uo — νο). (19.290a) 


ra imis M UE 
24h 2+h 

So, £&j44 is at most of the same order as ερ, and the method is stable with respect to the initial values. 

It has to be mentioned that in solving the above differential equation with the Simpson method an 

artificial instability is introduced. In this case, for h — 0, the general solution is obtained as 


δις Cie ^ + OS(-1)ie*5, (19.290b) 
'The problem is that the numerical solution method uses higher-order differences than those to which 
the order of the differential equation corresponds. 


19.8.3 Libraries of Numerical Methods 


Over time, libraries of functions and procedures have been developed independently of each other for 
numerical methods in different programming languages. An enormous amount of computer experimen- 
tation was considered in their development, so in solutions of practical numerical problems one should 
use the programs from one of these program libraries. Programs are available for current operating 
systems like WINDOWS, UNIX and LINUX and mostly for every computation problem type and they 
keep certain conventions, so it is more or less easy to use them. 


The application of methods from program libraries does not relieve the user of the necessity of thinking 
about the expected results. This is a warning that the user should be informed about the advantages 
and also about the disadvantages and weaknesses of the mathematical method he/she is going to use. 


19.8.3.1 NAG Library 


The NAG library (Numerical Algorithms Group) is a rich collection of numerical methods in the form 
of functions and subroutines/procedures in the programming languages FORTRAN 77, FORTRAN 90 
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and C. Here is a contents overview: 


1 
2 
3 
4 
5 
6 
7 
8. 
9. 
10 
11 
12 
13 


Complex arithmetic 

Roots of polynomials 

Roots of transcendental equations 
Series 
ntegration 
Ordinary differential equations 
Partial differential equations 
Numeric differentiation 

ntegral equations 
nterpolation 


Minimum/maximum of a function 
Matrix operations, inversion 


Approximation of curves and surfaces from data 


14. Eigenvalues and eigenvectors 

15. Determinants 

16. Simultaneous linear equations 

17. Orthogonalization 

18. Linear algebra 

19. Simple calculations with statistical data 
20. Correlation and regression analysis 
21. Random number generators 

22. Non-parametric statistics 

23. Time series analysis 

24. Operations research 

25. Special functions 

26. Mathematical and computer constants 


Furthermore the NAG library contains extensive software concerning statistics and financial mathe- 
matics. 


19.8.3.2 IMSL Library 
The IMSL library (International Mathematical and Statistical Library) consists of three synchro- 
nized parts: 
General mathematical methods, 
Statistical problems, 
Special functions. 
The sublibraries contain functions and subroutines in FORTRAN 77, FORTRAN 90 and C. Here is a 
contents overview: 


General Mathematical Methods 


QUod re 


S 


ect 


2) ο DIS 


Linear systems 

Eigenvalues 

Interpolation and approximation 
Integration and differentiation 
Differential equations 


atistical Problems 


Elementary statistics 
Regression 
Correlation 

Variance analysis 


Non-parametric statistics 
Covariance and factor analysis 


Discriminance analysis 
Cluster analysis 


Special Functions 


1. 
2. 


3. 
4. 
5 


Elementary functions 
Trigonometric and hyperbolic 
functions 

Exponential and related functions 
Gamma function and relatives 
Error functions and relatives 


Analysis of time series and forecasting 


ew 


Ce o 


12. 
13. 
14. 
15. 


Categorization and discrete data analysis 


16. 


Test of goodness of fit and test of randomness 17. 


18. 
19. 
20. 


Transformations 

Non-linear equations 
Optimization 

Vector and matrix operations 
Auxiliary functions 


Random sampling 

Life time distributions and reliability 
Multidimensional scaling 

Estimation of reliability function, 
hazard rate and risk function 
Line-printer graphics 

Probability distributions 

Random number generators 
Auxiliary algorithms 

Auxiliary mathematical tools 


Bessel functions 

Kelvin functions 

Bessel functions with fractional orders 
Weierstrass elliptic integrals and 
related functions 

Different functions 
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19.8.3.3 Aachen Library 


The Aachen library is based on the collection of formulas for numerical mathematics of G. Engeln— 
Miillges (Fachhochschule Aachen) and F. Reutter (Rheinisch-Westfálische Technische Hochschule Aa- 
chen). It exists in the programming languages BASIC, QUICKBASIC, FORTRAN 77, FORTRAN 90, C, 
MODULA 2 and TURBO PASCAL. Here is an overview: 


Numerical methods to solve non-linear and special algebraic equations 
Direct and iterative methods to solve systems of linear equations 
Systems of non-linear equations 

Eigenvalues and eigenvectors of matrices 

Linear and non-linear approximation 

Polynomial and rational interpolation, polynomial splines 

Numerical differentiation 

Numerical quadrature 

nitial value problems of ordinary differential equations 

10. Boundary value problems of ordinary differential equations 


Soo EUG Ce Dn Por 


The programs of the Aachen library are especially suitable for the investigation of individual algorithms 
of numerical mathematics. 


19.8.4 Application of Interactive Program Systems 
and Computeralgebra Systems 


19.8.4.1 Matlab 

The commercial program system Matlab Matlab (Matrix Laboratory) is an interactive environment 
for solving mathematically formulated problems and in the same time it is a high level script language 
for scientific technical computations. The set up priorities are the problems and algorithms of linear 
algebra. Matlab unifies the convenient well developed implementations of numerical procedures with 
advanced graphical representation of the results and data. The computations are processed mostly 
with double precision floating point numbers according to IEEE-standards (see Table19.4, p. 1004). 
As further alternatives which are compatible to Matlab are the systems Scilab and Octave with free 
downloads. 

1. Functions Survey 

'There is a short survey of the procedures and functions available in Matlab: 


General Mathematical Functions 


1. Trigonometry 5. Coordinate transformations 

2. Exponential functions, Logarithms 6. Round-of and fractions 

3. Special functions 7. Discrete mathematics 

4. Complex arithmetic 8. Mathematical constants 
Numerical Linear Algebra 

1. Manipulation of fields and matrices 5. Eigenvalues and singular values 

2. Special matrices 6. Matrix factorization 

3. Matrix analyzes (norms, condition) 7. Matrix functions 

4. Systems of linear equations 8. Methods for sparse matrices 
Numerical Methods 

1. Calculation of statistical data 7. Determination of the convex closures 

2. Correlation and regression 8. Numerical integration 

3. Discrete Fourier transformation 9. Ordinary differential equations 

4. Polynomials and splines 10. Partial differential equations 

5. One- and more-dimensional interpolation 11. Non-linear equations 

6. Triangulations and decompositions 12. Minimization of functions 
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In addition there are several program packages of Matlab, the so called toolbooxes, which can be applied 
in the cases of special mathematical classis of problems. As some examples can be mentioned here the 
curve fitting, filtering, business mathematics, time series analysis, signal and pattern processing, neural 
networks, optimization, partial differential equations, splines, statistics and wavelets. 

In the following paragraphs the possibilities of Matlab are demonstrated by simple examples. The same 
problems are partially discussed here as in the paragraphs of numerical applications of Mathematica and 
Maple. 


2. Numerical Linear Algebra 


After starting with Matlab the command prompt >> appears in the command window to indicate the 
readiness to accept commands. If a command is not closed by a semicolon, then the result appears in 
the command window. The basic command to solve a system of linear equations A x — b (see 19.2.1, 
p. 955) is the backslash operator V. 


103 
WB Given matrix A = {211 | and vector b = (-2,3,2)7 . For the input 
123 
>> A=(103;211;123], b=[-2;3;2], x=A\b, norm(A*x—b) 


the output is 


103 -2 1.0000 
A=211 b= 3 x= 20000 ans = 8.8818e — 016 
123 2 —1.0000 


As the Euclidean norm of the residual shows the obtained solution x (for which not all the digits are 
shown) satisfies the system of equations with an accuracy allowed by the mashine floating point repre- 
sentation. 
If the matrix A is quadratic and nonsingular, then the linear system has a unique solution. By the 
backslash operator V ordinary the Gaussian elimination is used with column pivoting, i.e., a triangle 
decomposition PA — LR is obtained (see 19.2.1.1, p. 955). 
W The triangle decomposition of A can be realized also with the input 

>> [L,R, P] = 1u(A) 
giving the output 


1.0000 0 0 2.0000 1.0000 1.0000 01 
L= | 0.5000 1.0000 0 R= 0 1.5000 2.5000 Ρ-|00 
0.5000 —0.3333 1.0000 0 0 3.3333 10 


0 

i 

0 
(Here the matrices are given in brackets in order to avoid confusion). 
The backslash operator first tests the properties of the coefficient matrix A. If A is a permutation 
of a triangle matrix, then the corresponding echelon form is solved. For a symmetric A the Cholesky 
method is applied (see 19.2.1.2, p. 958). 
If the condition number of the coefficient matrix A is too high, then numerical problems can occure 
during the solution. Because of this problem, during the procedure Matlab calculates an estimation of 
the reciprocal value of the condition number, and gives a warning if it is too small. 
Β The Hilbert matrix H = (hig) of order n = 13 can serve as an example, where hip = 1/(i + k — 1). 

>> x = hilb(13)\ones(13, 1); 
Warning: Matrix is close to singular or badly scaled. 

Results may be inaccurate. RCOND = 2.409320e-017. 
In the case of overdetermined linear systems the corresponding linear fitting problem is handled by an 
orthogonalization procedure, i.e. by orthogonal transformations into a QR-decomposition A = QR 
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(see 19.2.1.3, p. 958). 
E > A=([103;211;123;11 -1|; b= [-2;3;2;1]; x=A\b, norm(Axx—b) 
0.4673 


x= 1.4393 ans = 2.0508 
—0.4953 


>> [Q,R] = ar(4) 


—0.3780 —0.4583 0.6466 0.4785 —2.6458 —1.8898 —2.6458 

Q= 0.7559 —0.2750 —0.2321 —0.5469 Ra 0 1.5584 0.6417 
~ | —0.3780 0.8250 0.4145 —0.0684 ΝΙΝ 0 0 3.5480 
—0.3780 0.1833 —0.5969 0.6836 0 0 0 


The backslash operator also gives meaningful results in the cases of underdetermined and rank deficient 
linear systems of equations. The details of these cases with the ways how to handle large sparse matrices 
can be found in the corresponding documentation of Matlab and in the introductions [19.20], [19.29]. 
3. Numerical Solution of Equations 

A polynomial 


n—1l 


p(z) = αμα” + Anu" ++ aim + ao 


is in Matlab represented by the row vector (an, à, 1,...,04, αρ) of the coefficients. Several functions 
are available to handle polynomials. 


Β As an example the polynomial value at 1, the derivative (i.e. the coefficient vector of the derivative 
polynomial) and the roots are determined for the polynomial p(x) = x9 + 3x? — 5. 
>> p-[100030 - 8]; 
>> polyval(p, 1) ans = —1 
>> polyder(p) ans = 600060 
>>  roots(p) ans = 0.8673 + 1.15291, 0.8673 — 1.15291, 1.0743, 
0.8673 + 1.15291, —0.8673 — 1.15291, —1.0743 


The roots are determined as the eigenvalues of the corresponding companion matrix. 
'The command fzero is used to find the approximate solutions of nonlinear scalar equations. 
W Calculation of three solutions of equation e * — 4g? -- 0. 
>>  fzero(Q(x)exp(—x^3) — 4 * x^2, 1) ans — 0.4741 
>>  fzero(Q(x)exp(—x^3) — 4 * x^2, 0) ans — —0.5413 
>>  fzero(G(x)exp(-x^3) — 4 « x^2, — 1) ans — —1.2085 
'The input of the equation for the command fzero is made as an unnamed function. As it is obvi- 
ous, it depends on the given initial value in the second argument, which solution is approximated. A 


combination of the bisection method with regula falsi (see 19.1.1.3, p. 951) is used for the iteration 
process. 


4. Interpolation 

Function fitting based on a given data set can be done either by interpolation (see 19.6.1, p. 982 or 
19.7.1.1, p. 996) or by best approximating functions (see 19.6.2.2, p. 985). In Matlab the command plot 
is the most simple way to represent the data set graphically. The menu in the selfopening graphical 
window contains tools for editing the figure (linetypes, symbols, titles and legends), for exporting and 
printing Basic Fitting under Tools. 
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The Basic Fitting is a subroutine of Tools by which a variety of interpolation methods and best ap- 
proximating polynomials of different degrees are offered. It is realized by the functions interp1 and 
polyfit. 
W By the input 
>> plot(|1.70045, 1.2523, 0.638803, 0.423479, 0.249091, 0.160321, 0.0883432, 0.0570776, 
0.0302744, 0.0212794]); 


the data values are located at the data positions 1, 2, ... , 10 and are graphically represented. Fig.19.19a 
shows the data set, the corresponding cubic interpolation spline and the best approximating polynomial 
of degree four. 
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Figure 19.19 a) and b) 


The Function interp2 offers appropriate methods to interpolate the data given over a two dimensional 
rectangular grid (see 19.7.2.1, p. 998). To interpolate irregularly distributed data is served by calling 
griddata. 
W The command sequence 

>> [X,Y] =meshgrid(—2:1:2); F—4-sqrt(16 — X.^2 — Y.^2); 

>>  [Xe,Ye| = meshgrid(—2:0.1:2); S = interp2(X, Y,F, Xe, Ye, spline’): 

>> surf(Xe, Ye, 8) 

>> holdon; stem3(X,Y,F,; fill’) 


realizes the bivariable cubic spline interpolation of the function f(x,y) = y16 — x? — y? given on a 
grid. The interpolation spline is evaluated on a finer rectangular grid. Fig.19.19b represents the 
interpolation function where the data points are also shown. 

5. Numerical Integration 

Numerical integration is available in Matlab by the procedures quad and quadl. Both procedures are 
based on the recursive application of interpolation quadratures with adaptive step-size selection. quad 
is based on the Simpson formula, and in quad1 the Lobatto formulas of higher order are applied (see 
19.3.2, p. 964). In the case of sufficiently smooth integrand and higher accuracy requirements quadl 
works more effectively than quad. 


1 sing 


dx (Integral sine see 


W As the first example the approximation of the definite integral J = [ 
Jo 


8.2.5,1. p. 513) is considered. 
>> format long; |I, fwerte] = quad(@(x)(sin(x)./x), 0, 1) 
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Warning: Divide by zero. > In @(x) (sin(x)./x) In quad at 62 

I = 0.94608307007653 fwerte = 14 

>> format long; [1, fwerte] = quad1(@(x)(sin(x)./x), 0,1) 
Warning: Divide by zero. > In @(x) (sin(x)./x) In quadl at 64 
I = 0.94608307036718 fwerte = 19 


Both procedures obviously recognize the discontinuity of the integrand at the left endpoint of the in- 
terval, but the approximated value of the integral can be obtained without any difficulty. Based on the 
results of the same example in 19.3.4.2, p. 968 the number of function evaluations seems to be high but 
it is determined for the adaptive recursion. 

>> format long; [1, fwerte] = quad(G(x)(sin(x)./x). O0, 1, 1e — 14) 

I = 0.94608307036718 fwerte = 258 

>> format long; [I,fwerte] = quadl(@(x)(sin(x)./x),0,1,1e— 14) 

I = 0.94608307036718 fwerte = 19 


(The warning messages are not repeated here.) The determination of the accuracy of 10714, which is 
given as a a further argument (the default is a 107% tolerance), makes the advantages of quad1 obvious 
in this case. 


1000 2 


W The definite integral J = ri e * dx is to be determined. 
—1000 


>> format long; |I, fwerte] = quad(G(x)(exp(—x.^2)), — 1000, 1000, 1e — 10) 
I = 1.77245385094233 fwerte = 585 

>> format long; [I,fwerte] = quad1(@(x)(exp(—x.”2)), —1000, 1000, 1e — 10) 
I = 1.77245385090571 fwerte = 768 


The flat shape of the integrand in a very wide part of the integration interval and the relatively steep 
peak at x = 0 make quad better in this case. 

6. Numerical Solution of Differential Equations 

Matlab offers several procedures to determine the numerical solutions of initial value problems of sys- 
tems of first order ordinary differential equations. A standard procedure is ode45, in which Runge- 
Kutta methods of 4-th and 5-th order are applied with adaptive step-size selection (see 19.4.1.2, p. 969). 
To achieve higher accuracy the program ode113 is more effective with implemented linear multi-step 
methods of predictor-corrector type (see 19.4.1.4, p. 971). Besides there are procedures which are es- 
pecially effective for stiff systems of differential equations (see 19.4.1.5,4., p. 973). 


ie , 
E To solve the problem γ' = rid c y?), y(0) -- 0, by the Runge-Kutta method (see 19.4.1.2, p. 970) 


in the interval [0, 1] the input is 
>> [x,y] = ode45(@(x, y) (^ + y.”)./4), [0 1],0); plot(x. y) 
The shape of the resulted solution is represented in Fig.19.20a. 
W To solve the special Lorenz-system (see W in 17.2.4.3, p. 887) = 


xi = l0(ro— x1), αὐ 28x,— Tə — πια, Th = πιο — 375 
in the interval 0 € t < 50 with initial conditions z(0) = (0, 1, 0)7 the following commands are used: 
>> [t,x] = ode45(@(t, χ)([10 * (x(2) — x(1)); 
28 * x(1) — x(2) — x(1) * x(3);x(1) « x(2) — 8 x(3)/3]), [0 50], [0; 1; 0]); 
>> —plot(x(:,1),x(:,3)) 


1016 19. Numerical Analysis 


0.09 50 
0.08 45 
0.07 b 
35 

0.06 
30 

0.05 
25 

0.04 
20 

0.03 
15 
0.02 40 
0.01 5 

ο 

0 0.2 04 0.6 0.8 1 20 


Figure 19.20 a) and b) 
The last command creates a phase-diagram in the ση, 3 plane (see Fig.19.20b). 
19.8.4.2 Mathematica 


1. Tools for the Solution of Numerical Problems 

'The computer algebra system Mathematica offers a very effective tool that can be used to solve a large 
variety of numerical mathematical problems. 'The numerical procedures of Mathematica are totally dif- 
ferent from symbolic calculations. Mathematica determines a table of values of the considered function 
according to certain previously given principles, similarly to the case of graphical representations, and 
it determines the solution using these values. Since the number of points must be finite, this could be 
a problem with * badly " behaving functions. Although Mathematica tries to choose more nodes in 
problematic regions, we have to suppose a certain continuity on the considered domain. This can be 
the cause of errors in the final result. It is advised to use as much information as possible about the 
problem under consideration, and if it is possible, then to perform calculations in symbolic form, even 
if this is possible only for subproblems. 

In Table 19.5, we represent the operations for numerical computations: 


Table 19.5 Numerical operations 


NIntegrate calculates definite integrals 
n 
NSum calculates sums © f (1) 
i 
NProduct calculates products 
NSolve numerically solves algebraic equations 
NDSolve numerically solves differential equations 


After starting Mathematica the * Prompt ” In[1] :— is shown; it indicates that the system is ready to 
except an input. Mathematica denotes the output of the corresponding result by Out [1]. In general: 
The text in the rows denoted by In [n] := is the input. The rows with the sign Out [n] are given back 
by Mathematica as answers. The arrow —> in the expressions means, e.g., replace x by the value a. 


2. Curve Fitting and Interpolation 


1. CurveFitting Mathematica can perform the fitting of chosen functions to a set of data using the 
least squares method (see 6.2.5, p. 454ff.) and the approximation in mean to discrete problems (see 
19.6.2.2, p. 986). The general instruction is: 


Fit[(yi, vo, . ..], funkt, x]. (19.291) 
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Here the values y; form the list of data, funkt is the list of the chosen functions, by which the fitting 
should be performed, and x denotes the corresponding domain of the independent variables. If funkt 
is chosen, e.g., as Table[x^i, {1. 0, n)] , then the fitting will be made by a polynomial of degree n. 
W Let the following list of data be given: 
In[1] := 1 = (1.10045, 1.2523, 0.638803, 0.423479, 0.249091, 0.160321, 0.0883432, 0.0570776, 
0.0302744, 0.0212794} 
With the input 
In[2] := f1 =Fitll, {1,2, x^2, z^3, x4}, x] 
it is supposed that the elements of l are assigned to the values 1, 2,..., 10 of x. The result is the following 
approximation polynomial of degree four: 
Out [2] = 2.48918 — 0.853487x + 0.0998996α7 — 0.003713932? — 0.00002192244 
With the command 
In[3] := Plot|ListPlot[l, {α, 10}], f1, (, 1,10}, AxesOrigin—> {0,0} 


a representation of the data and the approximation curve given in Fig. 19.21a can be obtained. 
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Figure 19.21 


For the given data this is completely satisfactory. The terms are the first four terms of the series ex- 
pansion of e!- 99, 

2. Interpolation Mathematica offers special algorithms for the determination of interpolation func- 
tions. They are represented as so-called interpolating function objects, which are formed similarly 
to pure functions. The directions for using them are in Table 19.6. Instead of the single function 
values y; a list of function values and values of specified derivatives can be given at the given points. 
B With In[3] :— Plot|Interpolation[][r], {α, 1, 10}] one gets Fig. 19.21b. Obviously Mathemat- 
ica gives a precise correspondence to the data list. 


Table 19.6 Commands for interpolation 


Interpolation|lyi.ys....)] gives an approximation function with the values y; for the 
values z; — i as integers 

Interpolation|[[zi,yib (22, yo},...}] gives an approximation function for the point-sequence 
(Xi, yi) 


3. Numerical Solution of Polynomial Equations 
As shown in 20.3.2.1, p. 1038 Mathematica can determine the roots of polynomials numerically. The 
command is NSolve[p[r|] == 0, x, n], where n prescribes the accuracy by which the calculations should 
be done. If n is omitted, then the calculations are made to machine accuracy. The complete solution 
is got, i.e., m roots, if the input polynomial is of degree m. 
E In[1] :— NSolve[r^6 + 9132 — 5 == 0] 

Out [1] = {x—> —1.07432}, {a> —0.867262 — 1.152921), (x—» —0.867262 + 1.152921], 
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{α--» 0.867262 — 1.152921}, {2 —> 0.867262 + 1.152921}, (x— 1.07432}}. 


4. Numerical Integration 

For numerical integration Mathematica offers the procedure NIntegrate. Differently from the symboli- 
cal method, here it works with a table of values of the integrand. Two improper integrals are considered 
(see 8.2.3, p. 506) as examples. 

W A: In[1]:— NIntegrate|Exp|—2x2],{x,—Infinity,Infinity}] Οωί [1] = 1.77245. 

W B: In[2] := NIntegrate[1/27^2, (v, —1,1]] 


: ; . 1 
Power::infy: Infinite expression — encountered. 
NIntegrate::inum: Integrand ComplexInfinity is not numerical at{«} = {0}. 


Mathematica recognizes the discontinuity of the integrand at x = 0 in example B and gives a warning. 
Mathematica applies a table of values with a higher number of nodes in the problematic domain, and it 
recognizes the pole. However, the answer can be still wrong. 

Mathematica applies certain previously specified options for numerical integration, and in some spe- 
cial cases they are not sufficient. The minimal and the maximal number of recursion steps, by which 
Mathematica works in a problematic domain, can be determined with parameters MinRecursion and 
MaxRecursion. The default options are always 0 and 6. If these values are increased, then although 
Mathematica works slower, it gives a better result. 
E In[3] := NIntegrate[Exp[|—27^2], (, —1000, 1000}] Mathematica cannot find the peak at x = 0, 
since the integration domain is too large, and the answers is: 


NIntegrate::ploss: 
Numerical integration stopping due to loss of precision. Achieved neither the requested 
PrecisionGoal nor AccuracyGoal; suspect one of the following: highly oscillatory integrand 
or the true value of the integral is 0. 

Out [3] = 1.34946 . 10-70 


If the requirement is 
In[4] :— NIntegrate[Exp|—x^2], (x, —1000, 1000), MinRecursion—> 3, MaxRecursion—- 10], 
then the result is 
Out [4] = 1.77245 
Similarly, a result closer to the actual value of the integral can be got with the command: 
NIntegrate[fun, {£, £a, 21, 2, . . , Le]. (19.292) 


One can give the points of singularities x; between the lower and upper limit of the integral to force 
Mathematica to evaluate more accurately. 


5. Numerical Solution of Differential Equations 


In the numerical solution of ordinary differential equations and also in the solution of systems of differ- 
ential equations Mathematica represents the result by an InterpolatingFunction. It allows us to get 
the numerical values of the solution at any point of the given interval and also to sketch the graphical 
representation of the solution function. The most often used commands are represented in Table 19.7. 


Table 19.7 Commands for numerical solution of differential equations 


NDSolve[dgl, y, (x, £a, Te}] computes the numerical solution of the differential equation 
in the domain between x, and x, 

InterpolatingFunction[liste]|x] gives the solution at the point x 

Plot|[Evaluate|y[r]/. 10s], (x, £a, £e}] scetches the graphical representation 


19.8 Using the Computer 1019 


W Solution of a differential equation describing the motion of a heavy object in a medium with friction. 
The equations of motion in two dimensions are 


ë=- +y- t, ὕ--α-ηγώ”Ε{θ.υ. 


The friction is supposed to be proportional to the velocity. If g = 10, y = 0.1 are substituted, then the 
following command can be given to solve the equations of motion with initial values α(0) = y(0) = 0 
and «(0) = 100, y(0) = 200: 
In[1] := dg = NDSolve[(z"[t] == —0.1Sqrt[z'[t]^2 + y'[t]^2] x'[t], y"[t] == —10 
—0.1Sqrt[a’ [t]^2 + y'[t]^2] v'[t], α]0] υ[0] 0, x'[0] 100, y'[0] == 200}, 


{x,y}, {t, 15}] 


Mathematica gives the answer by the interpolating function: 
Out [1] = {{x—> InterpolatingFunction|(0.,15.], <>], 


y—> InterpolatingFunction[{0.,15.},<>]}} 


The solution 
In[2] := ParametricPlot({x{t], yt] /.dg, {t,0,2},PlotRange—> A11] 
is represented as a parametric curve (Fig. 19.224). 
NDSolve accepts several options which affect the accuracy of the result. 
The accuracy of the calculations can be given by the command AccuracyGoal. The command 


PrecisionGoal works similarly. During calculations, Mathematica works according to the so-called 
WorkingPre cision, which should be increased by five units in calculations requiring higher accuracy. 


The numbers of steps by which Mathematica works in the considered domain is prescribed as 500. In 
general, Mathematica increases the number of nodes in the neighborhood of the problematic domain. 
Inthe neighborhood of singularities it can exhaust the step limit. In such cases, it is possible to increase 
the number of steps by MaxSteps. It is also possible the prescribe Infinity for MaxSteps. 


Β The equations for the Foucault pendulum are: 
#(t) + wa (t) -209(t), YE) +w y(t) = -203(t). 
With w = 1, Q = 0.025 and the initial conditions z(0) = 0, y(0) = 10, z(0) = y(0) = 0 the solution is: 
In[3] := dg3 = NDSolvel{x" [t] == —a[t] + 0.05y'[t], y"[t] == —ylt] — 0.052" [t] 


x[0] == 0, y[0] == 10, x"[0] == v'[0] == 0}, {x, y}, {t, 0, 403] 
Out [3] = {{x—> InterpolatingFunction|(0.,40.), <>], 


y—> InterpolatingFunction[{0.,40.},<>]}} 
With 

In[4] := ParametricPlot|{x[t], y[t]}/.dg3, (t, 0, 40), AspectRatio— 1] 
one gets Fig. 19.22b. 


19.8.4.3 Maple 


The computer algebra system Maple can solve several problems of numerical mathematics with the use 
of built-in approximation methods. The number of nodes, which is required by the calculations, can 
be determined by specifying the value of the global variable Digits as an arbitrary n. But it should be 
kept in mind that selecting a higher n than the prescribed value results in a lower speed of calculation. 
1. Numerical Calculation of Expressions and Functions 

After starting Maple, the symbol “ Prompt " > is shown, which denotes the readyness for input. Con- 
nected in- and outputs are often represented in one row, separated by the arrow operator — . 
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Figure 19.22 


1. Operator evalf Numerical values of expressions containing built-in and user-defined functions 
which can be evaluated as a real number, can be calculated with the command 

evalf (expr, n). (19.293) 
expr is the expression whose value should be determined; the argument n is optional, it is for evaluation 
to n digits accuracy. Default accuracy is set by the global variable Digits. 
W Prepare a table of values of the function y = f(x) = yz + Inz. 
First, the function is defined by the arrow operator: 

> f:=z > sqrt(z) + ln(z); — f := z > vr + lnr. 
Then the required values of the function can be got with the command evalf(f());, where a numerical 
value should be substituted for x. 
A table of values of the function with steps size 0.2 between 1 and 4 can be obtained by 

> for x from 1 by 0.2 to 4 do print(f [2] = evalf(f(x),12)) od; 
Here, it is required to work with twelve digits. 
Maple gives the result in the form of aone-column table with elements in the form ft45; = 2.95200519181. 
2. Operator evalhf (expr): Beside evalf there is the operator evalhf. It can be used in a similar 
way to evalf. Its argument is also an expression which has a real value. It evaluates the symbolic 
expression numerically, using the hardware floating-point double-precision calculations available on 
the computer. A Maple floating-point value is returned. Using evalhf speeds up your calculations in 
most cases, but you lose the definiable accuracy of using evalf and Digits together. For instance in 
the problem in 19.8.2, p. 1004, it can produce a considerable error. 
2. Numerical Solution of Equations 
By using Maple equations or systems of equations can be solved numerically in many cases. 
The command to do this is fsolve. It has the syntax 

fsolve(eqn, var, option). (19.294) 


This command determines real solutions. If eqn is in polynomial form, the result is all the real roots. 
If eqn is not in polynomial form, it is likely that fsolve will return only one solution. The available 
options are given in Table 19.8. 


Table 19.8 Options for the command fsolve 


complex determines a complex root (or all roots of a polynomial) 

maxsols — n determines at least the n roots (only for polynomial equations) 

fulldigits ensures that fsolve does not lower the number of digits used during computations 
intervall looks for roots in the given interval 
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W A: Determination of all solutions of a polynomial equation x + 33? — 5 = 0. With 
> eq:=2°64342°2-5=0: 

the result is 
> fsolve(eq, x); — —1.074323739, 1.074323739 


Maple determined only the two real roots. With the option complex, also the complex roots are ob- 
tained: 


> fsolve(eq, x, complex); 
—1.074323739, —0.8672620244 — 1.1529220121, —0.8672620244 + 1.1529220121I, 
0.8672620244 — 1.152922012T, 0.8672620244 + 1.1529220121, 1.074323739 

W B: Determination of both solutions of the transcendental equation e? — 4r? -- 0. 
After defining the equation 

> eq := exp(-2^3) —4* 2^2 =0 
the result is 

> fsolve(eq, x); — 0.4740623572 
as the positive solution. With 

> fsolve(eq,x,x = —2..0); — —0.5412548544 
Maple also determines the second (negative) root. 
3. Numerical Integration 
The determination of definite integrals is often possible only numerically. This is the case when the 
integrand is too complicated, or if the primitive function cannot be expressed by elementary functions. 
The command to determine a definite integral in Maple is evalf: 


evalf(int(f(r),x =a..b),n). (19.295) 


Maple calculates the integral by using an approximation formula. 


PME 
> πμ αν : — eee 
W Calculation of the definite integral / e dx. Since the primitive function is not known, for the 
J-2 


integral command the following answer is got 


2 
> int(exp(—2^3),r = —2..2); d e^* ds. 
-2 


If 
> evalf(int(exp(—2"3),x = —2..2), 15); 
is typed,then the answer is 277.745841695583. 
Maple used the built-in approximation method for numerical integration with 15 digits. 
In certain cases this method fails, especially if the integration interval is too large. Then, another 
approximation procedure can be tried with the call to a library 
readlib()evalf/int'): 
which applies an adaptive Newton method. 
W The input 
> evalf(int(exp(—2^2),x = —1000..1000)); 
results in an error message. With 
> readlib( evalf/int'): 
> "evalf/int'(exp(—2^2),z = —1000..1000,10, NCrule); 
1.772453851 


the correct result is obtained. The third argument specifies the accuracy and the last one specifies the 
internal notation of the approximation method. 
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4. Numerical Solution of Differential Equations 
Ordinary differential equations can be solved with the Maple operation dsolve. However, in most cases 
it is not possible to determine the solution in closed form. In these cases, it can be solved numerically, 
where the corresponding initial conditions have to be given. 
In order to do this, the command dsolve is used in the form 

dsolve(deqn, var, numeric) (19.296) 
with the option numeric as a third argument. Here, the argument deqn contains the actual differential 
equation and the initial conditions. The result of this operation is a procedure, and if it is denoted, e.g., 
by f, for using the command f(t), the value of the solution function at the value t of the independent 
variable is returned. 
Maple applies the Runge-Kutta method to get this result (see 19.4.1.2, p. 969). The default accuracy for 
the relative and for the absolute error is 107 P852, The user can modify these default error tolerances 
with the global symbols , RELERR and , ABSERR. If there are some problems during calculations, then 
Maple gives different error messages. 
W At solving the problem given in the Runge-Kutta methods in 19.4.1.2, p. 970, Maple gives: 

> r:— dsolve({diff (y(x), x) = (1/4) * (z^2 + y(x)^2), y(0) = 0}, y(x), numeric); 

r :— proc " dsolve/numeric/result2' (x, 1592392, [1]) end 


With 
> r(0.5); — {x(.5) = 0.5000000000, y(x)(.5) = 0.01041860472} 


we can determine the value of the solution, e.g., at x = 0.5. 


20 Computer AlgebraSystems- Example 
Mathematica 


20.1 Introduction 
20.1.1 Brief Characterization of Computer Algebra Systems 
20.1.1.1 General Purpose of Computer Algebra Systems 


'The development of computers has made possible the introduction of computer algebra systems for 
“doing mathematics". They are software systems able to perform mathematical operations formally. 
These systems, such as Macsyma, Reduce, Derive, Maple, Mathematica, Matlab, Sage, can also be used 
on relatively small computers (PC), and with their help, we can transform complicated expressions, 
calculate derivatives and integrals, solve systems of equations, represent functions of one and of sev- 
eral variables variables graphically, etc. They can manipulate mathematical expressions, i.e., they can 
transform and simplify mathematical expressions according to mathematical rules if this is possible in 
closed form. They also provide a wide range of numerical solutions to required accuracy, and they can 
represent functional dependence between data sets graphically. 


Most computer algebra systems can import and export data. Besides a basic offer of definitions and 
procedures which are activated at every start of the system, most systems provide a large variety of 
libraries and program packages from special fields of mathematics, which can be loaded and activated 
on request (see [20.15],{20.16]). Computer algebra systems allow users to build up their own packages 
(20.11]- [20.14]. 

However, the possibilities of computer algebra systems should not be overestimated. They spare us 
the trouble of boring, time-demanding, and mechanical computations and transformations, but they 
do not save us from thinking. 

For frequent errors see 19.8.2, p. 1004. 


20.1.1.2 Restriction to Mathematica 
The systems are under perpetual developing. Therefore, every concrete representation reflects only a 
momentary state. Here we introduce the basic idea and applications of these systems for the most im- 
portant fields of mathematics. This introduction will help for the first steps in working with computer 
algebra systems. In particular, we discuss Mathematica compatible until Version 10. This system seems 
to be very popular among users, and the basic structures of the other systems are similar. 
In this book, we do not discuss how computer algebra systems are installed on computers. It is assumed 
that the computer algebra system has already been started by a command, and it is ready to commu- 
nicate by command lines or in a Windows-like graphical environment. 
The input and output is always represented for Mathematica (see 19.8.4.2, 1., p. 1016) in rows which 
are distinguished from other text parts, e.g., in the form 

In[1] :— Solve(3 x — 5 == 0, x] (20.1) 
System specific symbols (commands, type notation, etc.) will be represented in typewriter style. 
In order to save space, we often write the input and the output in the same row in this book, and we 
separate them by the symbol —. 


20.1.1.3 Two Introducing Examples of Basic Application Fields 


1. Manipulation of Formulas 

Formula manipulation means here the transformation of mathematical expressions in the widest sense, 
e.g., simplification or transformation into a useful form, representation of the solution of equations or 
systems of equations by algebraic expressions, differentiation of functions or determination of indefinite 
integrals, solution of differential equations, formation of infinite series, etc. 


© Springer-Verlag Berlin Heidelberg 2015 1023 
LN. Bronshtein et al., Handbook of Mathematics, 
DOI 10.1007/978-3-662-46221-8 20 


1024 20. Computer Algebra Systems- Example Mathematica 


W Solution of the following quadratic equation: 


x +axr+b=0 with a,bER. (20.2a 
In Mathematica, one types: 
Solve[z? +a x +b == 0, zx]. (20.2b 


After pressing the corresponding input key/keys (ENTER or SHIFT+RETURN, depending on the 
operation system), Mathematica replaces this row by 


In[1] :=Solve[x? +a a+b —— 0, x] (20.2c 
and starts the evaluation process. In a moment, the answer appears in a new row 


ÜOut[1] = ((x >i a — Va? 4b) }, {x T a+ va? 4b) }}. (20.2d 


Mathematica has solved the equation and both solutions are represented in the form of a list consisting 
of two sublists. 


2. Numerical Calculations 
Computer algebra systems provide many procedures to handle numerical problems of mathematics. 
These are solutions of algebraic equations, linear systems of equations, the solutions of transcendental 
equations, calculation of definite integrals, numerical solutions of differential equations, interpolation 
problems, etc. 
W Problem: Solution of the equation 

αὖ — 23? — 30x* + 362? + 1902? — 36x — 150 = 0. (20.3a) 
Although this equation of degree six cannot be solved in closed form, it has six real roots, which are to 
be determined numerically. 


In Mathematica the input is: 
In[1] := NSolve[x® — 21? — 30" + 362? + 1902? — 36x — 150 == 0, x] (20.3b) 
It results in the answer: 


Üut[1] = ((x— —4.42228}, (x— —2.14285}, {α--» —0.937347), (r— 0.972291}, 
[z— 3.35802}, {α--» 5.17217}} (20.3c) 


This is a list of the six solutions with a certain accuracy which will be discussed later. 


20.2 Important Structure Elements of Mathematica 


Mathematica is a computer algebra system, developed by Wolfram Research Inc. A detailed description 
of Mathematica can be found in [20.11]-[20.16]. For the current version 10 see the Virtual Book in the 
online Help. 


20.2.1 Basic Structure Elements of Mathematica 


In Mathematica the basic structure elements are called expressions. Their syntax is (it is emphasized 
again, that the current objects are given by their corresponding symbol, by their names): 


objy|Obj,, objy,..., obj,,] (20.4) 
obj, is called the head of the expression; the number 0 is assigned to it. The parts obj; (i = 1,...,m) 
are the elements or arguments of the expression, and one can refer to them by their numbers 1,...,n. 


In many cases the head of the expression is an operator or a function, the elements are the operands or 
variables on which the head acts. 

Also the head, as an element of an expression, can be an expression, too. Square brackets are reserved 
in Mathematica for the representation of an expression, and they can be applied only in this relation. 
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E The term 2*2+2%2+1, which can also be entered in this infix form (and also in the nicer, preferred 
form x? + 2x + 1) in Mathematica, has the complete form (FullForm) 


Plus[l, Times[2, x], Power v, 2]] 
which is also an expression. Plus, Power and Times denote the corresponding arithmetical operations. 
'The example shows that all simple mathematical operators exist in prefix form in the internal repre- 
sentation, and the term notation is only a facility in Mathematica. 
Parts of expressions can be separated. This can be done with Part [expr, i], where i is the number of 
the corresponding element. In particular, 7 = 0 gives back the head of the expression. 
E If entering in Mathematica 
In[1] := 2? + 2z + 
then after the SHIFT and ENTER keys together are pressed, Mathematica answers 
Out [1] = 1 -- 21 + i? 
Mathematica analyzed the input and returned it in mathematical standard form. If the input had been 
terminated by a semicolon, then the output would have been suppressed. 
If entering 
In[2] := FullForm|%] 
then the answer is 
Out [2] = Plus[l, Tines[2, x], Power v, 2]] 
The sign 96 in the square brackets tells Mathematica that the argument of this input is the last output. 
From this expression it is possible to get, e.g., the third element 
In[3] := Part|%,3] as Out[3] =x’ 


which is again an expression in this case. 


Symbols in Mathematica are the notation of the basic objects; they can be any sequence of letters and 
numbers but they must not begin with a number. The special sign $ is also allowed. Upper-case and 
lower-case letters are distinguished. Reserved symbols begin either with a capital letter, or with the 
sign $, and in compound words also the second word begins with a capital letter, if it has a separate 
meaning. Users are advised to create their own symbols starting with lower-case letters. 


20.2.2 Types of Numbers in Mathematica 
20.2.2.1 Basic Types of Numbers 


Mathematica knows four types of numbers represented in Table 20.1. 


Table 20.1 Types of numbers in Mathematica 


Type of number | Head Characteristic Input 
Integers Integer | exact integer, arbitrarily long nnnnn 
Rational numbers | Rational | fraction of coprimes in form Integer/Integer ppypp/qqqq 
Real numbers Real floating-point number, arbitrary given precision | nnnn.mmmm 
Complex numbers | Complex | complex number in the form number+number «1 


Real numbers, i.e., floating-point numbers, can be arbitrarily long. If an integer nnn is written in the 
form nnn., then Mathematica considers it as a floating-point number, that is, of type Real. 

The type of a number x can be determined with the command Head{x]. Hence, In[1] :— Head[51 
results in Out [1] = Integer, while In[2] := Head[51.] Out [2] = Real. The real and imaginary 
components of a complex number can belong to any type of numbers. A number such as 5.731 + 0I is 
considered as a Real type by Mathematica, while 5.731 + 0. Lis of type Complex, since 0. is considered 
as a floating-point approximation of 0. 
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There are some further operations, which give information about numbers. So, 

In[3] := NumberQ[51] results in Out [3] = True, (20.5a 
Otherwise, if x is manifestly not a number, as e.g. x = 7, then the output is Out [3] =False. However, 
NumericQ[7] gives True. Here, True and False are the symbols for Boolean constants. 
IntegerQ[r]tests if x is an integer, or not, so 

In[4] := IntegerQ2.] —> Out[4] = False (20.5b 
Similar tests can be performed for numbers with heads EvenQ, OddQ and PrimeQ. Their meanings are 
obvious. So, one gets 

In[5] := PrimeQ[1075643] —+ Out[5] = True (20.5c 
while 

In[6] := PrimeQ[1075641] —+ Out[6] = False (20.5d 
These last tests belong to a group of test operators, called predicates or criteria, which all end by Q and 
always answer True or False in the sense of a logical test (including a type check). 


20.2.2.2 Special Numbers 


In Mathematica, there are some special numbers which are often needed, and they can be called with 
π 


1805 
factor from degree measure into radian measure with the constant Degree, Infinity as the symbol for 
oo and the imaginary unit I. 


as the transformation 


arbitrary accuracy. They include x with the symbol Pi, e with the symbol E, 


20.2.2.3 Representation and Conversion of Numbers 
Numbers can be represented in different forms which can be converted into each other. So, every rea 
number z can be represented by a floating-point number N[z, n] with an n-digit precision. 

IN[1]:— N[E, 20] yields —+ Οωτ[1] = 2.7182818284590452354 (20.6a 
With Rationalize|x, dx], the number x with an accuracy dz can be converted into a rational number, 
i.e., into the fraction of two integers. 

. aT a 1071 

In[2] := Rationalize[%, 10 5] —5 Οωί [2] = 394 (20.6b 
With 0 accuracy, Mathematica gives the possible best approximation of the number x by a rationa 
number. 

Numbers of different number systems can be converted into each other. With BaseForm[r, b], the num- 
ber Ὁ given in the decimal system is converted into the corresponding number in the number system 


with base b < 36. If b > 10, then the consecutive letters of the alphabet a,b, c,... are used for the 

further digits having a meaning greater than ten. 

W A: In[1] := BaseForm|255, 16] — UDut[1]—ffi (20.7a) 
In[2] := BaseForm|N[E, 10], 8] ---- Out [2] = 2.5576052131, (20.7b) 

The reversed transformation can be performed by b^^mmmn. 

E B:In[1]:— 8^^735 --» Out[1] = ATT (20.7) 


Numbers can be represented with arbitrary precision (the default here is the hardware precision), and 
for large numbers so-called scientific form is used, i.e., the form n.mmmm10^ + qq. 


20.2.3 Important Operators 
Several basic operators can be written in infix form (as in the classical form in mathematics) 
< symb; op symb >. However, in every case, the complete form of this simplified notation is the 
expression op[symb;, symb»]. The most often occurring operators and their complete form are col- 
lected in Table 20.2. Most symbols in Table 20.2 are obvious. For multiplication in the form a b, the 
space between the factors is very important. 
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Table 20.2 Important Operators in Mathematica 


a+b Plus|a, b] u-—-v Equal[u, v] 
abora*xb  Times[a, b] wl-v Unequal[w, v] 
a^borea^ Ῥοποτ[α, b] r>t Greater|r, t] 

a/b Times|a, Power|b, —1]] r2t GreaterEqual[r, t] 
u— v Rule|u, v] s<t Less|s, t] 

r=s Set|[r, s] st LessEqual|s, t] 


The expressions with the heads Rule and Set will be explained. Set assigns the value of the expression 
s on the right-hand side, e.g., a number, to the expression r on the left-hand side, e.g., a variable. 
From here on, r is represented by this value until this assignment is changed. The change can be done 
either by a new assignment or by x = . or by Clear[r], i.e., by releasing every assignment so far. The 
construction Rule should be considered as a transformation rule. It occurs together with /. which is 
the substitution operator. 
Replace|t, u — v] or t/. u—» v means that every occurance u in the expression t will be replaced by 
the expression v. 

WB [1]:-x-y /.y> a+b > Dut[1] = (a+b)? +a 
It is typical in the case of both operators that the right-hand side is evaluated immediately after the 
assignment or transformation rule. So, the left-hand side will be replaced by this evaluated right-hand 
side at every later call. 


Here, two further operators have to be mentioned with delayed evaluation. 
The FullForm of u:=wvis SetDelayed[u,v| and (20.8a) 
the FullFormofu: — v is RuleDelayed[u, v] (20.8b) 
The assignment or the transformation rule are also valid here until it is changed. Although the left- 
hand side is always replaced by the right side, the right-hand side is evaluated for the first time only at 
the moment when the left one is called. 
The expression u == v or Equal[u, v] returns True if u and v are identical. Equal is used, e.g., in 
manipulation of equations. 


20.2.4 Lists 
20.2.4.1 Notions 


Lists are important tools in Mathematica for the manipulation of whole groups of quantities, which are 
important in higher-dimensional algebra and analysis. 

A list is a collection of several objects into a new object. In the list, each object is distinguished only 
by its place in the list. The construction of a list is made either by the command 


List[al, a2, a3, ...] or by (a1, a2, a3, ...} (20.9) 
if the elements can be simply enumerated. To explain the work with lists, a particular list is used, 
denoted by /1: 

In[1] := 11 = List[al, a2, a3, a4, a5, a6] —5 Οω{ [1] = (al, a2, a3, a4, a5, a6) (30.10) 
Mathematica applies a short form to the output of the list: It is enclosed in curly braces. 

Table 20.3 represents commands which choose one or more elements from a list, and the output is a 
“sublist”. 
Β For the list /1 in (20.9) one gets, e.g., 

In[2]:— First|l] —> Out[2] — al In[S]:—ll[[] — Out[3] = a3 

In[4] := [1{{2, 4, 6)] —> Οωί [4] = (a2, a4, a6} 

In[5]:— Take|ll, 2] —> Out[5] = (al, a2} 
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Table 20.3 Commands for the choice of list elements 


First|l| Last] selects the first/last element 

Most[/], Rest |I| selects the elements except the last /first one 

Part[l, n] or I([n]] selects the n-th element 

Part|l (nl, n2, ...)] gives a list of the elements with the given numbers 
l[[(n1, n2, ...)]] equivalent to the previous operation 

Take[l, m] gives the list of the first m elements of l 

Take|l, (m, n}] gives the list of the elements from m through n 
Dropll, n] gives the list without the first n elements 

Drop|l, (mn, n}] gives the list without the elements from m through n 


20.2.4.2 Nested Lists 
The elements of lists can also be lists, so nested lists can be obtained. If entering, e.g., for the elements 
of the previous list {1 (20.10) 

In[1]:— al = (b11, 012, 013, 014, 015} 

In[2] := a2 = (021, 022, 023, b24, b25} 

In[3] := a3 = (031, b32, b33, b34, b35} 
and analogously for a4, a5 and a6, then because of (20.10) a nested list (an array) is obtained which 
is not shown here explicitly. One can refer to the j-th element of the i-th sublist with the command 
Part[(l, i, j]. The expression /|[;, ]] has the same result. In the above example (p. 1027), e.g., 

In[4] := l1|[3, 4] yields Out [4] = 634 


Furthermore, Part[l, (i1, i2 ...}, (j1, 72 ...}] or {{{{1, i2, ...}, (j1, 72, ...}]] results in a list con- 
sisting of the elements numbered with j1, j2... from the lists numbered with i1, 72, . . .. 


W For the above example in 20.2.4.1, p. 1027 
In[1] := [1[{8, 5}, (2, 3, 44]] --» Outl1] = {{b32, 033, b34}, (552, 053, 054] ] 
'The idea of nesting lists is obvious from these examples. It is easy to create lists of three or higher 
dimensions, and it is easy to refer to the corresponding elements. 
20.2.4.3 Operations with Lists 


Mathematica provides several further operations by which lists can be monitored, enlarged or shortened 
(Table 20.4). 


'Table 20.4 Operations with lists 
Position|, a] gives a list of the positions where a occurs in the list 
MemberQ|/, a] checks whether a is an element of the list 
Select]|l, crit] picks out all elements of the list for which crit holds 
Cases|l, pattern] gives a list of elements which match the pattern 
FreeQ|I, a] checks if a does not occur in the list 
Prepend|, a] changes the list by adding a to the front 
Append}, a] changes the list by appending a to the end 
Insert|, a, i] inserts a at position ἡ in the list 
Delete|l, {i, j,...}] delete the elements at positions i, j, . . . from the list 
ReplacePart|l, a, i| replace the element at position i by a 


W With Delete, the original list [1 (20.10) can be shortened by the term a6: 
In[1] := 12 = Delete(ll, 6] —> Out[1] = (a1, a2, a3, a4, a5}, 
where in the output the ai are shown by their values — they are lists themselves. 
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20.2.4.4 Tables 


In Mathematica, several operations are available to create lists. One of them, which often occurs in 
working with mathematical functions, is the command Table shown in Table 20.5. 


Table 20.5 Operation Table 


Table[f, {imaz}| creates a list with imax values of f:f(1), f(2),..., f (imax) 
Table|f, (i, imin, imax}] creates a list with values of f from imin to imax 


Table|f, (i, imin, imax, di] the same as the last one, but by steps di 


W Table of binomial coefficients for n = 7: 
In[1] := Table[Binomial[T. i], {i, 0, 7}] —5  Out[1] = (1, 7, 21, 35, 35, 21, 7, 1} 
With Table, also higher-dimensional arrays can be created. With the expression 
Table|[f, (i, i1, i2}, (5, j1, 73}, ...] 
a higher-dimensional, multiple nested table is obtained, i.e., entering 
In[2] := Table[Binomial[i, j], (i, 1, 7}, {7. 0, i} 
the binomial coefficients are got up to degree 7: 
Out [2] = {{1, 1}, (1, 2, 1}, (1, 3, 3, 1}, (1, 4, 6, 4, 1}, 
(1, 5, 10, 10, 5, 1}, (1, 6, 15, 20, 15, 6, 1}, (1, 7, 21, 35, 35, 21, 7, 1}} 
The operation Range produces a list of consecutive numbers or equally spaced numbers: 
Range[n] yields the list {1, 2,..., n] 
Similarly, Range[n1l, n2] and Range[nl, n2, dn] produce lists of numbers (arithmetic sequences) from 


n1 to n2 with step-size 1 or dn respectively. The command Array uses functions (as opposed to function 
values used by Table) to create lists. Array [Exp,5] yields (e, e?, &?, et, εὖ} 


20.2.5 Vectors and Matrices as Lists 
20.2.5.1 Creating Appropriate Lists 


Several special (list) commands are available for defining vectors and matrices. Α one-dimensional list 
of the form 

v = (v1, v2, ..., un} (20.11) 
can always be considered as a vector in n-dimensional space with components v1, v2,..., vn. The spe- 
cial operation Array[v, n] produces the list (the vector) {v[1], v[2],...,v[n]}. Symbolic vector opera- 
tions can be performed with vectors defined in this way. 
The two-dimensional lists {1 (see 20.2.4.2, p. 1028) and {2 (see 20.2.4.3, p. 1028) can be considered as 
matrices with rows i and columns j. In this case bij would be the element of the matrix in the i-th row 
and the j-th column. A rectangular matrix of type (6,5) is defined by /1, and a square matrix of type 
(5,5) by 13. 
With the operation Array|b, (n, m}] a matrix of type (n, m) is generated, whose elements are denoted 
by bli, j]. The rows are numbered by i, i changes from 1 to n; by j the columns are numbered from 1 
to m. In this symbolic form [1 can be created as 


Π = Array[b, (6, 5], (20.12a) where b[i, j] = bij (i—1,...,6; j — L,...,5). (20.125) 


Summarizing, lists can be created either by enumeration or by using the functions Array, Range, 
Table. Note that lists are different from sets in mathematics. 

The operation IdentityMatrix[n] produces the n-dimensional unit matrix. 

With the operation DiagonalMatrix|list] a diagonal matrix is produced with the elements of the list 
in its main diagonal. 
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The operation Dimension|list] gives the size (number of rows, columns, ...) of a matrix, whose struc- 
ture is given by a list. Finally, with the command MatrixForn([list]|, one gets a matrix-type represen- 
tation of the list. A further possibility to define matrices is the following: Let f(i,j) be a function 
of integers i and j. Then, the operation Table[f|i j], (i, n), {j, m}] defines a matrix of type (n, m), 
whose elements are the corresponding f (i, j) 


20.2.5.2 Operations with Matrices and Vectors 


Mathematica allows formal manipulation of matrices and vectors. The operations given in Table 20.6 
can be applied. 


Table 20.6 Operations with matrices 


ca matrix a is multiplied by the scalar c 
a.b the product of matrices a and b 
Det[a] the determinant of matrix a 
Inverse|[a] the inverse of matrix a 

Transpose|a| the transpose of matrix a 
MatrixExpl|a| the exponential function of matrix a 
MatrixPower|a, n| the n-th power of matrix a 
Eigenvalues|a] the eigenvalues of matrix a 
Eigenvectors|a] the eigenvectors of matrix a 


WA: In[1]:— r = Array[a, {4,4}] —> Out[1] = { { afl, 1], afl, 2], a[l, 3], afl, 4] }, 
{ α[2, 1], α[2, 2], a2, 3], a|2, 4] }, 

ἡ af3 1), a3, οἱ αἴ, 3), αἴ. ἡ 1 

{ald 1]. ald, 2], ald, 3] ald ἡ Ἡ 

1η [2] := Transpose[r| — Οωί [2] = { { απ, 1], a[2, 1], a[3, 1], a[4, 1] y, 

{ a[l, 2], a[2, 2], a[3, 2], α[4, 2] }, 

{ all, 3} af, 3} al, 3); ald, 9 F 

fall, 4]. αρ. 4]. alB, 4 ald, 4] Ἡ 


Here, the transpose matrix r! of r is produced. 
Let the general four-dimensional vector v be defined by 
In[3]:— v = Array{u, 4} —> Οωί [3] = {u[l], u[2], u[3], u[4]) 


Now, the product of the matrix r and the vector v is again a vector (see Calculations with Matrices, 
4.1.4, p. 272). 


In[4]:— rv — Out [4] = { afl, 1]u[1] + afl, 2] u[2] + a[1, 3] u[3] + a[1, 4] u[4] , 
a|2, 1|u|1] + α[2, 2] u[2] + a[2, 3| u[3] + a[2, 4] u[4] , 
α|3, 1} u|1] + a[3, 2] u[2] + a[3, 3| u[3] + a[3, 4] u[4] , 
al4, 1| u|1] + αμ. 2] u[2] + a[4, 3| u[3] + a[4, 4] u[4] i, 


There is no difference between row and column vectors in Mathematica. In general, matrix multiplica- 
tion is not commutative (see Calculations with Matrices 4.1.4, p. 272). The expression r. v corresponds 
to the product in linear algebra when a matrix is multiplied by a column vector from the right, while 
v.r means a multiplication by a row vector from the left. 


W D: In the section on Cramer's rule (4.5.2.3, p. 311) the linear system of equations pt = b is solved 
with the matrix 


2 13 
In[1] := MatrixForm|p = {{2, 1, 3}, (1, -2 1}, (3, 2 2}}} — Out[1] = (: -2 1 


3 22 
and vectors 
In[2] := t = Array|x, 3] —5 Ουἱ [2] = {2[1], x[2], 2[3]} 


In[3] := b= {9, —2, 7} > Out[3] = (9, —2, 7}. 
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Since in this case Det [p]==13 4 0 holds, the system can be solved by t = p~'b. This can be done by 
In[4] := Inverse[p.b with the output of the solution vector Out[4] = (—1, 2, 3}. 

Note that a b calculates the componentwise product and Exp[a] gives the matrix containing the exp 

of the components of the matrix a. 

20.2.6 Functions 

20.2.6.1 Standard Functions 

Mathematica knows several standard mathematical functions, which are listed in Table 20.7. 


Table 20.7 A few standard functions 


Exponential function Exp|x] 
Logarithmic functions ^ Logl|x], Log[b,x] 
Trigonometric functions | Sin|x|, Cos|x], Tan[x], Cot[x], Sec[x], Csc[x| 


Arc functions ArcSin|x], ArcCos|x], ArcTan|x], ArcCot[x], ArcSec[x], ArcCsc[x] 
Hyperbolic functions Sinh|x|, Cosh[x], Tanh|x], Coth|x], Sech|x], Csch|x] 
Area functions ArcSinh|x], ArcCosh[x], ArcTanh[x], ArcCoth|[x|, ArcSech|x], ArcCsch[x] 


All these functions can be applied even with complex arguments. 

In every case must be considered the single-valuedness of the functions. For real functions one branch 
of the function has to be chosen (if it is needed); for functions with complex arguments the principal 
value (see 14.5, p. 758) should be chosen. 

20.2.6.2 Special Functions 

Mathematica knows several special functions, which are not standard functions. Table 20.8 lists some 
of these functions. 


Table 20.8 Special functions 


Bessel functions J,,(z) and Y;,(z) Bessel J[n,z|, BesselY[n,z] 
Modified Bessel functions [μ(2) and K,(z) Besse1l1[n,z|], BesselK[n,z| 

Legendre polynomials P, (a) LegendreP|[n.x] 

Spherical harmonic Y/" (0, φ) SphericalHarmonicyY|/,m, 0, ¢] 


Further functions can be loaded with the corresponding special packages of Mathematica 


20.2.6.3 Pure Functions 


Mathematica supports the use of so-called pure functions. A pure function is an anonymous function, 
an operation with no name assigned to it. They are denoted by Function|x, body]. The first argumen: 
specifies the formal parameters and the second one is the body of the function, i.e., body is an expression 
for the function of the variable 2. 


In[1]:— Function|r, i? - z?| — Out[1] = Function|z, 2° + αἱ] (20.13 
and so 
In[2] := Function|z, i? - z?]||] gives Οωἑ [9] = ο) +c’. (20.14 


We can use a simplified version of this command. It has the form body &, where the variable is denoted 
by # . Instead of the previous two rows one can also write 

In[3]:— (# + #)&[q Out[3] 2 C +e’. (20.15 
It is also possible to define pure functions of several variables: 
Function| {2,22,...}, body] or in short form body &, where the variables in body are denoted by the 
elements #1, #2,.... The sign & is very important for closing the expression, since it can be seen from 
this sign that the previous expression should be considered as a pure function. Let us remark that the 


1032 20. Computer Algebra Systems- Example Mathematica 


pure function #& is nothing else than the identity function: to any argument z it assigns x. Similarly, 
#1& corresponds to the projection onto the first coordinate axis. 


20.2.7 Patterns 


Mathematica allows users to define their own functions and to use them in calculations. With the 
command 

In[1] := £[x_] := Polynomial[x] (20.16) 
with Polynomial(x) as an arbitrary polynomial of variable x, a special function is defined by the user. 
In the definition of the function f, there is no simple x, but x: (pronounced x-blank) with a symbol _ 
for the blank. The symbol r means “something with the name 2”. From here on, every time when 
the expression f£ [something] occurs, Mathematica replaces it by its definition given above. This type of 
definition is called a pattern. The symbol blank denotes the basic element of a pattern; y. stands for 
y as a pattern. It is also possible to apply in the corresponding definition only a * ", that is y^ . This 
pattern stands for an arbitrary power of y with any exponent, thus, for an entire class of expressions 
with the same structure. 


The essence of a pattern is that it defines a structure. When Mathematica checks an expression with 


respect to a pattern, it compares the structure of the elements of the expression to the elements o 
the pattern, Mathematica does not check mathematical equality! This is important in the following 
example: Let l be the list 

In[2]:— | — {1, y, y^, y, {4ly/4, 29) (20.17 
If one writes 

In[3]:— |[.y^ — yes (20.18 
then Mathematica returns the list 

Out [3] = {1, y, yes, yes, {flyes], 273) (20.19 


Mathematica checked the elements of the list with respect to its structural identity to its pattern y^ - 
and in every case when it determined coincidence it replaced the corresponding element by yes. The 
elements 1 and y were not replaced, since they have not the given structure, even though y? = 1, y! = y 
holds. 


Remark: Pattern comparison always happens in FullForm. If 

In[4] := b/y [. y^. — yes isexamined then Qut [4] = b yes (20.20) 
This is a consequence of the fact that FullForm of b/y is Times[b, Power[y, —1]], and for structure 
comparison the second argument of Times is identified as the structure of the pattern. 
With the definition 


In[5]:— f|[r] : (20.21a 
Mathematica replaces, corresponding to the given pattern, 

In[6] — f[r|] by Οωε[6] -- τὸ ete. (20.21b 

In[7]:— f|a] + f[rv| yields Out [7] = αὖ + a? (20.21c 
If however, 

In[8]:— f|x] := x°, then for the same input In[9]:— f[a] + f[a] (20.21d 
the output would be 

Out [9] = £[a] + a? (20.21e 


In this case only the (fixed, single) input x corresponds to the definition. 


20.2.8 Functional Operations 


Functions operate on numbers and expressions. Mathematica can also perform operations with func- 
tions, since the names of functions are handled as expressions so they can be manipulated as expressions. 


20.2 Important Structure Elements of Mathematica 1033 


1. Inverse Function, Inverse Series The determination of the inverse function of a given function 
f can be made by the functional operation InverseFunction or InverseSeries. 
ΒΑ: In[1] := InverseFunction|/f]|Y| —9 Out[1] = f~ [zx] 
E B: In[1]:— InverseFunction[Exp --» Out[1] = Log 
E C: In[1] := InverseSeries|Series|(g[z], {x,0, 2}]] 
x—9[0] σ"Ὀ](α — g[0]? : 

Out [1] = E. Nd e D' + Ofe — gioj? 
2. Differentiation Mathematica uses the possibility that the differentiation of functions can be 
considered as a mapping in the space of functions. In Mathematica, the differentiation operator is 
Derivative|l]||f] or in short form f’. If the function f is defined, then its derivative can be got by τ’. 
B ^ In[1]:— f|x] := Sin[r| Cos[r|] With 

In[2]:— f' follows Out [2] =Cos[#1]? — Sin[#1? &, 
hence f’ is represented as a pure function and it evaluates to 

1π[3] :=%[x] —5 Out[3] = Cos|x] — 81π[α]} 
3. Nest The command Nest[f, x, n] means that the function f nested n times into itself should be 
applied on x. The result is £[£[. . . £[2]] . . .]. 
4. NestList By NestList[f, x, n] a list (x, £[x], £[£[v]], ...} will be shown, where finally f is nested 
n times. FoldList|/, x, list] iterates a two-variable function. 
5. FixedPoint For FixedPoint[f, τ], the function is applied repeatedly until the result does not 
change. 
6. FixedPointList The functional operation FixedPointList[f, x] shows the continued list with 
the results after f is applied, until the value no longer changes. 


W As an example for this type of functional operation the NestList operation will be used for the 
approximation of a root of an equation f(x) = 0 with Newton’s method (see 19.1.1.2, p. 950). A root 
of the equation x cos x = sin z is needed in the neighborhood of 37/2: 


In[1]:— f|[r]:— x — Tanje] In[2]:— Επ] — Out[2] — 1 — Sec[z]? 

In[3] = g[r]:- x — £[x]/£'[x] 

In[4] :— NestList|g, 4.6, 4] —> Out[4] = (4.6, 4.54573, 4.50615, 4.49417, 4.49341} 
In[5]:— FixedPoint[g, 4.6] —> Οωί [5] = 4.49341 


A higher precision of the result can also be achieved. 
7. Apply Let f be a function which is considered in connection with a list fa, b, c, ...). Then 
Apply|/ {α, b, c, ...)]] fla, b, ¢,...] (20.22) 
E = Inf1] := hpply[Plus, {u, v, w]] —> Out[1]=u+ v+ w 
In[2] := Apply|List, a+ b+ c) —>+ Out[2] = (a, b, c} 
Here, the general scheme of how Mathematica handles expressions of expressions can be easily recog- 
nized. The FullForm of the last operation is: 
In[3] := FullForm|Apply|List, Plus[a, b, c]]] —+ Οωί [53] = Lista, b, c] 
The functional operation Apply obviously replaces the Head of the considered expression Plus by the 
required List. 
8. Map With a defined function f the operation Map gives: 
Mapl/, {a, b, ο... J) — Ulal fI fid...) (20.23) 
Map generates a list whose elements are the values when f is applied to the original list. 
E Let f be the function f(x) = ο’. It is defined by 
In[1] := f[x_] := x? With this f one gets 
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In[2] :=Map|f, (u, v, w}] —> Out [2] = {u?, v’, w} 
Map can be applied for more general expressions: 
In[3] := Map[f, Plus[a, b, ο] —> Out [8] -- αὖ +P +e 


20.2.9 Programming 


Mathematica can handle the loop constructions known from other languages for procedural program- 
ming. The two basic commands are 


Dolexpr, (i, 11, 12, di}| and (20.24a) 
While[test, expr] (20.24b) 


The first command evaluates the expression expr, where i runs over the values from 71 to 12 in steps di. 
If di is omitted, the step size is one. If i1 is also missing, then it starts from 1. 
The second command evaluates the expression so far as test has the value True. 
Β In order to determine an approximate value of e?, the series expansion of the exponential function 
is used: 
In[1] := sum = 1.0; 
Do[sum = sum + (2^i/i!), {1, 1, 10}]; 
sum 
Out [1] = 7.38899 
The Do loop evaluates its argument a previously given number of times, while the While loop evaluates 
as far as a previously given condition becomes false. 
Among other things, Mathematica provides the possibility of defining and using local variables. This 
can be done by the command 
Module|(t1, ¢2,...}, procedure] (20.26) 
The variables or constants enclosed in the list are locally usable in the module; their values assigned 
here are not valid outside of this module. 


(20.25) 


W A: A procedure is to be defined which calculates the sum of the square roots of the integers from 1 
ton. 
In[1] := sumq[n ] := 
Module[{sum = 1.}, 
Do[sum = sum + N [Sqrt [i|], (1,2, n3]; 
sum |; 


The call sumq[30] results in 112.083. 


The real power of the programming capabilities of Mathematica is, first of all, the use of functional 
methods in programming, which are made possible by the operations Nest, NestWhile, Apply, Map, 
MapThread, Distribute and by some further ones. 


(20.27) 


W B: Example A can be written in a functional manner for the case when an accuracy of ten digits is 
required: 

sumdq|n.] :— N[Apply[Plus, Table[Sqrt [i], (4, 1, n}]], 10], 
sumq[30] results in 112.0828452. Total|,/N{Range[n], 10]] gives the same result without using an index, 


increasing its value continuously, without needing the variable sum and its initial value. 
For the details, see [20.16]. 
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20.2.10 Supplement about Syntax, Information, Messages 
20.2.10.1 Contexts, Attributes 


Mathematica must handle several symbols; among them there are those which are used in further pro- 
gram modules loaded on request. To avoid many-valuedness, the names of symbols in Mathematica 
consist of two parts, the context and the short name. 


Short names mean here the names (see 20.2, p. 1024) of heads and elements of the expressions. In 
addition, in order to name a symbol Mathematica needs the determination of the program part to which 
the symbol belongs. This is given by the context, which holds the name of the corresponding program 
part. The complete name of a symbol consists of the context and the short name, which are connected 
by the’ sign. 

When Mathematica starts, then there are always two contexts present: System’ and Global’. Informa- 
tion about other available program modules can be obtained by the command Contexts|]. 


All built-in functions of Mathematica belong to the context System’, while the functions defined by the 
user belong to the context Global. 


If a context is actual, thus, the corresponding program part is loaded, then the symbols can be referred 
to by their short names. 

For the input of a further Mathematica program module by << NamePackage, the corresponding con- 
text is opened and introduced into the previous list. It can happen that a symbol has already been 
introduced with a certain name before this module is loaded, and in this newly opened context the 
same name occurs with another definition. In this case Mathematica gives a warning to the user. Then 
the previously defined name can be erased by the command Remove|[G1obal'/name], or the complete 
name for the newly loaded symbol can be applied. 

Besides the properties that the symbols have per definition, it is possible to assign to them some other 
general properties, called attributes, like Orderless, i.e., unordered or commutative, Protected, i.e., 
values cannot be changed, or Locked, i.e, attributes cannot be changed, etc. Informations about the 
already existing attributes of the considered object can be obtained by Attributes[/]. 

Some symbols can be protected by Protect[somesymbol]; then no other definition can be introduced 
for this symbol. This attribute can be erased with the command Unprotect. 


20.2.10.2 Information 


Information can be obtained about the fundamental properties of objects by the commands 


?symbol information about the object given by the name symbol, 

??symbol | detailed information about the object, 

?Bx information about all Mathematica objects, whose names begin with Β. 

It is also possible to get information about special operators, e.g., by ? :— about the SetDelay operator. 
However, the most useful possibility is to put the cursor anywhere in the cell containing the symbol of 
the object in question and press the key F1. 


20.2.10.3 Messages 


Mathematica has a message system which can be activated and used for different reasons. The messages 
are generated and shown during the calculations. Their presentation has a uniform form: symbol :: 
tag, providing the possibility to refer to them later. (Such messages can also be created by the user.) 
Consider the following examples as illustrations. 


WB A: In[1]:— f£[z] :— 1/z; In[2] := £[0] 
Power: :infy:Infinite expression Π encountered. 
Out [2] = ComplexInfinity 

WB B: In[1] := Log[3, 16, 25] 


Log: :argt:Log called with 3 arguments; 1 or 2 arguments are expected. 
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Out [1] —Log[3, 16, 25] 

Inexample A, Mathematica warns us that during the evaluation of an expression it got the value oo. The 
calculation itself can be performed. In example B the call of logarithm contains three arguments, which 
is not allowed according to the definition. Calculations cannot be performed. Mathematica cannot do 
anything with the expression. The user can switch off a message with Off[s :: tag]. With On the 
message will appear again. Quiet switches off all the messages. 

With Messages|[syrnbol] all messages associated to the symbol with the name symbol can be recalled. 


20.3 Important Applications with Mathematica 

This section describes how to handle mathematical problems with computer algebra systems. The 
choice of the considered problems is organized according to their frequency in practice and also accord- 
ing to the possibilities of solving them with a computer algebra system. Examples will be given for 
functions, commands, operations and supplementary syntax. When it is important, the corresponding 
special package is also discussed briefly. 


20.3.1 Manipulation of Algebraic Expressions 


In practice, further operations must usually be performed with the occurring algebraic expressions (see 
1.1.5, p. 10) such as differentiation, integration, series representation, limiting or numerical evaluation, 
transformations, etc. In general, these expressions are considered over the ring of integers (see 5.3.7, 
p. 361) or over the field (see 5.3.7.1, 2., p. 361) of real numbers. Computer algebra systems can handle, 
e.g., polynomials also over finite fields or over extension fields (see 5.3.7.1, 3., p. 362) of the rational 
numbers. Interested people should study the special literature. The algebraic operations with polyno- 
mials over the field of rational numbers have special importance. Mathematica provides the functions 


and operations represented in Table 20.9 for transformation of algebraic expressions. See also the 
menu item P. er|Algebraic Manipulation. 
'Table 20.9 Commands for manipulation of algebraic expressions 
Expand|[p] expands the powers and products in a polynomial p by multiplication 
Expand|p, r] multiplies only the parts in p, which contain r 
PowerExpand([a] expands also the powers of products and powers of powers 
Factor |p] factorizes a polynomial completely 
Collect|p, x] orders the polynomial with respect the powers of x 
Collect|p,[z,y,...]] the same as the previous one, with several variables 
ExpandNumerator[r| expands only the numerator of a rational expression 
ExpandDenominator|r] expands only the denominator 
ExpandAl1l[r] expands both numerator and denominator completely 
Together[r) combines the terms in the expression over a common denominator 
Apart|[r] represents the expression in partial fractions 
Cancel|r] cancels the common factors in the fraction 
20.3.1.1 Multiplication of Expressions 
The operation of multiplication of algebraic expressions can always be performed. The coefficients can 


also be undefined expressions. 
B I[1]:— Bro +y—z)‘] gives 


Out [1] = ο) } Εθα” y? 4 4ry+yt—4a% 2-122? yz-l2ry 4y>z 
+6 2? 2? ρω. + 6y2227-4a23—Ayz +z 
Similarly, 
In[3] : = Expand|(a x + by?)(c x? — d y?)] 
Out [3] =aca*—adaxy? +bea y? — bdy! 


20.3 Important Applications with Mathematica 1037 


20.3.1.2 Factorization of Polynomials 
Mathematica performs factorization over the integer or rational numbers if it is possible. Otherwise the 
original expression is returned. 
α In[1]:— p= x9 + Ταῦ + 1204 + 6x? — 25a? — 30x — 25; 

In[2] := Factor|p], gives 

Out [2] = (5 + x) (1+ x + 2?) (-5+2?4+2%) 
Mathematica decomposes the polynomial into three factors which are irreducible over the rational num- 
bers. 


If a polynomial can be completely decomposed over the complex rational numbers, then this can be 
obtained by the option GaussianIntegers. 


Wb 1n[1]:— Factor[i? -2r 45] — Out[1]— b— 2x - 3? , but 
In[2]:— FactorGaussianIntegers—> True) 


Out [2] = ((—1 — 21) + α)({--1 + 21) + 2) 


20.3.1.3 Operations with Polynomials 
Table 20.10 contains a collection of operations by which polynomials can be algebraically manipulated 
over the field of rational numbers. 


Table 20.10 Algebraic polynomial operations 


PolynomialGOCD[pl, p2] determines the greatest common divisor of p1 and p2 
PolynomialLCM[pl, p2] determines the least common multiple of p1 and p2 
PolynomialQuotient|pl,p2,r| ^ divides p1 (as a function of x) by p2, the residue 

is omitted 
PolynomialRemainder[pl,p2,r| determines the residue on dividing p1 by p2 
MonomialList|p] gives the list of all monomials in the polynomial p 


W Two polynomials are defined: 
In[1]:— p= xô + Ta? + 12x + 6a? — 25x? — 30x — 25; 


q = zt + a? — 6x? — Tx — T; 


With these polynomials the following operations are performed: 

In[2] := PolynomialGCD[p.g] —> Out[2] = 1 +2 + x° 

In[3] := PolynomialLCM[p, q]//Factor 

Out [3] = (5 + £)(—7 + £?) (1 + x + 22)(—5 +2? + αὖ) 

In[4] := PolynomialQuotient[p, q, x] > Out [4] = 12 + 6x + 2? 

In[5] := PolynomialRemainder|p, q, x] —— Out [5] = 59 + 96x + 962? + 372? 
With the two last results one gets 

£ê + Ta? + 1204 + 6r? — 25x? — 302 — 25 PA 37a? + 962? + 96a + 59 

xt + r3 — 63? — Tr — T ια gt |- αὖ — 62? — Τα -- Τ᾽ 

20.3.1.4 Partial Fraction Decomposition 


Mathematica can decompose a fraction of two polynomials into partial fractions, of course, over the 
field of rational numbers. The degree of the numerator of any part is always less than the degree of the 
denominator. 


W Using the polynomials p and q from the previous example one gets 
6 55--llr4- 62? 


Tn[1] := πι ese (5-Fz) ' 35 (—5- a? 4 x3) 
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20.3.1.5 Manipulation of Non-Polynomial Expressions 

Complicated expressions, not necessarily polynomials, can often be simplified by the help of the com- 
mand Simplify. Mathematica will always try to manipulate algebraic expressions, independently of 
the nature of the symbolic quantities. Here, certain built-in knowledge is used. Mathematica knows the 
rules of powers (see 1.1.4.1, p. 7): 

In[1] := Simplify[a"/a")] —> Out[1] = a "*" (20.28) 
With the option Trig — True, the commands Expand and Factor can express powers of trigono- 
metric functions by trigonometric functions with multiple arguments, and conversely. Alternatively, 
TrigExpand, TrigFactor, TrigFactorlist, TrigReduce, ExpToTrig, TrigToExp can be applied. 
WB  In[1]:— TrigExpand[Sin [2xlCos [2y1] 


Out [1] = 2Cos [x] Cos [y]?Sin[x] -2Cos [x] Sin [x] Sin [y]? 


In[2]:— Factor[Sin [4x], Trig—» True] — 8Cos [x]?Sin[x] + 4Cos [x] Sin[x]) 
Out [2] = 0 


In[3] := Factor[Cos [5x], Trig—> True 
Out [3] = Cos[x] (1 — 2Cos[2x] + 2Cos[4x]). 
Remark: The command ComplexExpand[expr] assumes a real variable expr, while in the command 
ComplexExpand[ezpr, (x1, x2, . ..)] the variables x are supposed to be complex. 


In[1] :— ComplexExpand[Sin[2 x] ,{x}] 
Üut[1] = Cosh[2Im[x]] Sin[2Re[x]]*I Cos[2Re[x]] Sinh[2 Im[x]] 


20.3.2 Solution of Equations and Systems of Equations 


Computer algebra systems know procedures to solve equations and systems of equations. If the equa- 
tion can be solved explicitly in the domain of algebraic numbers, then the solution will be represented 
with the help of radicals. If it is not possible to give the solution in closed form, then at least numerical 
solutions can be found with a given accuracy. In the following some basic commands will be introduced. 
The solution of systems of linear equations (see 4.5.2, p. 308) is discussed here in a special section (see 


20.3.2.4, p. 1040). 
20.3.2.1 Equations as Logical Expressions 


Mathematica allows the manipulation and solution of equations within a wide range. In Mathematica, 
an equation is considered as a logical expression. If one writes 


In[1]:— g = x) +2x-9==0, (20.294) 
then Mathematica considers it as a definition of a function with Boolean values. Giving the input 
In[2] := %/.x—> 2, yields Out [2] = False, (20.29b) 


since with this value of x the left-hand side and right-hand side are not equal. 
The command Roots[g, x] transforms the above identity into a form which contains x explicitly. Math- 
ematica represents the result with the help of the logical OR in the form of a logical statement: 

In[3] = Roots|g.z|] —> Out[3] = z == —1— v10||r == —1 + v10 (20.299) 
In this sense, logical operations can be performed with equations. 
With the operation ToRules, the last logical type equations can be transformed as follows: 


In[4] := {ToRules[%]} 
Out [4] = ((x > —1 — V10}, (x -> —1 + V10}} (20.29d) 
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20.3.2.2 Solution of Polynomial Equations 
Mathematica provides the command Solve to solve equations. In a certain sense, Solve perform the 
operations Roots and ToRules after each other. 
Mathematica solves polynomial equations in symbolic form up to fourth degree, since for these equations 
solutions can be given in the form of algebraic expressions. However, if equations of higher degree can be 
transformed into a simpler form by algebraic transformations, such as factorization, then Mathematica 
provides symbolic solutions. In these cases, Solve tries to apply the built-in operations Expand and 
Decompose. 
In Mathematica numerical solutions are also available. 
W The general solution of an equation of third degree: 
In[1] := Solve[i? +a 3? +bx+c==0,2] 
Mathematica gives 
Out [1] = ((x— 
21/3 (—a? + 3b) 
1/3 
3(-2a* + 9ab— 27 c-- 33? \/— (a? 13) +40 + 4a? c— 18abe + 21 C?) 
( 2a? -9ab —27c4- 332 y (a? b2) + 40? + Aaà3 c — 18abc4 xe) 
| 32173 } 
E 
'The solution list here shows only the first term explicitly because of the length of their terms. If an 


equation with given coefficients a, b, c has to be solved, then it is better to handle the equation itself 
with Solve than to substitute a, b, c into the solution formula. 


E A: For the cubic equation (see 1.6.2.3, p. 40) x? + 6x + 2 = 0 one gets: 

In[1] := Solve[i? + 6 x -- 2 == 0, x] 

— 1-I/3 1-«IV3 1—IV3 111ν3 

Out [1] = ((z— 2 — 225), (3— 21/3 23/3 hir os 1 917 n 
W B: Solution of an equation of sixth degree: 

In[2] : = Solve|x® — 6a? + 6x1 — 4a? + 652? — 38x — 190 == 0, x] 

Out [2] = {{x—> —-1}, {a> -1- 21}, {a> —1 +21}, (x— 2}, {a> 3}, {x 41} 
Mathematica succeeded in factorizing the equation in B with internal tools; then it is solved without 
difficulty. 


If numerical solutions are required, then the command NSolve can be used. 


? 


W Tho following equation is solved by NSolve: 
In[3] : = NSolve|[x® — 4a? + 6x* — 52? + 3a? — Ay + 2 == 0,2 
Out [3] = {{α-5 —0.379567 — 0.76948 I}, {x—> —0.379567 + 0.76948 I}, 
{x—> 0.641445}, (x— 1. — 1. I}, (z— 1. + 1. I}, (x— 2.11769}} 


20.3.2.3 Solution of Transcendental Equations a 
Mathematica can solve transcendental equations, as well. In general, this is not possible symbolically, 
and these equations often have infinitely many solutions. In these cases, an estimate of the domain 
should be given, where Mathematica has to find the solutions. This is possible with the command 
FindRoot|g, (x, ,)], where xs is the initial value for the search of the root. 
E In[1]:— FindRoot|x + ArcCoth[r| — 4 == 0, (z, 1.1)] 
Üut[1] = {x—> 1.00502} and 
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In[2] : = FindRoot|x + ArcCoth[r|] — 4 == 0, (z,5)] — Out[2] = (x— 3.72478} 


20.3.2.4 Solution of Systems of Equations 
Mathematica can solve simultaneous equations. The operations, built-in for this purpose, are repre- 
sented in Table 20.11, and they present the symbolical solutions, not the numerical ones. 

Similarly to the case of one unknown, the command NSolve gives the numerical solution(s). The solu- 
tion of systems of linear equations is discussed in 20.3.3, p. 1040. 


Table 20.11 Operations to solve systems of equations 


Solve|(l, == rı, l2 == r2,...}, vars] solves the given system of equations with respect to 
vars 

Eliminate|(l, == ri,...), vars] eliminates vars from the system of equations 

Reduce[(l; Ξ---τι,...}, vars] simplifies the system of equations and gives the possi- 
ble solutions 

FindInstancelerpr, vars] finds an instance of vars that make expr true 


20.3.3 Linear Systems of Equations and Eigenvalue Problems 


In 20.2.4, p. 1027, the notion of matrix and several operations with matrices were defined on the basis 
of lists. Mathematica applies these notions in the theory of systems of linear equations. In the following 
command m,n denote given integers, not variables. 


P = Array|p, (m, τι} (20.30) 
defines a matrix of type (m, n) with elements p;; = ρ[[1. j]]. Furthermore 
X = Array(x,{n}] und B = Array|b, {m} (20.31) 


are n- or m-dimensional vectors. With these definitions the general system of linear homogeneous or 
inhomogeneous equations can be written in the form (see 4.5.2, p. 308) 


P.X--B P.X == or Thread[P.X == B] Thread[P.X == 0] (20.32) 
1. Special Case n = m, det P40 


In the special case n = m, detP Æ 0, the system of inhomogeneous equations has a unique solution, 
which can be determined directly by 

X = Inverse[P]. B (20.33) 
Mathematica can handle such systems of up to ca. 5000 unknowns in a reasonable time, depending on 
the computer system. An equivalent, but much faster solution is obtained by LinearSolve[P, B]. 
2. General Case 
With the commands LinearSolve and NullSpace, all the possible cases can be handled as discussed 
in 4.5.2, p. 308, i.e., it can be determined first if any solution exists, and if it does, then it is calculated. 
Now, some of the examples from 4.5.2, p. 308ff. will be discussed. 
W A: The example in 4.5.2.1, 2., p. 310, is a system of homogeneous equations 


πι πο + 5x3 $40 
t+ πο — 233 + 334 = 0 
321 22 + 823 f=) 


z1 + 329 — 9x3 + ται = 0 
which has non-trivial solutions. These solutions are the linear combinations of the basis vectors of the 
null space of matrix p. It is the subspace of the n-dimensional vector space which is mapped into the 
zero by the transformation p. A basis for this space can be generated by the command Nul1Space [y]. 
With the input 
In[1] := p = {{1,-1,5, 1, {1, 1, —2, 3}, {3, -1, 8, 1}, {1, 3, -9, 7}} 
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a matrix, whose determinant is actually zero is defined (check it by calculating Det[p]). Now 
In[2]:— NullSpace[p| and Out [2] = ((—1, —2,0, 1}, {—3,7,2,0}} 

is displayed, a list of two linearly independent vectors of four-dimensional space, which form a basis for 

the two-dimensional null-space of matrix p. An arbitrary linear combination of these vectors is also in 


the null-space, so it is a solution of the system of homogeneous equations. This solution coincides with 
the solution found in 4.5.2.1, 2., p. 310. 


W B: Consider the example A in 4.5.2.1, 2., p. 309, 


πι — 229 + 3x3 21 + 215 
3a πο + 523 — 3x4 
2a, + 21 + 2:4 — 214, — 9235 = 8 


with matrix m1 of type (3, 5), and vector b1 
In[1] := m1 = {{1, —2,3, 1,2), (3, 21,5, 3, —1}, (21,2, 2, -3] 5 
In[2] := bl = {2,6,8}; 


For the command 
In[3] := LinearSolve|ml1, b1] the response is 
LinearSolve : : nosol: Linear equation encountered which has no solution. 
The input appears as output. 
W C: According to example B from 4.5.2.1, 1., p. 309, 
πι — T2 + 227,— 1 
zı — 233 — t= 2 
ὅσι — %2+5%3= 3 
-251 + 229 + 933 = —4 


the input is 

In[1] := m2 = {{1, -1, 2}, (1, -2, -1}, {3, —1, 5}, {-2, 2, 3}}; 

In[2] := b2 = (1,2,3, —4}; 
To learn how many equations have independent left-hand sides, one calls 

In[3] := RowReduce|m2];—+> Out [3] = {{1,0,0}, {0, 1, 0}, {0, 0, 1}, {0,0, O}} 
Then the input is 


1 1 2 
In[4] := LinearSolve[m2,02]; — Out[4] = C. T at 


The answer is the known solution. 


3. Eigenvalues and Eigenvectors 

Eigenvalues and eigenvectors of matrices are defined in 4.6, p. 314. Mathematica provides the possibility 
of determining eigenvalues and eigenvectors by special commands. So, the command Eigenvalues|m| 
produces a list of eigenvalues of a square matrix m, Eigenvectors|m] creates a list of the eigenvectors of 
m, whereas Eigensystem|m| gives both. If N[m] is substituted instead of m, then one gets the numerical 
eigenvalues. In general, if the order of the matrix is greater than four (n > 4), then no algebraic 
expression can be obtained, since the characteristic polynomial has degree higher than four. In this 
case, one should ask for numerical values. 

E In[1]:— h = Table[l/(i + j — 1), (4,5), {5,5} 

This generates a five-dimensional so-called Hilbert matrix. 


1111 ]-1 1.1. 1 TL odo 1111 11111 


Out [1] - {15.9 geht D SUI ENA hills biebeeen e H 


1042 20. Computer Algebra Systems- Example Mathematica 


With the command 

In[2] := Eigenvalues[h] 
the answer (which may be not useful) is 

{Root [-1 + 307505 #1 - 1022881200 #17 + ...} 
But with the command 

In[3] := Eigenvalues [N[h]] one gets 


Out [3] = {1.56705, 0.208534, 0.0114075, 0.000305898, 3.28793 x 105} 


20.3.4 Differential and Integral Calculus 


The notation of the derivative as a functional operator was introduced in 20.2.8, p. 1032. Mathematica 
provides several possibilities to apply the operations of analysis, e.g , determination of the derivative 
of arbitrarily high order, of partial derivatives, of the complete differential. determination of indefinite 
and definite integrals, series expansion of functions, and also solutions of differential equations. 


20.3.4.1 Calculation of Derivatives 


1. Differentiation Operator 
The differentiation operator (see 20.2.8, p. 1032) is Derivative. Its complete form is 


Derivative|[ni,nms,...] 
result as a pure function. 
2. Differentiation of Functions 


With D[ [x], x], the derivative of the function f at the argument x will be determined. 
D belongs to a group of differential operations, which are enumerated in Table 20.12. 


Table 20.12 Operations of differentiation 


(20.34) 


'The arguments say how many times the function is to be differentiated with respect to the current 
variables. In this sense, it is an operator of partial differentiation. Mathematica tries to represent the 


'The differentiation of a given function can be performed in a simplified manner with the operator D. 


D[f [x], (x, n) yields the n-th derivative of function f(x) with respect to x 
Dif, (xi, n1}, (12, πη}; multiple derivatives, n;-th derivative with respect tox; (i = 1,2,-- 
Dt[f] the complete differential of the function f 
d, 
Delf, x] the complete differential τ of the function f 
Dt[f, 21, %2,...] the complete differential ob a function of several variables 


) 


ΒΛ: In[1] :— D[Sqrt[x? Exp[4z] Sin[x]], x] 
Et? z? Cos[x] + 3E!" z?Sin[x] + 4E!" 2? Sin[v] 


2 ΕΛΣ 23 Sin[r] 


MB: [1] :— D((2x - 1)*,z] —  Out[1] = (1 +27)?” ( 


Out [1] = 


6x 
3Log[1*2 ) 
or + 3Log[1+2 x] 
The command Dt results in the complete derivative or complete differential. 
MC: In[1]:— Dt[zi? +y] — Out [1] = 3i? Dt[y] + 3y? Dt[y] 


WB D: In[1]:— Dt[z? 1 y), z| —  Out[1] = 3a? + 3y?Dt[y, x] 
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In this last example, Mathematica supposes y to be a function of x, which is not known, so it writes the 
second part of the derivative in a symbolic way. The preferable forms of writing are: D{x[t?] + y[t]?, t] 
and D[z? + y[x]?, x] showing explicitly the independent variables. 

If Mathematica finds a symbolic function while calculating a derivative, it leaves it in this general form, 
and expresses its derivative by f’. 

ΒΕ: In[1]:—D[r f[z]*,27] — Out [1] = f[x]? + 3a£[x]?£? [v] 

Mathematica knows the rules for differentiation of products and quotients, it knows the chain rule, and 
it can apply these rules formally: 


ΒΕ: In[1]:—D[f[ux], x] — Out[1] = £'[u[z]] v [x] 


uir] ule] v'[x] 


ν να 


E G: In[1]:—D[|u[r]/v(v|, x] — Out[1] = 


20.3.4.2 Indefinite Integrals 


With the command Integrate[f, x], Mathematica tries to determine the indefinite integral / f(a) da. 


If Mathematica knows the integral, it gives it without the integration constant. Mathematica supposes 
that every expression not containing the integration variable does not depend on it. 

In general, Mathematica finds an indefinite integral, if there exists one which can be expressed in closed 
form by elementary functions, such as rational functions, exponential and logarithmic functions, trigono- 
metric and their inverse functions, etc. If Mathematica cannot find the integral, then it returns the orig- 
inal input. Mathematica knows some special functions which are defined by non-elementary integrals, 
such as the elliptic functions, and some others. 


To demonstrate the possibilities of Mathematica, some examples will be shown, which are discussed in 


8.1, p. 4808. 


1. Integration of Rational Functions 
(see also 8.1.3.3, p. 485ff.) 


ΒΑ: In[1] := Integrate|(2x + 3)/(x? + 3? — 2x), x] 


t Log|r 1 
Out [1] = ee 1-4 zx] 3Loglz] -Log|2 + x] 
3 2 6 
ΒΒ: In[1] :— Integrate[(z? + 1)/(v(x — 1)*), v] 
1 
Out [1] = + 2L l1c-z|-L : 20.35 
welt] = ~~ πο ο (20.35) 
On the monitor can be seen in the left corner of the next cell a plus sign. Clicking on it one may choose 
either the free-from input or the Wolfram-Alpha query. If one types the integral into one of these then 


there is given the possibility to have a look at all the details of the process of integration. 


2. Integration of Trigonometric Functions 


(see also 8.1.5, p. 491ff.) 
W A: The example A in 8.1.5.2, p. 492, with the integral / sin? x cos? x dx is calculated (substitution [6 
is done by the program automatically, if needed): 
In[1] : = Integrate[Sin|2]*Cos|z]”, x] 
5 Sin|[z| 1 ] 
64 192 


Out [1] = Sin[3z] 5j Sin[5 z] gigi ο 
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W B: The example B in 8.1.5.2, p. 492, with the integral / ves dx is calculated: 


In[1] := Integrate[Sin|r]/Sqrt|Cos[r].x] — Out[1] = —2y Cos[r] 


Remark: In the case of non-elementary integrals Mathematica may do nothing. 


E 77/1] := fra > Dut[1] = [xe 


20.3.4.3 Definite Integrals and Multiple Integrals 


1. Definite Integrals 

With the command Integrate|/, (x, £a, Πολ], Mathematica can evaluate the definite integral of the 
function f(x) with a lower limit x, and upper limit ze. 

E A: In[1] :— Integrate[Exp[-27], (7,0, Infinity)]] —> Out[1] = ps 
(see Table 21.8, p. 1098, No. 25 for a — 1). 

W B: Ifthe input is 


1 
In[1]:— Integrate[—,, {α, —1,1]] one gets 
x 


1 T 
Out [1] = Integrate::idiv: "Integral of E does not converge on (-1,1]. 


In the calculation of definite integrals one should be careful. If the properties of the integrand are not 
known, then it is recommended to ask for a graphical representation of the function in the considered 
domain before integration. 
2. Multiple Integrals 
Definite double integrals can be called by the command 

Integrate[f[v. y], (x, ta, τε}. (y. Va Ye] (20.36) 
The evaluation is performed from right to left, so, first the integration is evaluated with respect to y. 
The limits ya and ye can be functions of x, which are substituted into the primitive function. Then the 
integral is evaluated with respect to x. 
W For the integral A, which calculates the area between a parabola and a line intersecting it twice, in 
8.4.1.2, p. 524, one gets 55 

In[1] := Integrate|x y^, {x,0, 2}, (y, 2?,2:)]] —  Out[1] = = 


B 
Alsointhis case, it isimportant to be careful with the discontinuities of the integrand. The domain of in- 
tegration can also be specified with inequalities: Integrate [Boole[z?4-y? < 1, (z, 21, 1, (y, 21, 1)1] 
gives 7. 


20.3.4.4 Solution of Differential Equations 


Mathematica can handle ordinary differential equations symbolically if the solution can be given in 
closed form. In this case, Mathematica gives the solution in general. The commands discussed here are 
listed in Table 20.13. 
The solutions (see 9.1, p. 540) are represented as general solutions with the arbitrary constants CTi]. 
Initial values and boundary conditions can be introduced in the part of the list which contains the equa- 
tion or equations. In this case a special solution is returned. As examples, two differential equations 
are solved here from 9.1.1.2, p. 542. 


W A: The solution of the differential equation y'(r) — y(x) tan x = cos x is to be determined. 
In[1] := DSolvely’|x] — ylz] Tan[(v] == Cos[x], y, x] 
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Mathematica solves this equation, and gives the solution as a pure function with the integrations con- 
stant ΟΠ]. 


Table 20.13 Commands to solve differential equations 


DSolve|deq, y[x], x] solves the differential equation for γα] (if it is possible); y[x] 
may be given in implicit form 
DSolve|deq, y, x] gives the solution of the differential equation in the form of a 


pure function 
DSolve|(deqi,deqo,...],y,v| solves a system of ordinary differential equations 


Out [1] = {{ν — Function[x, C[1] Sec[x] + Sec[x] (5 + ον i 


If it is required to get the solution value y[r], then Mathematica gives 
d$ I 

In[2]:— y|x]/. %1 — Outl2] = {Clilsec x] + Sec[x] (5 + petal α))} 
One also could make the substitution for other quantities, e.g., for y'[r] or y[1]. The advantage of using 
pure functions is obvious here. 
Β B: The solution of the differential equation y'(z)z(x — y(x)) + να) = 0 (see 9.1.1.2, 2., p. 542) is 
to be determined. 

In[1] := DSolvely' [x] x(a — y[]) + v[x]^2 == 0, y[x], x] 


E-eul 
Out [1] = (bed —> -x ProductLog[- m |J 


Here ProductLog[z] gives the principal solution for w in z — we". The solution of this differential 
equation was given in implicit form (see 9.1.1.2, 2., p. 542). 

If Mathematica cannot solve a differential equation it returns the input without any comment. In such 
cases, or also, if the symbolic solution is too complicated, the solutions can be found by numerical 
solutions (see 19.8.4.2, 5., p. 1018). Also in the case of symbolic solutions of differential equations, like 
in the evaluation of indefinite integrals, the efficiency of Mathematica should not be overestimated. If 
the result cannot be expressed as an algebraic expression of elementary functions, the only way is to 
find a numerical solution. 
Remark: Mathematica can solve some partial differential equations both symbolically and numerically, 


as well, even on complicated multidimensional domains. 


20.4 Graphics with Mathematica 


By providing routines for graphical representation of mathematical relations such as the graphs of func- 
tions, space curves, and surfaces in three-dimensional space, modern computer algebra systems provide 
extensive possibilities for combining and manipulating formulas, especially in analysis, vector calculus, 
and differential geometry, and they provide immeasurable help in engineering designing. Graphics is a 
special strength of Mathematica. 


20.4.1 Basic Elements of Graphics 


Mathematica builds graphical objects from built-in graphics primitives. These are objects such as points 
(Point), lines (Line) and polygons (Polygon) and properties of these objects such as thickness and 
color. 

Mathematica has several options to specify the environment for graphics and how the graphical objects 
should be represented. 
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With the command Graphics|list], where list is a list of graphics primitives, Mathematica is called 
to generate a graphic from the listed objects. The object list can follow a list of options about the 
appearance of the representation. 


With the input 
In[1] := g = Graphics|{Line[{{0,0}, {5,5}, (10, 3}}], Circle[{5, 5}, 4]. (20.37a) 
|, 


Text |Style[“Example” , “Helvetica” ,Bold,25]. (5, 6}]}, AspectRatio—> Automatic] (20.37b) 
a graphic is built from the following elements: 

a) Broken line of two line segments starting at the point (0,0) through 
the point (5,5) to the point (10, 3). 

example b) Circle with the center at (5,5) and radius 4. 
c) Text with the content "Example", written in Helvetica font, boldface 
(the text appears centered with respect to the reference point (5, 6)). 
With the call Show|g], Mathematica displays the figure (Fig. 20.1). 
Certain options might be previously specified. Here the option 

Figure 20.1 AspectRatio is set to Automatic. 


By default Mathematica makes the ratio of the height to the width of the graph 1 : GoldenRatio (sce 
e.g. 3.5.2.3,3., p. 194). It corresponds to a relation between the extension in the x direction to the one 
in1:1/1.618 = 1: 0.618. With this option the circle would be deformed into an ellipse. The value of 
the option Automatic ensures that the representation is not deformed. 


20.4.2 Graphics Primitives 


Mathematica provides the two-dimensional graphic objects enumerated in Table 20.14. 

Besides these objects Mathematica provides further primitives to control the appearance of the rep- 
resentation, the graphics commands. They specify how graphic objects should be represented. The 
commands are listed in Table 20.15. 

There is a wide scale of colors to choose from but their definitions are not discussed here. 


Table 20.14 Two-dimensional graphic objects 


Point|(z, y} point at position z, y 

Line[{{21, y1}, (12, 2, ...]] broken line through the given points 

Rectangle|(zi,, Ytu}, {£ro, Yro}] shaded rectangle with the given coordinates left-down, right-up 
Polygon|{{x1, Y1}, {σοι yo},...}] ^ shaded polygon with the given vertices 

Circle[[z, y), r] circle with radius r around the center x, y 

Circlel|{x, y},r, (05, 03]] circular are with the given angles as limits 

Circle|(z, y}, {a,b} ellipse with half-axes a and b 

Circle|{x, y}, {a,b}, {a1, a2}] elliptic arc 

Disk|[z, y}, r], Disk|{x, y}, {a,b}] shaded circle or ellipse 

Text |[text, {x, y)] writes text centered to the point x, y 


Table 20.15 Graphics commands 


PointSize|a] a dot is drawn with radius a as a fraction of the total picture 

AbsolutePointSize[b) denotes the absolute radius b of the dot (measured in American 
pt (0.3515 mm)) 

Thickness|a] draws lines with relative thickness a 

AbsoluteThickness[b) draws lines with absolute thickness b (also in pt) 

Dashing|(ai, a2, a3,...}] draws a line as a sequence of stripes with the given length (in 
relative measure) 

AbsoluteDashing|{b), bo,...}] the same as the previous one but in absolute measure 

GrayLevel |p] specifies the level of shade (p = 0 is for black, p = 1 is for white) 
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20.4.3 Graphical Options 
Mathematica provides several graphical options which have an influence on the appearance of the entire 
picture. Table 20.16 gives a selection of the most important commands. For a detailed explanation, 
see [20.16]. 

Table 20.16 Some graphical options 


AspectRatio —» w sets the ratio w of height and width. Automatic determines 
w from the absolute coordinates; the default setting is 
w = 1:GoldenRatio 

Axes —> True draws coordinate axes 

Axes —> False does not draw coordinate axes 

Axes —> {True, False} shows only the z-axis 

Frame —> True shows frames 

GridLines —> Automatic shows grid lines 

AxesLabel —> {Tsymbol, Ysymbol} denotes axes with the given symbols 

Ticks —> Automatic denotes scaling marks automatically; with None they can be 
suppressed 

Ticks > {{αι,αο....},{υι.να....}} scaling marks are placed at the given nodes 


20.4.4 Syntax of Graphical Representation 
20.4.4.1 Building Graphic Objects 


If a graphic object is to be built from primitives, then first a list of the corresponding objects with their 
global definition should be given in the form 

(object, objecta, . . .), (20.384) 
where the objects themselves can be lists of graphic objects. Let object1 be, e.g., 

In[1] := οἱ = {Circle[{5,5}, {5,3}], Line[{{0, 5}, (10,51 }]} 
and corresponding to it 

In[2] := 02 = {Circle[{5, 5}, 3]} 
as in Fig.20.1. If a graphic object, e.g., 02, is to be provided with certain graphical commands, then 
it should be written into one list with the corresponding command 

In[3] := οὗ = {Thickness[0.01], 02} 
This command is valid for all objects in the corresponding braces, and also for nested ones, but not for 
the objects outside of the braces of the list. 
From the generated objects two different graphic lists are defined: 

In[4] := αἱ = Graphics|[{ol, 02}] ; g2 = Graphics[(o1, 03}| 
which differs only in the second object by the thickness of the circle. The call 

Show[gl| and Show[g2, Axes —> True] (20.38b) 
givs the pictures represented in Fig. 20.2. 
In the call of the picture in Fig. 20.2b, the option Axes —> True was activated. This results in the 
representation of the axes with marks on them chosen by Mathematica and with the corresponding 
scaling. 
20.4.4.2 Graphical Representation of Functions 
Mathematica has special commands for the graphical representation of functions. With 

Plot[f [z], {2, Tirini Zmac}] (20.39) 
the function f is represented graphically in the domain between x = r,,;;, and £ = Lmax. Mathematica 
produces a function table by internal algorithms and reproduces the graphics following from this table 
by graphics primitives. 
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a) b) 2 4 6 8 10 


WB OUIO Oo 


Figure 20.2 Figure 20.3 


E If the function x — sin 2r is to be graphically represented in the domain between —27 and 27, then 
the input is 
In[1] := Plot[Sin[2x], (x, —2Pi,2P1]]. 
Mathematica produces the curve shown in Fig. 20.3. 
It is obvious that Mathematica uses certain default graphical options in the representation as men- 
tioned in 20.4.1, p. 1045. So, the axes are automatically drawn, they are scaled and denoted by the 
corresponding 2 and y values. In this example, the influence of the default AspectRatio can be seen. 
The ratio of the total width to the total height is 1 : 0.618. 
With the command InputForm|%] the whole representation of the graphic objects can be shown. For 
the previous example one gets: 
Graphics[{{{}, {}, (DirectivelOpacity[1.], RGBColor[0.368417, 0.506779, 0.709798], 


AbsoluteThickness[1.6]], Line[([-6.283185050723043, 2.5645654335783057* ‘}, 
..., {6.283185050723043, -2.5645654335783057* "| )], {DisplayFunction -> Identity, 


AspectRatio -> GoldenRatio C” , Axes -> {True, True}, AxesLabel -> {None, None}, 
AxesOrigin -> {0, 0}, DisplayFunction :> Identity, 

Frame -> {{False, False}, {False, False}}, FrameLabel -> {{None, None}, 

(None, None}}, FrameTicks -> {{Automatic, Automatic}, [Automatic, Automatic}}, 
GridLines -> {None, None}, GridLinesStyle -> Directive[GrayLevel [0.5, 0.411, 
Method -» {"DefaultBoundaryStyle" -» Automatic, "ScalingFunctions" -> None}, 
PlotRange -> {{-2*Pi, 2*Pi}, {-0.9999996654606427, 0.9999993654113022}}, 
PlotRangeClipping -> True, PlotRangePadding -> {{Scaled[0.02] , Scaled[0.02]}, 
(Scaled[0.05], Scaled[0.05]}}, Ticks -> (Automatic, Automatic}}] 


Consequently, the graphic object consists of a few sublists. The first one contains the graphics primitive 
Line (slightly modified), with which the internal algorithm connects the calculated points of the curve 
by lines. The second sublist contains the options needed by the given graphic. These are the defaul 
options. If the picture is to be altered at certain positions, then the new settings in the Plot command 
must be set after the main input. With 

In[2] := Plot[Sin(2x], (rv, —2Pi,2Pi), AspectRatio— 1] (20.40 
the representation would be done with equal length of axes x and y. 
It is possible to give several options at the same time after each other.With the input 

Plot[{fi [z], f [ο], oe abs ix, Tmin, Lmaz Ἡ (20.41 
several functions are shown in the same graphic. With the command 

Show|plot, options] (20.42 
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an earlier picture can be renewed with other options. With 


Show|GraphicsArray|list]], 


(with list as lists of graphic objects) pictures can be placed next to each other, under each other, or 


they can be arranged in matrix form. 


20.4.5 Two-Dimensional Curves 


A series of curves from the chapter on functions and their representations (see 2.1, p. 48ff.) is shown as 


examples. 


20.4.5.1 Exponential Functions 


A family of curves with several exponential functions (see 2.6.1, p. 72) is generated by Mathematica 


(Fig. 20.4a) with the following input: 


In[1] := fl := 2^w;g|r - 
In[2] = h[x_] := (1/0) λα: j 


These are the definitions of the considered functions. There is no need to define the function e”, since 
second step the following graphics are generated: 


In[3] := pl = Plot{{£[x], h{x]}, (x, —4, 4}, PlotStyle—> Dashing|{0.01, 0.02}]] 


it is built into Mathematica. In the 


In[4] := p2 = Plot[{Exp[:], 


In[5] := p3 = Plot|{g|x], k[x]), (x, —4, 4}, PlotStyle—> Dashing|(0.005, 0.02, 0.01, 0.02}]] 


:= 1042; 


[e] := (1/E)*a; k[x_] := (1/10)^z; 


1], ία, —4, 4j] 


The whole picture (Fig. 20.48) can be obtained by: 


In[6] := Show|{p1, p2, p3}, PlotRange—> (0,18), AspectRatio— 1.2] 
The question of how to write text on the curves is not discussed here. This is possible with the graphics 


primitive Text. 


a) 


20.4.5.2 Function y = cx 


Considering the properties of the function Arcoth x discussed in 2.10, p. 93, the function y = x + 


10 


b) 
Figure 20.4 
+ Arcoth x 


Arcoth x can be graphically represented in the following way: 


In[1]:— {1 = Plot[x + ArcCoth|[z], (1, 1.000000000005, 7} 


In[2] := f2 = Plot|x + ArcCoth{z], {x, — 7, —1.000000000005}] 
In[3] := Show|{ f1, f2},PlotRange—> {--10, 10}, AspectRatio— 1.2, Ticks— 
{{{—6, —6}, {—1, —1), (1, 1), (6, 6}}, (42.5, 2.5}, (10, 10}}}, AxesDrigin— > 0,0] 


The high precision of the x values in the close neighborhood of 1 and —1 was chosen to get sufficiently 
large function values for the required domain of y. The result is shown in Fig. 20.4b. 


1050 20. Computer Algebra Systems- Example Mathematica 


20.4.5.3 Bessel Functions (sce 9.1.2.6, 2., p. 562) 
With the calls 
In[1] := bj0 = Plot|{BesselJ[0, z], Besse1J|2, z], Besse1J|4, z]}], (2,0, 10}, PlotLabel—> 
TraditionalForm[{BesselJ[0, 2], BesselJ[2, z], BesselJ[4, 2] }} (0.444) 
1π [3] := bj1 = Plot|{BesselJ[1, z], Besse1J(3, z], Besse1J[5, z]}, (2, 0, 10}, PlotLabel—> 
TraditionalForm[{BesselJ[1, z], BesselJ[3, z], BesselJ[5, z]}]] (20.445) 
the graphics of the Bessel function J,,(z) for n = 0,2,4 and n = 1,3,5 are generated, which are then 
represented by the call 
In[3] := GraphicsRow|{bj0, bj1}]] 
next to each other in Fig. 20.5. 


1 J(n,z) π-0,24 0.6 [(1,2) n=1,3,5 


b) 
Figure 20.5 


20.4.6 Parametric Representation of Curves 
Mathematica has a special graphics command, with which curves given in parametric form can be graph- 
ically represented. This command is: 

ParametricPlot|(f.(t), fylt) V. (t.t, 09]. (20.45) 
It provides the possibility of showing several curves in one graphic. A list of several curves must be 
given in the command. With the option AspectRatio—> Automatic, Mathematica shows the curves 
in their natural forms. 
The parametric curves in Fig. 20.6 are the Archimedean spiral (see 2.14.1, p. 105) and the logarithmic 
spiral (see 2.14.3, p. 106). They are represented with the input 

In[1] := ParametricPlot|[t Cos[t], t Sin|t]), (t, 0, 3Pi}, AspectRatio— Automatic] 
and 

In[2] :— ParametricPlot|{Exp|0.1¢] Cos|t], Exp|0.1¢] Sin|t]). (t. 0, 3Pi}, 

AspectRatio—> Automatic] 

With 

In[3] := ParametricPlot[{t — 2 Sin|t], 1 — 2 Cos[t]), (t, —Pi, 11Pi], AspectRatio—> 0.3] 
a trochoid (see 2.13.2, p. 102) is generated (Fig. 20.7). 


3) 10 15 35 


Figure 20.7 


Figure 20.6 
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20.4.7 Representation of Surfaces and Space Curves 


Mathematica provides the possibility of representing three-dimensional graphics primitives. 

Similarly to the two-dimensional case, three-dimensional graphics can be generated by applying differ- 
ent options. The objects can be represented and observed from different viewpoints and from different 
perspectives. Also the representation of curved surfaces in three-dimensional space, i.e., the graphical 
representation of functions of two variables, is possible. Furthermore it is possible to represent curves 
in three-dimensional space, e.g., if they are given in parametric form. For a detailed description of 
three-dimensional graphics primitives see [20.5], [20.16]. The introduction of these representations is 
similar to the two-dimensional case. 


20.4.7.1 Graphical Representation of Surfaces 


The command P10t3D in its basic form requires the definition of a function of two variables and the 
domain of these two variables: 


Plot3D[f(r, y], (x, a, τε}, (y, Yas v] (20.46) 
ΑΙ options have the default setting. 
E For the function z = 2? + y?, with the input 

In[1] :— Plot3D[x? + y?, (1, —5, 5}, (y, —5, 5}, PlotRange—> {0,25} 
we get Fig. 20.8a, while Fig. 20.8b is generated by the command 

In[2] := Plot3D[(1 — Sin[r]) (2 — Cos[2 y]), (x. —2, 2}, (y. —2, 2}] 
For the paraboloid, the option PlotRange is given with the required z values, because the solid is cut 
at z = 25. 
20.4.7.2 Options for 3D Graphics 


The number of options for 3D graphics is large. In Table 20.17, only a few are enumerated, where 
options known from 2D graphics are not included. They can be applied in a similar sense. The option 
ViewPoint has special importance, by which very different observational perspectives can be chosen. 


Figure 20.8 


20.4.7.3 Three-Dimensional Objects in Parametric Representation e 
Similarly to 2D graphics, three-dimensional objects given in parametric representation can also be 
represented. With 

ParametricPlot3D[(/.[t. ul, fylt, ul, falt, ul}, (6 ta, te}, {u, ua, Ue] (20.47) 


a parametrically given surface is represented, with 
ParametricPlot3D[{ falt], f, [t]. f. |t]. (4. ta, te}] (20.48) 
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a three-dimensional curve is generated parametrically. 


Table 20.17 Options for 3D graphics 


Boxed default setting is True; it draws a three-dimensional frame around the surface 

HiddenSurface sets the non-transparency of the surface; default setting is True 

ViewPoint specifies the point (x, y, 2) in space, from where the surface is observed. De- 
fault values are (1.3, —2.4, 2} 

Shading default setting is True; the surface is shaded; False yields white surfaces 

PlotRange {2a, Ze}, {{2α, te, (Ya; Ye}, (2a; Ze}} can be chosen for the values All. De- 
fault is Automatic 


Figure 20.9 


E The objects in Fig. 20.9a and Fig. 20.9b are represented with the commands 
In[3] := ParametricPlot3D[{Cos|t] Cos[u], Sin|t] Cos[u], Sin[u]], (t, 0, 2Pi} 
(u, —Pi/2,Pi/2}| (20.49a) 
In[4] := ParametricPlot3D[{Cos|t], Sin{t],t/4}, {t,0, 20)] (20.49b) 


Mathematica provides further commands by which density, and contour diagrams, bar charts and sector 
diagrams, and also a combination of different types of diagrams, can be generated. 


W The representation of the Lorenz attractor (see 17.2.4.3, p. 887) can easily be generated by Mathe- 
matica. 

There is a series of recent developments most of which are not to be shown in a book. One can eas- 
ily build a GUI (graphical user interface) to utilize interactive properties of the program. Most of 
the calculations are parallelized automatically, but functions such as Parallelize and ParallelMap 
provides the user to create his/her own parallel programs. The extremely fast graphic cards can be 
programmed at a very high level (as opposed to other languages) using such functions as CUDALink, 
OpenCLFunctionLoad etc. An extremely useful example of dynamic interactivity tool is Manipulate 
which in the simplest case shows you the parameter dependence of a family of curves. Working in the 
cloud or using the computer Raspberry Pi (which comes a free Mathematica license) should also not be 
unmentioned. 


21 Tables 
21.1 Frequently Used Mathematical Constants 


π 3,141592654... | Ludolf constant (7) 1°/, | 0,01 percent 
6 2,718281828... | Euler constant (6) 1°/,, | 0,001 per mil 
Q 0,577215665... | Euler constant (C) /2 1,414 2136. .. 
lge = M | 0,434294482... | n10 = M-! = 2,302585093... | V3 1,7320508... 
152 0,301 030. . . In2 = 0,693 1472... V10 | 3,1622777... 


21.2 Important Natural Constants 

This table contains values of constants, recommended in [21.19], [21.20], [21.21]. In parenthesis is given 
the standard uncertainty of the last two digits. The note (fixed) indicates that this value is fixed by 
definition. 


Fundamental constants 
Avogadro constant Na = 6,022 141 29(27)-1073/mol 
velocity of light in vacuum co — 299792 458m/Ss (fixed) 
gravitation constant G - 6,673 84 (80)-10-!!m3/(kgs*) 
fundamental electric charge e = 1,602 176 565(35)-107!9 C 
fine structure constant a = pocoe?/ (2h) = 7,297 352 5698(24)-10-3 
Sommerfeld constant a! = 137,035 999 074(44) 
Planck constant h — — 6,626 069 57(29)-10-4Js = 4,135667 516(91) -10-eVs 
Planck quantum h/(27) h = 1,054571 Τ26(4Τ):10-31]5 

= 6,582 11928(15)-10- !6eVs 
Electromagnetic constants 
spec. fund. electric charge = —e/m, = —1,758820088(39)-10H C kg! 
permeability of free space Ho — — 47-1077 N/A? = 12, 566 370 614- 1077 Vs/Am (fixed) 
permittivity of vacuum Eo Ξ l/(uoc2) = 8,854187 817-1071? As/Vm (fixed) 
quantum of magnetic flux dy  -—h/(2e) = 2,067 833 758(46) 107? Wb 
Josephson constant Ky = 2e/h = 483597,870(11)-10? Hz/V 
ν. Klitzing constant Rx = h/e? = 25 812,807 4434(84) Q 
quantum of conductance Go =2e/h = 7,748 091 7346(25)-10 ^ S 
character. impedance (vacuum) Zo = 376,730313461 Q (fixed) 
Faraday constant F =eNy, = 96 485,3365(21) As/mol 
Constants in physical chemistry, thermodynamics, mechanics 
Boltzmann constant k = Ro/Na = 1,380 6488(13) 10 77J/K 

= 8,617 3324(78) -10-°eV/K 

universal gas constant, molar Ro = Nak =8,3144621(75) J/(mol K) 
molar volume of inert gas Ving = BoTo/po = 22,710953(21) 107? m?/mol 
(Τὺ —273,15 K, po =100 kPa 
molar volume of inert gas Vm = BoTo/ po = 22,413 968(20) 107? më /mol 
(Ty —273,15 K, pı =101,325 kPa 
Loschmidt constant (To, po) noo = Na/Ving = 2,651 6462/24) -10% /m? 
Loschmidt constant (To, pı) nog = Na/Va, = 2,686 7805(24) -10% /m? 
standard acceleration of gravity (earth, gan = 9,80665 ms~ (fixed) 
45° geographic latitude, sea level) 
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Atomic electron shell and atomic nucleus 


atomic mass unit u Mu = (10 ?kg/mol)/N4 = ima (?C) 

= 1,660 538 921(73)-10-2" kg 
quantum of circulation (electron) s = h/(2me) = 3,636 947 5520(24)107* m?/s 
Bohr radius αρ = ħ? /(Eo(e)e?) = re/a? = 0,529 177 210 92(17) «10719 m 
classical electron radius Te = a7 a9 = 2,817 9403267(27) 107^ m 
Thomson cross-section To = 8rr?/3 = 0,665 245 8734(13) 1075 m? 
Bohr magneton m = eħ/(2me) = 927,400 968 (20)-10?6 J/T 

= 5,788 381 8066(38)-10~eV /T 
nuclear magneton Hk = eh/(2mp) = 5,050 783 53(11)-10-7" J/T 

= 3,152 451 2605(22)-10-8 eV/T 
nuclear radius R = roA!/3; rg = (1,2...1,4) fm; 1 € A < 250: 


9fm2 R> ro 
rest energy 


atomic mass unit Eo(u) = 931,494 061(21) MeV 
electron Eo(e) = 0,510998 928(11) MeV 
proton Eo(p) = 938,272 046(21) MeV 
neutron Eo(n) = 939,565 379(21) MeV 
rest mass 
electron Me = 9,109 382 91 (40) «10751 kg = 5,485 799 0946(22) -1074 u 
proton Mp = 1,672621 71(29)-107” kg = 1 836,152 672 61 (85) me 
= 1.007 276 466 812(90) u 
neutron Mn = 1,674927 351(74) -107° kg = 1 838,683 659 8(13) me 


= 1,008 664915 60(55) u 


magnetic moment 


electron με = —1,001 159 652 1859(41) μμ 

= —928,476 412(80) 10-25 J/T 
proton Hy = +2,792847 356(23) jn, = 1,410 606 71(12) 102 J/T 
neutron μα = —1,913 042 72(45) ju, = 0,966 236 4τ(23):10-5 J/T 


21.3 Metric Prefixes 


Prefix Factor Abbrevation | Prefix Factor Abbrevation 
Yocto 105 y Deka 10! da 
Zepto 107?! z Hekto 103 h 
Atto 10715 a Kilo 10? k 
Femto 10-19 f Mega 106 Μ 
Pico 10-12 p Giga 10° G 
Nano 107? n Tera 02 T 
Mikro 1078 n Peta (t P 
Milli 107? m Exa 018 E 
Zenti 107? ς Zetta 021 Z 
Dezi 107! d Yotta 02 Y 

B 10° = 1000. M 107? = 0,001. B 10°m = 1 km Bl 1m — 109m. Bl 1nm = 107? m. 


Remark: The metric system is built up by adding prefixes which are the same for every kind of mea- 
sure. These prefixes should be used in steps of powers with base 10 and exponent +3: milli-, micro-, 
nano-; ruther than in the smaller steps hecto-, deca-, deci-. The British system, unlike the metric one, 
is not built up in 10's e. g.: 11b = 16 oz = 7000 grains. 
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21.4 International System of Physical Units (SI Units) 


Further information about physical units see [21.8], [21.4], [21.15]. 


SI Base Units 


length m meter 
time 8 second 
mass kg kilogram 
thermodynamic temperature | K kelvin 
electric current A ampere 
amount of substance mol | (1 mol =N; particles, NA = AVOGADRO-constant) 
luminous intensity cd candela 
Additional SI Units 
plain angle rad | radian a=lI/r,1lrad=1m/1m 
solid angle sr steradian Q = $/r?, 1sr = 1m?/1 m? 
Examples of SI derived units with special names and symbols 
frequency Hz Hertz 1 Hertz = 1/s 
force N Newton 1N = 1kg m/s? 
pressure, tension Pa Pascal 1Pa = 1 N/m? = 1kg/(ms) 
energy, work, J Joule 1J-iNm-lkgm?/s? 
quantity of heat kWh | kilowatt hour | 1kWh = 3, 6-10° J 
power W Watt 1W=1Nm/s=1J/s = 1kgm’/s? 
electric charge C Coulomb 1C=1As 
electric voltage γ Volt 1V =1W/A = 1kgm?/(A sè) 
electric capacitance F Farad 1F = 1 C/V = 14A?8?/J — 1 A?s!/(kg m?) 
electric resistance Q Ohm 1Ω = 1 V/A = Lkgm?/(A? s?) 
electric conductance S Siemens 1821/0 =1A?s?/(kgm?) 
magnetic flux Wb | Weber 1Wb = 1Vs=1kgm?/(As?) 
magnetic flux density T Tesla 1T = 1Wb/m? = 1 kg/(A 2?) 
inductance H Henry 1H =1Wb/A = Hkgm?/(A? s) 
luminous flux lm Lumen 1 lm = 1 cd sr 
illuminance Ix Lux 1 lx = 1 cd sr/m? 
Further derived SI units without special names 
speed, velocity m/s acceleration m/s? 
angular velocity rad/s angular acceleration rad/s? 
momentum kgm/s | angular momentum kg m? /s 
torque Nm moment of inertia kg m? 
action Js energy Ws 
area m? volume m? 
density kg/m? | particle number density | m^? 
electric fieldstrength | V/m magnetic fieldstrength | A/m 
heat capacity J/K specific heat capacity J/(K kg) 
entropy J/K enthalpy J 
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activity 


dose equivalent 
absorbed dose 


Further derived SI units with special names and symbols 
Bq | Becquerel | Bq = 1s7! 
Sv | Sievert Sv = J kg! 
Gy | Gray Gy = Jkg ! 


Some units outside the SI accepted for use with the SI 


area ar 
area ) 
volume 
velocity km/h 
mass u 
energy eV 
focal power | dpt 
pressure ar 
mmHg 
plain angle | grad 
minute 
second 
time min 
h 
d 


a 


Ar 
barn 
litre 


unified atomic mass unit 
metric ton 
electronvolt 


diopter 

Bar 

mmHg column (Torr) 

1° = 7/180 rad 

1’ = (1/60)° = 7/108 00 rad 
1" = (1/60)! = 7/648 000 rad 
minute 

hour 

day 

year 


ar = 100 m? 

barn = 10-25 m. 
= 107m? 

km/h = 0,277 778 m/s 

u = 1,660 5655- 107” kg 

; — 1000 kg 

eV — 1,602 176 565(35) 


2 


1077? Nm 


dpt = 1/m 
bar — 10? Pa 
mmHg — 133,322 Pa 


° = 0,017 453 293 ... rad 
' — 000 290 888 ... rad 
" = 000004848 ... rad 


min = 60s 
h=3,6-10's 

d= 8,64-10* s 

a = 365d = 8760h 


Some units outside 


the SI currently accepted for 


use with the SI 


length ua astronomical unit AE = 149,597 870-10? m 
ος 281906 pe = 20,851: 1015πι 
y ight year Lj = 9,460 447 63-10? m 
A Angstrom À —107 m 
sm nautical (intern.) mile sm = 1852 m 
volume vbl .S. barrel petroleum bbl = 0,158 988 m? 
plain angle | gon gon 8 = 0,57-10-? rad 
gon minute © = 0,52 107! rad 
gon second ce = 0,57. 1075 rad 
velocity kn knot kn = 1sm/h = 0,5144 m/s 
energy cal calory cal — 4,1868 J 
pressure atm standard atmosphere atm = 1,01325-10? Pa 
activity Ci Curie Ci = 3,7- 1019 Bq 
Remark: 1 Pa =1Nm~? = 107? bar = 7,52 - 107? Torr = 9,86923 - 10-9 atm; 


1 atm = 760 Torr = 101325 Pa; 1 Torr = 133,32 Pa = 1mmHg; 


About units accepted only in some EU states see [21.15]. 


bar = 0,987 atm = 760,06 Torr . 
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21.5 Important Series Expansions 


C 
Function Series Expansion Ee E 
egion 
Algebraic Functions 
Binomial Series 
(ac x)" | After transforming to the form a" (1 E z) one Ir| <a 
; . d form » 0 
gets the following series: lz| <a 
form « 0 
Binomial Series with Positive Exponents 
(ta) | 14-mz4 E 1) 2 4 m(m — πμ -- 
(πι 0) quay mon Zi καπ) l eit 
n! 
" "ν.μ. ΝΡ 
ls 1---ᾳ qa T a dex «1 
is 4^ 34.8 ^ 4.8.12. τσ τ; 
1 1-2, ,1:2:5 , 1-2.5. 
1 x)3 1 τα oe? 1- 2 98 : 3 μετ τ] €1 
3 3.6 3-6-9 3-6-9-12 
' i. db a: Wists, dde 
ttn) Jy 24 ji A. < 
ο d T EE e ÉÉ* TA ve 
3 3 3:154 8:1:1 ας ὃν] 1.5 
17r)? 1 Tx Tr 3 -ᾱ -r «1 
AP a Gd Sag 24-098. Pls 
: 5 Bude. ο ο ου 
1 ΕΣ |1 um 2 Pe +2 : α) .. Ὁ 2 os 1] «1 
2 2:4 2.4.6 2.4.6.8 
Binomial Series with Negative Exponents 
Gta)" | 12 mz 4 mE 1) 2 z mnt Dent s a 
(m > 0) qq ee 1) n E aided 
n! 
1 15, 159, 159.1 
(r)i | lp =a a ie | cur 3 F z| «1 
4 4.8 4.8.12 4.8.12. 16 
: ν. eee νι} 
1x2) 3 |lT zw 2? F ; 1 
a 3 7 $8 T3697 35:912 m 
ye yd 11-35 1-3-5 3 1:83:5:Τ αμ. 
πα στ T2468 κ. 
ει |1Fr+r’ Fr? +r z| «1 
ς ΡΝ TN .5.7 
(+g)? 13. E 3 r2 — LEE $T r| «1 
2 2.4 2.4.6 2.4.6 
(1x x)? | 1x 2z + 30? F 403 + 5at xe z| «1 
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ο " 
Function Series Expansion "erum 
5 5 5.7 4, ,9:T.9 4 5.7.9.11 
(lta)? |13 cue 2g? í -ᾱ 4 : at s| «1 
2 2.4 2.4.6 2.4.6.8 
1 
(lic)? πρ... 4x? +4: 5a? F5- θα... z| «1 
1 
(1x a) Pp 2 32 3:-4r F3- 4: 52? 
T4: 5-62? $5-6- Ta! +--+) x| <1 
1 
(Is)? tg 3.3.4 3.4. 5x 3-4: 5. 6x 
4.5.6. T2? $5-6-7-80t+---) 2] «1 
Trigonometric Functions 
. n a? " qnl 
sina db Rr μι (-1) Qna κα |z| < oo 
ΜΉΝ . n z?sina x cosa 
sin(x +a) | sina +acosa— gr = a 
z^sina x" sin (a + 3 
j be dire ο |z| < oo 
4! n! 
T? n xê τ x?” 
cos x 1 ata Π +: + (-1) Qr |z| < oo 
MN " Au x cosa x sina 
cos(x + a) | cosa — x sin a 2 * 3s 
xt cosa ax” cos (a+ ar) " 
Π t ni F |z| < oo 
i 14,2 5,17 y, 62 ο 
απο ο τα x 2 3 t 
ΓΦ 15 315 ^ 2835 ^ 
2 (920 1)B, od π 
za n * < mes 
(2n)! $ [οἱ 2 
" 1 Eq). Dg? σ΄ 
cot x t t t 
" |e [3 45 945 4725 
2"B, wa τ 
i Tte 0 < |z| <a 
(2n)! 
d 61 277 
sec x 13 Ly? i 4 6 18 | 
ld 2^ ^24" ταῦ, * 8064 " 
En 2 
2n | < π 
2n)! 
(2n)! 2 
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Function Series Expansion M eon 
egion 
1 1 T 3 3l ος 127 z 
cosec T d ze z"4 2" c+ 
x 6 360 15120 604800 
2(277 1) αι 
* Gry Bat 0< |x| «T 
Exponential Functions 
P ο x φῦ l x” 
: pratat tmt Ese pe 
rl rl 2 rl 3 mi n 
a? = οἳ Ina 1 a " g } ία P a) ία E a) Feed ία 2 a) Fee |α| « oo 
! 2! ! n! 
E: ι. 9 i Bia? Box! à He 
e? —] 2 2! 4! 6! Brun 
-1γητι Daw Lee z 2 
+(-1) σσ] |z| «2π 
Logarithmic Functions 
r—1 r—1) 2 -- 15 
In x 2 | & y t (z K 
z+1 3(¢4+1)3 5(z+1) 
(x _ 1)» " 
n+ ya & Dj * ds 
x—1) 2 -- 19 a—1)* 
In « (x — 1) -— a - dee 
z-1y 
yin ED 0<r<2 
n 
. x-—1.(2—1* {α-1).. ο υπ E: 
5g vo 2232 ^35 ^ gn -; 2 
95 23 Q4 n 
In(l+2) |z ed 3 5 + +(-1)"t1 τ d -l<a<l 
n 
t 2 x? xt "n p^ 
In (1 — x) beg y "E | -l<ar<l 
1152 x? x Pu qnl 
ο ο ο ο. jsi 
7 ls [e+ 3 5 Tt 2n ΕΤ [οἱ 
= 2 Artanh x 
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Function Series Expansion UN 
egion 
qoe 1 1 1 1 1 
π(ξΞτ) |2 τας "οἷ. μις posi a|>1 
eet E T 3a? δας — Ta (2n + 1) 22911 [οἱ 
=2 Arcoth x 
2 τά 16 92n-1 B r?” 
In|sina| | In|x E 0 « |z| « 
nlinz| |Ile| "6 — 185 — 2835 n 2n)! ae 
cot a5 1708 
In cos z 
2 12 45 2520 
22-1 (220 E 1)B, x?” T 
|α| <5 
n(2n)! 2 
1, 7 62 κ 
In|tanz| |n|r|4- za? 4 [gi af fee: 
3 90 2835 
zn (gant = 1)Bn P T 
| η νε. 0< |z| « 2 
n(2n)! [οἱ 2 
Inverse Trigonometric Functions 
i x 1.325 1-3-5427 
arcsin x 4 | t t 
2.3 2.4.5 2.4.6.7 
1.8 δι (2n — 1) à? 
POSU AMT + jx] « 1 
2:4-6---(2π)(2η + 1) 
T αὐ εθαδ 1:83.δαΐ 
arccos © a+ F i 
2 2.3 2.4.5 2.4.6.7 
15355-2998. l qnl 
= m τ po jal « 1 
2-4-6--- (2n)(2n + 1) 
3 5 T 2n4l 
arctan x πως : : ! + (—1)” ΠΤΙ ja] « 1 
ar te v. 1 | 1 1 | 1 
arctan 2 t5-35+3m B5 T7 
1 
+(—1)"*1 ; ΞΕ 2] » 1 
(-ῃ (2n + 1) z2n1 lel 
gg og pu 
arccot x on |- + 5 7 +-++++(-1) mae jr] <1 
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Function Series Expansion SU 
egion 
Hyperbolic Functions 
. gP αὔ gt SI 
τα a er gen 
xz? t οὔ q^ 
cosha | 1 οι πι στ + Gayl Eee |x| < oo 
mi 1, 2, 17 , 2 62 ο 
anh x $—-m«- r 4 X 
3 15 315 2835 
(-1)' 22n(gan T" 1) ο : T 
(2n)! πα ἘΠ [αἱ < 9 
ΠΗ 1 g x 217 xT l 
0017 [x *3 45 945 4726 
=] n+1 g2n : 
ED nant - 0«|z| <r 
(2n)! 
"mer l 2, 5 4 6l ο, 1385 g 
sech x 1 π΄ tq? "TES tT 
(-1)" 2n T 
le els <= 
(2n)! á lzi 2 
1 α 7x 31α7 
cosech x =F = | 
x 6 360 15120 i 
2(—1)” 0) n-r... 1) and 
+. Bax" ee 0 « αἱ « 
+ (on)! x + |x| « v 
Area Functions 
1 4 1.3 1-3-5 P 
Arsinh | x 2.3 α 4 2.1.5 x? 5 PT αἱ 4 
1-3-5---(Qn-1) 3 
+ 1)". geet + g| «1 
ο σσ στη I 
1 1-3 1-3-5 
Arcosh x | + |In (22) 2.22 3.4.4 QC KCN. qd 
3 pd aet. m2n+1 
Artanh x | Ὁ ἡ E | E +5 + ELS Eee ja) « 1 
1 1 1 1 1 
Arcoth ar to toe tH — 151 
αυ 333 Ba? Τα οσα |z| 
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21.6 Fourier Series 


1. y for0< g < 2x 


(= x  Sin2x i sin 3x 


πο 4 3 


2. y=aufor0<a<7 


y = 2r — x for mt < x < 2r 


g 4 cos3r | cosbr 
COS X ł 


32 


52 


1 2 3 


2 (= x sin2dx | sin3x ) 
y= m 


47. singz | sinbr 
y= Ξ sin x 3 t 52 ien 


5. y—aforÜüc rz «m 


y = —a for m <a < 2r 


4a f , singz sins 
y= sin x 4 | Ξ σα 
π 9 5 
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6. y=0for0 < zx < qaand fort —a < zx < rT +a and 2r -a < zr < 27 
y=afra<r<r-a 
y =—afrr+a <r <n- a 


4a ; 1 πα. 
y= --- (cos asing + 3 008 3a sin 3x 
π 


chia 


1 : 
-czc0os5aosin5z--:- ) 
5 


σι 
a 
xy 


απ 
7. y=—for-a<a<a 
a 


y=afora<u<an-a 
4a 1 
a(r—z = —— | sin asin z + — sin δα sin 3x 
E del p rr ταί 32 
α 


{ρω Ὁ 
y=-—aforn+a<a<27-a +z Sin asint +- ) 
5? 


63a f . 
y= (sin x 
π 


1 1 1 
5 sin 5x + 2 sin 7x iP sinlla+-- ) 


Especially, for a = : holds: 


8 y-ez?fo-mXmvXm 


πλ ES T  cos2r , cos3x ) 
AY d 1 mom 
-n |On 21 3n 4n 
9. y—z(r—x)fort0E v Em 
om = 2r | cosda , cos 6a " ) 
Ay πμ p ^m 'g 
NINAN 
-π 0 m 2π 


10. y-x(r—-x)fo0XrXm 
y—(r—z)(2z — x) for t < x € 2n 


=- 8 ΕΠΙ a 1 m 2η’ 4 1 amm 5r 
y τν sing + z3 sin 3x 4 5i sin ας ο. 
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11. y=sinzfor0<a<7 


2 4 /cos2a cos 4x , cos 6x 
T 3 E T E T idc 


AY Vm "TX 1-3 E 5.1 
ΨΥ 
-π 0 m 2π 
12. y= cosx for 0 <x «m 

4 /2sin2x  A4sindr  6sin6z 

Ay y= ( — 4 | e] 
1 π 1.3 3.5 5-7 
-7 0 

-1 


13. y —sinrforüÜcz r € 


y — 0 fors € x € 2x 


1 " Y. 2 (= 28 z cos 4x 
y——-rzsinr—-— 
"Tg ος πι τα 3:8 
cos 6x 
5-7 
14. y —cosurfor—-z Xr <T 
2u sin um | 1 cost  cos2r σοβϑα | 
y= : d - ἃ μα 
d π 2uà 42—1 w—4 wuw-9' 
(u arbitrary, but not integer number) 
15. y= sinuzg for -r « r «7 
2sin ut ( sin 2; 2sin2r | Jsin3r : ) 
y= - "m 
d π 1l-w 4-—u? 9—u? | 
(u arbitrary, but not integer number) 
16. υν--αοσοβαἴογ-π«α«π 
d. 4sin2r 6sin3x 8sin4r 
= —- sing - - - 
a "2-1 gi 4-1 


17. y=-lh (2 sin 5) forO<a<a7 


1 1 
y = οοβα + 5 cos 2x + 3 cos3z +: 


18. y — In (2 cos 5) forO<a<a7 


1 1 
Y = COs £ — — cos2x cos 3x 
y = cos x 5 5 21 3 x 


1 T 
19. j= gin cot 5 forc zem 


1 1 
y = cosx + 3 00595 + z cost +++ 
5 
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21.7 Indefinite Integrals 


(For instructions on using these tables see 8.1.1.2, 2., p. 482). 
21.7.1 Integral Rational Functions 
21.7.1.1 Integrals with X = az + b 

Notation: X = ax +b 


1 

1. 5 dx = a (n#-1); (for n — —1 see No. 2). 
2 (= = Inx 

X a 
3 fex” dz = 1 xr b Xr+! 

a ^. a2(n + 2) a?(n + 1) 
(n Z —1,z —2) (forn — —1,— —2 see No. 5 und 6). 

myn 1 "i m n 

4. E X de = — | (Xx — b)” X” dX (n Z -1,4 -2,...,4—m). 


The integral is used for m < n or for integer m and fractional n; in these cases (X — b)' is expanded 
by the binomial theorem (see 1.1.6.4, p. 12). 


5 I cec ue 
J X 3 
x dx b 1 
6. πχ + qa nx. 
7 xdr 1 1, b 
I Χ ὦλ Χ 2X3] 
8 a dx 1 -1 b ( 142) 
: X^"  q?l(n—2)X"? ' (n—1)Xn-! ü i À 
x? dr 1/1.» "TN 
ο. [7 = 5 (5x? - 2x +Pmx). 
τ ds 2 
i E liy am 
J x? αὖ 
ii x? dx 1 -- 20 p? 
EI IC E axe 
a? dx 1 -1 2b p 
12. = ] j 133 : 
` Xn a? ls zm 3)X^-3 | (n = 2)Xn-3 (n = =| (n A i 3) 
z? dx 1/X? 3bX? ] ; 
13. J ee PX αρχ — lax). 
: X a^ A 3 2 
a da 1 (xX? "E b3 
14. y στι ( 5 3bX + 3b" In X 4 xl 
a? dx 1 , 3b? b3 
15. x3 A (x 3bln X X + za) 
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Vd 2 3 
16. EUR E (mx E. + b ). 


Xt a X 2X? 3X3 
17 x dx E. -1 3b 3b? ; [d 
Jo X" ad |(n-4)X" (n-3)X*3 (n-2)X"3 ! n—-1)X"-! 


(n z 1,2 2,2: 3, Z 4). 
18. e = μας 


19 / dx 1 ( X, x) 
-J Ix plait sy): 

20 / dx 1 i X , λας ara? 
JX 7 BA x X 2X1) 


dx 1 X VC (n — 1Y (--α)αά 
: =-— |n—-— PS 
n / aX" qv |n ων. ( i ) ixi (n2 1) 


i=1 
22 / dx M αι X 
JEX ^ dm Wow 
dx 1 1 2, X 
23. f= | pus] |. 
J x? X? ΖΧ abe P r 
dx 1 2 1 3, X 
24. dau Ex nz]. 
12Χ3 [22X2 PX i abz P x 
dx 1 5 (πλ (ar X X 
25. / = l n> 3). 
x2 X” pnl | Σ (") (i — 1)X—1 τ T HAE z (n2 2) 
dx Ela X 2aX x? 
26. / - = - |a? In 7 | IE 
23Χ p? z x Qx? 


T / da: 1 3021 x, a?z X? 3aX 
` J g3X? ΠΗΓΕΣ; zd 


28 / dx 1 6a? 1 X, Δαν atr? X?  4aX 
` J X? δ] a TX 2X3 ' 2g? zo 

29 / dr 1 E {351} (-aya*? , PX? | (n+ lax 
`J Xe bp ΕΝ i =A ο 1 


D 


1 m+n—2 m+n—2 xm l(_a)! 
30. jJ zmn = NEN 1 (m. ur 1)gm-ii : 


ES If the denominators of the terms behind the sum vanish, then such terms should be replaced by 


m+n—2 X 
(" id Jean l. 
xv 


πι-- 1 


i=0 
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31 


32. 


33. 


34 


35. 


36. 


37. 


38. 


39. 


21 


40. 


41. 


42. 


43. 


bf —ag 


Notation: A = 
ax +b ar Δ 
. fag f | pine +g) 
dx l, fr+g 
/ (ax +b)(fa+g) Δ ax +b (A70): 


x dx E g 
. In(az + b) — = In( fx 4 A : 
/ (av+b)(fe+g) Δ E ο. f alfa o) PW 
dx d 1 f, fxt+g 
` | errea σα ax +b πώ 
rdr b a Tt 
= I b). 
/ Urs a-et a- τος “7” 
22 dx p a? b^ — 2ab 
- i + ln(b + x 
J (a+az)(b+2)? (b-—a)(b+x) (b-a) ee) (b—a)? neta) 
dx =1 1 1 2 a+r 
/ 2 z~ p ( H ;ln (a 
(a4 rj(b4-zr? (a-b? \a+r bcc (a-b) b+a 
x dx 1 a b a+b a+ 
/ 2 z= . -ln (a 
(a+2)2(b+2)? (a-b)? \a+r b+r (a-b) b+a 
x? dr -1 a p 2ab a+r 
/ : z= : -In (a 
J (a+x}(b+r)} (a-b? (a+r b+e (a-b) b+2 
.7.1.2 Integrals with X = az? + br +c 
Notation: X = ax? + bx +c; A = 4ac — b? 
lx Θα 
us ES arctan ES for A > 0), 
= BOE Artanh TX for A « 0), 
zi In ΓΙ for A < 0). 
ν-Δ 2axz+b+V-A 
dr — 2ax+b | 2a [dx 
X AX "AJ X 
dr — 2ax+b ( 1 δα ) 6a? pdx 
Xs A A2X? " AX) ΔΖ) X 
dr κ 2ax +b , (2n — 3)2a dx 
J X^. (n-DAX™1 | (n-1)A J Xe 


(see No. 40). 


(see No. 40). 
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x dx 1 b 11 
44. E ος = (see No. 40). 
X 2a 2aJ X 
x dx ba+2c b fdr 
45. x3 AX Al X (see No. 40). 
46 sdr c bx + 2c b(2n — 3) dx 
` X^ (n—1)AX"!  (n—1)A J Xn 
αν zx b . ?-2ac fdr 
47. x EET In X 4 σα X (see No. 40). 
αν (P —2ac)r bc 2c p dx 
48. x aAX tA I x (see No. 40). 
17 da: -- ] da: n — 2)b ; dx 
ades E Eu μια” ad (see No. 43 and 46). 
X^ (2n —3)aX"-!  (2n—3)aJ X” (2n—3)a) X” 
50 / a™ dx qm (m — 1)c / x” dr 
DESC (2n — m — 1)aX^-1 τα —m-l)as X^ 
E gn 1a 
T2 m» Da E y^ (m 4 2n — 1); (for m — 2n — 1 see No. 51). 
51 / a? dy 1 ie sS dy c / gdr b / gr? dy 
Τον Xn α. Xn-1 a. Xn "E Xn 
dx 1,2? b [dx 
52. / οσοι In x "xx (see No. 40). 
dx 1 b [dx 1 dx 
"E fam. 
J αχ.  2c((n—1)X"^! 2cJ X" cJ xX! 
dx b X 1 Poa dx 
54 [aL l see No. 40). 
J PX 2°? cr * (zs =) X (6e No:40) 
dx 1 (2n +m — 3)a dx 
55. / = / 7 
maX" (m — lex” ΧΑ 1 (m= le qm-2Xn 
(n 4- m — 2)b ] dx 
πι > 1). 
(πι-- 1ο. J amlxn Dd 
dx 1 xv É 
56. / AX ; zy |f ln μι 
J Get gX ~ ef? - gbf + gia) X 
2ga — bf dx 
} : : see No. 40). 
πρ-πγιρρν "Ὁ 


21.7.1.3 Integrals with X = a? +2 


z?, 


Notation: X = a? + 


T 
arctan — for the 
α 


“+” sign, 


» 


sign and |x| < a, 


" 1 +a 
Y = 4 Artanh κ... In επ for the “ 
EY ee 
Arcoth ~ = = In 2 for the “ 
a 2 D— 


If there is a double sign in a formula, then the upper one belongs to X = a? +x”, 
and the lower one to X = a? — z?, 


" sign and |x| > a. 


a>0. 
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57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


τι. 


72. 


73. 


ΤΑ. 


75. 


76. 


dr 1 
JX a 
dx a 1 
dx zx 3r ne 
] X5 7 axi max * Bess 
dx x ,2n-1 dx 
/ X^ MaX” ^ 2na? / xn 
nM = at In X. 
X 2 
zdr 1 
J x? Χ 
ταν. 1 
J xs ~ 4x2 
a dx 1 
Xe = Tas 79 
xv? dx 
X SATAY. 
z? dx a 1 
X? OX” Qa 
αν. . 2 x 1 Y 
X3 4X? 8a2X  8a35 ` 
wdc _ & 1 dx 
/ X"H 7 *3nX" ^ On / xwv "70. 
[Es da » Q2 . a aX 
R X ΠΝ; 2 
En = Ls ΠΧ 
Χ2 2X 2 
x? dx 1 a? 
X? T 2X | AX? 
x? dx 1 a? 
Xm δη χι s CPD 
— = B In r 
aX 2@ X 
dx 1 il z? 
1 = rout In —. 
zX?  2a?2X 2a X 
/ dx 1 | 1 1 T g? 
1X3 4a? X? 2atX 245 X`’ 
dx L221 
pee am αἲ ᾿ 
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TT: 


78. 


79. 


80. 


81. 


82. 


dx 1 x 3 
P atx  2a^X © 2a5 
/ dx 1 x Tx 15 Y 
α2Χ9 a®x ' 4a^X? ' 8a9X ' 8αΐ ` 
Í dr κ 1 | 1 In 2 
wx 2a?:? 2αἱ 
/ dr _ 1 1 1 In x 
J aX? 2a'a3? | 2a*X aê ` 
/ de — 1 1 1 _ 3 n x 
J z3X3 2a9:? © a9X © 4a1X? ' 2α8 
/ (b Du ΠΕΙΣ πλ gx E a : 


21.7.1.4 Integrals with X = a? + 2° 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


91. 


92 


93. 


94. 


/ 


/ 


J 


/ 
/ 


Notation: a? + z? = X; if there is a double sign in a formula, the 


upper sign belongs to X = a? + z?, the lower one to X = a? — x. 


dx 1 (atx)? , 4 "T rca 
X θα a Far +22 ὧν ium αν ` 
dr _ c à 2 dx 
X? 3X 32J X 
2 dx 1 1 a? t ax 4- a? 1 T" 2r a 
= n + arctan : 
X θα ata)? av3 av3 
gdr z? mE a dx 
X?  38X 30) X 
a? dx 1 x 
Ξ t-lnX. 
X 3 
x? dr m 
Xx? "3X 
x? da cux f dX 
—crc« 
X X 
a? dx 8 1 pdx 
X? 3X 3J X 
dr 1 z’ 
— = h. 
πΧ 3a X 
dx 1 -i l a? 
zX2' 3a3X | 3d6 X 
dx ι. 1 frdr 
12Χ αν αν X 
dr _ 1 x 4 ax dx 
g? X? aóz | 3a9X 8345) X 


(see No. 


(see No. 


(see No. 


(see No. 


(see No. 


(see No. 


83). 


85). 


83). 


83). 
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dx 1 ὮΙ rdz 
95. E 3X = He aj X (see No. 83). 


g£ 5 dx 
id [55 SXP^ S 3a6X | 3d46 ) X 
21.7.1.5 n with X = a! + αἱ 


(sce No. 83). 


1 = + azvV/2 + a? 1 azv/2 
97. fe In -+ arctan — 3 
at -- αἱ E 2_arJ/2+a2 — 2a9/2 a? — a? 
a dx z? 
98. Ia = 55 arctan --. 
aic ai 2a? a? 
obs 1 2 aV24+ a? 1 av 2 
jo J v a u n= mae d + arctan ane. 
a^ + a 4av/2 —axv/2-- a?  2aV/2 a -- x? 


a? απ 1 à 4 
100. E: ταν κ 4 In(a + a). 


21.7.1.6 Integrals with X = a‘ — z^ 


dx 1 a+r 1 $ 
101. / =—,In + — arctan 
at— rt 4d α-α 2a a 


2; dx 1 a? + x? 
192. / atri 4a? li a2 — x? 


rde d z 1 z 
103. / uni RE In wm arctan ὡς 
a^—z*^ 4a a-z 2a a 


x dx 1 4 4 
104. [ατα = -3 ina — q^). 


21.7.1.7 Some Cases of Partial Fraction Decomposition 


105 l me b 2 
` (a+bxr)\(f+gx) fb—-ag\a+br f+grj)` 


106. : - H | B I E , where it holds 
(α-Γ α)(ᾳ -Εὐ)ία--ο απ α zb ate 
1 1 1 
= B= ,C= : 
(b — a)(c — a)’ (a — b)(c — b) d (a — c)(b — c) 
1 _ A B C D E . 
107, (9 +a)(x +b)(x+c)(£+d) r-a "a+b T πο rd ΕΕ 
1 1 
--ππε-θμ-α-ᾱ-πε-Ἠπ- Ue 


108 1 u 1 b g 
| (a+ bx?)(f--ga?) | fb— ag Nac bx? f σα) 
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21.7.2 Integrals of Irrational Functions 
21.7.2.1 Integrals with yz and a? + bx 


Notation: 
br 
arctan bva for the sign “+”, 
La gH. Y ai 
A= ἔα, Y = 1, "T 
-]π------ 
a — bx 


If there is a double πη in a formula, then the upper one 
belongs to X = a? + ?z, the lower one to X = a? — bx. 


for the sign “—”. 


zd ur 
109. [Ese NE. Gy 


X ος b3 
va? dx 2yz?  2a?yx | 2a? 
110. QE £y, 
à X 3 p bt b5 
ve ντ 1 
M11. =F + — Y. 
e DX ab 
38 da: Je d -ᾱ 
daa: x dx ma αὐ κ 3a? /r δαν, 
à X2 DX b X 05 
dx 2 
113. -------Υ. 
Χντ ab 
dx 2 2b 
ua. f - sy 
XV ayr aè 
115. C 7, 


= — Y. 
X?^/r αχ * ab 
, ο 
ux / 2 3b? /T 3b 
J X? Vr a? X yr αχ a? 


21.7.2.2 Other Integrals with yx 
"dx 1 η" Nr 2 1 ο 
117. / va 7 = n= ο = | arctan 7 7 
ae 2αν2  r—ay2r-ca? αγΏ g^—4 
J dx 1 x +av2r +a? 1 αν 2x 

118. J : = — In = -- arctan — ᾿ 

J (aic ανα 2V2 »-αγΏτ!αἳ αὐγΏ ασ -ᾱ 
Vi: | Ve des c ih at γα να 

J at 2α α- x 


120 del ate tan VE 
g 4. οι Je 34 "ΝΕ, z arctan " 
J (at—a)/e 2a a JE a3 a 


1 
arctan 
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21.7.2.3 Integrals with Vax +b 
Notation: X = ax +b 


121. [VX ax = ZVÝ. 


a 


122. E o X de = B 
a? 


z 2 πρ ba b?) X 
123. |PX ae = ΠΡ“ «κος E ) 


105a 
jug [E VN 
VX a 
xdx —2(ax — 2b) 
125. — --------ν X. 
dox. 3a? vx 
{38 adx — 2(3a?z? — daba + δὺ)νΧ 
J SX 15a? l 
2 X 1,vX-vb 
"T — — Arcoth = ln for b > 0, 
127. / dx = yn b vb VX 4 vb 
nye tan 2 forb < 0 
A arctan d 77 Or 1 


VX dx 
128. f dr =2VX +b 
Joc 7 | ry X 


129 i dx u VX a da 
Τα x2/ X ba 2b. avy X 


130. / τ. LU eI 
J g x 2J ανΧ 
131 i dre VX (2n — 3)a dx 
` 2 Χ (n—1)ba"-!  (2n—2)b J| gn—/X- 


ον X5 
185, [v= die. 


90. 


133. / aVX3 dz = x ; (SVX - τον Χ5). 
à a^ 


, - 2 X? WV XT PV X4 
[2 X3 dz = — τ bv i v ) ἢ 
d αἱ 


134. 


9 i 5 


ΡΕ VX ' 
135. I ss da 2 + 2V X +b? f Le 
J c avy X 


2; dx 2 
136. == (VX 
J JX3 a ( i 


8 


(see No. 127). 
(see No. 127). 


(see No. 127). 


(see No. 127). E 
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137. 


138. 


139. 


140. 


141. 


142. 


143. 


144. 


145. 


14 


D 


147. 


148. 


149. 


150. 


2 Aig 3 2 
var co ο ο 
VX? a 3 VX 
dr κ ETE. 
rV X5 bVX 
/ da 1 3a 3a 
J e/X3 εχ  U/X OWS v/X 
, 9Χί (24n)/2 
ΠΕΡ ας -- 
a(2tn)' 


/ αχ dr = 


4+n 2+n 


9 C=. 2 
5 $ 


/ 2 y-4n/2 , 2 Gan 9b X (4+n)/2 e) 
T T= — | à 
i a3 


6+n 4ctn 2ctn 


dx. 


pe dx 9xn/2 I 


zx 


n 


dx 
/ gX” =a ape ^ dor αλα) 


1 na dx 


dr . 
/ q2Xn/2 


brX(-2/ 2b J Xn? 


21.7.2.4 Integrals with vax + band y fz +g 


Notation: X = az +b, Y = fx +g, A =bf — ag 


2 [X 
arctan for af <0, 
γ-- Y 
- ; af jx α 
VXY — vg aY for af > 0, 
“ην (να. VFX) foraf >0. 
rdr νΧΥ | 
J /XY af 2af J ΨΧΥ 
dx "E WX 
VXVY AVY 
/ "T EAT arctan A for Af <0, 
PYVX | 1 QfVX- VAT 
for A ; 
μεμα, λα. 
A+ 2αΥ A? dx 
/ VEY de = 4af VEY 8af 1 ΥΧΥ 


(see No. 127). 


(see No. 127). 


(see No. 146). 


(see No. 146). 
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Y 1 A dx 
151. ie = —V XY 
[\x" a θα) /XY 
VXdr  2V/X A dx 
i f fa VOR 


^ da n—1 
153. Y^dz κ 2 XY” E dx 
J SX (2n + 1)a 


152. 


dx 1 VX 3 dx 
154. = = + (n 5) af : 
J J/Xy" (n —1)A | Y^1 2) J XY» 


155. [vx dx = (vx +a f= x) 


νΧ 


pos 1 ( VX a dx ) 
Y" — (n-1)f 


CER 


156. 


ye 9, V/Xyna 


21.7.2.5 Integrals with Va? — x? 
Notation: X = a? — «x? 


1 , T 
157. [vx dx = 5 («vx + a? arcsin z) . 
d a 


158. [evX de = -iv X3. 
a 2 x 
159. [Xd = -7V X3 + eg σα + a? arcsin z) ] 
: α 
VX X5 αν X 
|X i= = 


160. >= a? 
VX VX 
161. J dr =VX —aln ET : 
J z x 
VX v 
162. / 3 dr = arcsin 
ἡ i 
VX VX 1 VX 
163. — dx — 4 ln er , 
J æ 23? 2a 2 
164 dx . 
! —= = arcsin —. 
VX a 
da 
165. — = -vX. 
J VX 
qua : 2 : 
166. ZT ZVX + Č arcsin 
J yX 2 2 a 


z? dx Vv X3 ; 
167. J = -aX. 
Ix .3 —— 


(see No. 146). 


(see No. 149). 


(see No. 153). 
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11; 1 VX 
168. J se --- ρε ; 
τν X a x 
169. / ο CN 
xV Χ ΠῚ 


170. 


dx VX 1 at+VX 
/ : = = In . 
13 X 2a?x? 2a? xv 


2 


1 2s 4 " 
171. [ da = 4 (v= } a Vx I “ arcsin Z) : 
: a 


1 
172. [ eX de = - LX. 
` 5 


F cV X5 «πχ ataVX a E 
173. [αν X3dr = H H + — arcsin —. 
j : = 6 24 16 16° a 
3 A VX? aX? 
174. [av X8dg = "^ % : 
B 7 5 
vVX3 vVX3 VX 
175. / 1 3 Ha? VX iut : 
x D 


176. ES = = 3 E 


3 5 zd 
πα΄ arcsin —. 
2 a 


J g 2 
V X3 VX? 3VX 3a, at+VX 
ITT: J — dx = - + ln j 
J sg? 202 2 2 EA 
178. = 
2/Χ 
179. 
180. arcsin : 
2 
181. TN 
X 
1 X 
182. jerk 
aM/X a x 
183. / dr 1 VX l ¢ 
J yX? at T VX 
T ] de 1 E. 3 pty 
3y X3 2a?:2/X  2a4/X 2a x 
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21.7.2.6 Integrals with V2? + a? 
[Notation: X=r +a 
185. |X i= - («vx » a? Arsinh =) +C 
i a 
= 5 [eve : a? ln (x | νΧ)] Εσι. 


186. [=E = d 


187. ΕΣ ES = (ov 4 a? Arsinh =) +C 
E cod 
= 1% 8 [sX Fa ?In (a | vx)] FO. 


. yX” 2/X3 
188. ΕΣ —-——-— 2 ;: 
. 5 


3 
X Ἶ X 
189. [ντα -vx Nha d 
x £ 
X X X 
190. d 2 = -YX 4 unis += - cs (s X) +0, 
191. p = vx ln oF VX 
g3 21}. 2a x 
1. n ην 
1932. oF = Arsinh - Ες - τα (a I VX) +C. 
VX a 
xv dx 
193. | “= = VX. 
JX 
194 pum UX ae he 4 C D/X ai (x | VX) +C. 
J JX 5 ο Asih- στο ο πη 1 
rdr VX? Ρ 
195. / - - ax. 
J VX 3 
196. / NM ρε 
ry X a T 
197 Í dx _ mes 
12 X atx 
198. f de d | h mety 
aX 29423: | 2a3 T 
4 mq 
199. [ dx = Ἔν! 3a? -VX | x Asinh 2) +C 
; a 


4 
4 
= i PM m, πο yx) TÓC. 
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1 
200. [ov X?dr = ΕΥ X^ 


" rV X5 3, X3 4, wX 
201. Je X3 dg = eV dev agv 2 Arsinh = FC 
i 6 24 16 16 a 
xv X5 a@aVX3 atryX αν 
E: - ln (x | VX)4C 
6 24 16 
3.5 VXT aX? 
202. p X? dr = e E 
; 7 5 
JX JX3 . JX 
203. [3 αν X — αὖ]η EE : 
i x 3 z 
J/X3 VX . ; 
204. / z dz = — + 3 E + se Arsinh * +C 
Jog -— a 
+ 50V¥ X + Ža m (2+ VX) +0. 
£ 
ν Χὸ VX VX 
205. / dx | 3i X Επ In ar ; 
J . 23 2 2 c 
pg | Loo 
a dx 1 
207. =-—. 
ν X38 VX 
x dx z 1 T 
208. + Arsinh ΄ FC ΕΙπισΕνΧ]-εόι. 
VX3 , X α VX i ( ) 1 
x? dx a? 
209. =VX +—=. 
J y X3 X 
210. J di = : ! jp A 
rv X3 αν X a £ 
211. / dr 1 (VX E 
J xy X? a\ απ X 
dx 1 3 3 a+vxX 
212. J - == UE I In : 
αγ X3 βαν X — 2a^/X — 2a? x 


21.7.2.7 Integrals with να’ — a? 
Notation: X = a? — a? 
(ενΧ — a? Arcosh z) +C 
a 
[sv X — a? ln (υ + Vx)] + Ch. 


213. / VX de= 


NI ο... 


214. [X = d 
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215. 


216 


217 


218 


219 


220 


221 


222. 


223. 


224. 


225. 


226. 


227. 


228. 


229. 


230. 


- 2 ; 
[© X dx να" | E (vX a? Arcosh z) +C 
a 
: 2 
= 2 X i E [ov X a? ln (x | νΧ)] FC. 


4 
v5 2,/ X3 
SETE T p2 i 
5 3 
: / VX iy X a arccos . 
x x 
X X i X 
; YS ages vx Arcosh : FC + In (υ | vX) H Ci. 
T z a x 
VX VE ωρα 
; a d Da? * πα 916008 — 
d. n £ 
a X = Arcosh = tC =]n (x+ VX) T6. 
1; dx 
. | -Z =VX. 
VX 
x? dx Σπ a? Areosh É 4C X a^ ( VX) σ 
t 'cosh — + C t nix F CA. 
VX 2 2 a 2 2 i 
rdr VX3 9 
-----ανΧ. 
lay 3 {αγ Χ 
| dx u 1 m 
rv, dui" arccos =. 
/ dx u JX 
J yX aa’ 


/ dx VX a 1 a 
= arccos —. 
J a39/X 2427 — 2a? x 


242, 4 . 
JE i= ντ X4 3 Arcosh 5} FC 
α 
1 2a 4 
τα [ανα XA 2a In (x I yx) FC. 


2 


1 
[VB ae = g X^ 


/ 2X3 d cV X9. atr X? απνΧ 
T 3 da: 


6 ; 
a x 

H } Arcosh — + C 
a 


p 25 16 16 
αν X5 @aVX3 αγ a® 

t t 1 p+ VX) 4 : 
6 24 16 16 (2 ) ο 


J X7 2,/X5 
Jor de = : 42 = 
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af XS JX3 . ' 
231. | dx = a? V X + à? arccos n, 
à x 3 x 
/X3 J/x3 
232. / z απ -- 5 t SVX sa Arcosh = 2 
xv 
V X? 3° i 
Ξ- + 3 MX — τῷ In (υ + ντ) +C. 
2 2 2 
VX? VX? 3VX a 
233. / — dx = — + —a arccos —. 
J g? 2112 2 2 x 
234 dx . 1 
i VX? aWX 
235 x dx E 1 
JVX νΧ 
αλ dx x T £ 
236. + Arcosh — + C +In (£+ VX) +6. 
J JX3 VX a VX ( ) a 
xv dx a 
237 =VX - -= 
ν X3 X 
dx 1 1 α 
238. / eG = SX a arccos x: 
239. | dy: ι VX i g 
eV X3 at \ 5 VX 


240. / L a ] 3 3 
J xV X? 2a?32/X — 2a4/X — 2a° 


a 
arccos —. 
T 


21.7.2.8 Integrals with Vax? + bx + c 
4a 


Notation: X = az? +br +c, A = 4ac — b? , k= * 


1 
— In (2v aX + 2ax 4- b) HC fora »0, 


i ; 
2ax +b 
] Arsinh —— 1-01 fora>0, A>0, 
ja 1.95 να vA 
vX —= In(2ax + b) fra >0, A=0, 
να 
-= arcsin a fra<0, A<0. 
dx 2(2ax + b) 
242. / ae ee 
XV X AVX 
dx 2(2ax +b) (1 
243. / = ( 2k) . 
; dd BAVX ΑΧ) 


T [en 2(2ax + b) pe 
xU] /2 ~ Qn- AX Qn-1/2 © 94 1 m 1)/2° 
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245. 


246. 


247. 


248. 


249. 


250. 


251. 


252. 


253. 


254. 


255. 


256. 


258. 


259. 


260 


/νχα . (2aa + b)/X | 1 dx 
' 4a "Oks JX 
(2aa + b)/X ( 3 ) 3 dx 
XVX dz = X4 H 
] VX de 8a 2k) 82) JX 
(2az--b)V/X (4 5X 15 5 da: 
X?^VX dx = (x | ) 
/ VX de 12a + eT gm) + 168] VX 
scd Qaa + b) X 2Η ΕΠ 2n 41 2n1)/: 
xOn0/2 qu = ( fxe 1)/2 dy 
/ τ 4a(n + 1) 2k(n +1) . "m 
ταν ~X b dx 
VX a 2aJ JX 
/ Ὁ.  2(bx+2c) 
XVX AVX 
sde 1 
X(2n+1)/2 (2n = 1)aX 0n-)/2 2a J X(2n+1)/2 
x dx dt 3b wes 30} —4ac f dz 
VX 2a 4a? ENT VX 
/ de — (2b°—dac)a+2be | 1 fde 
XVX αδγΧ VX 
XVX  b(2ax 4b) b dx 
zy X dx = 
[av tae 3a 8a? dak J YX 
X?MWX b 
[ #XVX de = 2E. ds 
. 5a 2a. 
2n+3) 
[ ox oen da = Louis is = = 2 πμ da: 
à (Q2n+3)a 2a. 
F 5b\ XVX b? — 4ac 
257. ΠΕΣ; c a VX | 2 τ [Xa 
6a 4a 16a? 
1 2VcX 2c 
! +b} 4 forc ; 
m x - ) σ orc >0 
1 ba + 26 
Arsinh FC force >0, A>0 
dz fa A οι ; ; 
/ z/X 1, br4-2c 
s - p bas ο. forc» 0, A— 0, 
νο z 
1 bx + 2c 
τε arcsin ee τες, A<0. 
Í dx VX b dx 
αν X cx 2 X 
| VX dx -VX] A dx dx 
i x ce X 


(see No. 241). 


(see No. 241). 


(see No. 241). 


(see No. 241). 


(see No. 244). 


(see No. 241). 


(see No. 241). 


(see No. 241). 
(see No. 246). 


(see No. 248). 


(see No. 245). 


(see No. 258). E 


(see No. 241 and 258). 
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X dx X dx b d: 
gp Ee XL af Sd / E (see No. 241 and 258). 
x x VX 2) yX 
X 2n+1)/2 XQnc1)/2 b . : XQn-0/2 
262. / απ = p2 pe dx +c ή — — dr (see No. 248 and 260). 
T 2n 4-1 2. i T 
dx 2 
263. / 2 = ax? + ba. 
J αγααᾶ- bx ba 
dx 2 -- 
264. Ξ = = arcsin : 
J να — x? 
a dx rz— 
265. 2 == 2ax — x? + aarcsin 
J ax — x? 


2 2 m 
266. [ os z dr =~ 3 2 να: — x? 4 3 arcsin 5 — 4. 
i a 


/ 2n = : arctan svag = Of 
J (ax? + b) / 0-09 Vb Jag — bf Up [Fre Fg 
- 1 | Vov Fa? + g + x /bf — ag 
n 
2/bybf —ag νυν” +9 — x y/bf — ag 


267. (ag — bf > 0), 


(ag — bf « 0). 


21.7.2.9 Integrals with other Irrational Expressions 


268. / Vax + bdx = ΠΤ Vax +b. 


n-cl)a ` 
289. / dx n(aa +b) 1 
J Var-b (n — 1)a Yax +b 
dax 9 7 [ar 2 
270. / x == hé PVA να : 
J rax" + a2 na n 
dx 2 a 
n. f =~ arccos —. 
aaa ATCCOS ην 


ντα: 21... (2). 
272. l usc]: 9 ats 


21.7.2.10 Recursion Formulas for an Integral with Binomial Differential 


273. EC +b)? dr 


il 
m —. a"! (aa? + D)? + npb fo (ax” Top ἀεὶ : 
m+np+1 j 
1 
Z = meri |-2"* (az +b) + (m -- n np4- 1) / x" (ax" +b)" da! : 
]i 


(m+ 1)b 


z" T (ag + b)Pt! — a(m 4n 4 np +1) ως +b)? da! " 
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1 
a eg ar” 4 ΤΙ _ i bfa „m-n ^. | y? d. 
a(m 4- np 4- 1) 7 (az" + b) — (m= n +1) (ax” 4- b)? dx 


21.7.3 Integrals of Trigonometric Functions 


Integrals of functions also containing sin x and cos x together with hyperbolic and exponential functions 
are in the table of integrals of other transcendental functions (see 21.7.4, p. 1092). 


21.7.3.1 Integrals with Sine Function 


1 
274. [sin ax dx = —-— cosaz. 
a 


; 1 1 
275. [ 3? ax dx = =x — — sin azr. 
2 4a 


1 
276. ου ax dx = —7 cos ax + — cos? ax. 
3a 
277. / d l iiid E dnd 
sin! az dx zt sin 2ax 4 sin daz. 
4a. 32a 
sin"larcosar m-—1 "C . 
278. ns απ dx = | [ 5» ^ar da (n integer numbers, > 0). 
na " o. 
279. fe ΕΡΕ sinat _ vcosaz 
a? a 
2x z 2 
280. pe sinax dx = © sinas : ~ | cosax. 
a? a a 
3a? 6 r? 6x 
281. 15 sinax dx = : sin ax = cos ax. 
a? at a a 
qn 
282. "Ea sinax dx = —— cosax + — 2 fe 1 cos ax dx (n > 0). 
a 
sin ax (az) (αὐ)  (ax)* 
283. / dr — ax H Een 
J az 3:3 5:5: 7-7! 


z 
sint 
The definite integral | τ dt is called the sine integral (see 8.2.5, 1., p. 513) and it is denoted si(x). 
0 


For the calculation of the integral see 14.4.3.2, 2., p. 756. The power series expansion is 
d -- aT 


. £ £ ae - 
si(r) = c 3.38 5.8 7 ^ "8€ 8.2.5, 1., p. 513. 
284. | sinan m sin ax T / cos ax dx (see No. 322). 
J x g : 
285. / sin ax dines 1 sinar , δη COS az ye (see No. 324). 
gh n—1lzagz"! m—1J/ οὶ 


1 ar 1 
286. = - em ax dx = = ln tan — = -- ln(cosec ax cot ax). 
sin ax a 2 a 


287. I 
sin? ax 


=-= * cot ax. 
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dx COS ax 1 ax 
288. / — = — | In tan 
sin’ ax 2asin ax 2a 2 
1 COS ax n—2 dr 
289. i= m n» 1). 
sin" ax a(n—l)sin""lar  n—14J sin"? ar ( ) 


A3 m m 
290. ᾿ rdr _ ΕΠ (ax) ^ T(ax) 31(ax) 
sinar a? 3:3! 3.5.5! 3-7-7! 


127(ax)? 2(22n-1 — 1) 2 
j porca B,(axy*! +... 
3.5.9! Ἐπ ΕΠΙ Pme 
B,, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 
a dx zr i . 
291. n - , ot ax + — Insinaz. 
sin? ax a? 
292. / τ dx ile 1 — l n—2 ο. (3-9). 
sin” ax (n—1l)asim" "ax (n—1)(n—2)a? sin"? ax n-1/J sin"*ar 
293 / dx 1 " ( τ 
! = an . 
i + sin ax a 4 2 
294. / E 7 tan ( t 2) , 
—sinav a 4 2 
295 / x dx i : π =) 2 i (7 =) 
: = a t cos Σ 
+ sin ax a j 4 2 a? ο 4 2 
da x ; 2 i 
296. / = = = T cot ( -) + — Insin (7 3 ; 
‘ —sinar a 4 2 a? 4 2 
sin az dx 1 "T απ 
297. / : =+r+- tan ( F ) à 
, csinar a 4 2 
1. 
298. / - τ - — — tan (z T 2) + - Intan 
sinaz(l+sinar) a 4 2 
dx ax 1 απ απ 
299. / - -= tan ( ) tan ( ) 
1+sinax)? 4 2 6 2 
300 j dx 1 : (7 2i 1 e G z) 
: Ξ cot t cot“ E 
l—sinavr)? 2a 4 2 6a 4 2 
301 / sin az dx 1 i G 2 1 ΜΝ e e 
i = an t an : 
1 + sinazr)? 2a 4 2 6a 4 2 
302 / sinaz dx 1 " G x) 1 e (7 = 
3 CO | co ; 
J (1—sinaz)? 2a 4 2 6a 4 2 
dx . (3sin? ax — 1 
303. / —3 arcsin —3 ; 
+sin*ar 242a sin ax + 1 
1: 
304. / = f = = ae ax. 
— sin? ax οσα: a 
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sin(a —b)r sin(a + b)x (lal 
2(a — b) 2(a + b) 


305. nr sin bx dx = |b] ; for |a| = |b| see No.275). 
dx 2 btanar/2-4-c 
` ? b+csinar ayb? -— 2 aneren v-e 
1 p btanaz/2 € c - Ve - 
ayc —l? btanar/2+c+Ve— V? 


sin az dx r b dr 
307. : / | (see No. 306). 
cJ b+csinaxr 


306 


for b? > ο), 


for b? < e). 


J b+csinaz c 
dx ar c dx 
sos. f - — Inta al see No. 306). 
J sinar(b--csinar) αὐ Da 5) b+ csinaz (ee No 
dx ccosax b dx 
309. = b : : see No. 306). 
/ (b+ csinaz)? a(b? — @)(b + csinar) b- ὅ ] b+ csinax sen 
310 / sin ax dx bcos ax ë / dx (see No. 306 
` J (bccsinar)  a(@ — b)(b+csinar) c —U J b+ csinar dia i 
dx 1 Vb? + c? tanar 
311. = - T 5 = - = arctan πε anag (b> 0). 
J P+esin ar αὐγθξ ὦ b 
dx 1 Vb? — c? tan ax 
312. f — arcta >, b> 0), 
P asna VET arctan 7 (b? »σ, b> 0), 


1 s Vc? — b tanar +b 
= 1 
βαὺν ὦ --0Σ Ve — b tanar — b 


(ὦ >b, b> 0). 


21.7.3.2 Integrals with Cosine Function 


1 
313. fos ax dx = -sin av. 
a 
5 1 i 
314. n ax dx = =x + — sin2az. 
2 4a 


3 ΠΗ 1 3 
315. cos” ax dx = = sinax — 3a sin” ax. 
à a 


3 1. 1... 
916. fo ax dr = -x + — sin 2ax + —— sin 4az. 
n 8 4a 32a 
cos""aysinar n—1 = 
317. [ cos" ax dx = I [ cos" “ax dx. 
ἢ na n. 
cosar xsinax 
318. xcosax dx = z F 
; a a 
, 2x 2; οἱ, 
319. T x? cos az dx = z COS ax 4 z | sinaz. 
‘ 2 a a 
: 332 6 z? 6rY , 
320. / x cos az dx = 3 z | Cos aa } z | sinaz. 
n a? a’ a a? 
z"siar n — 
321. [= cos ax dx = ————— — — Jr ‘sin ax dz. 
; α a. 
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m 212 "T "6 
322. / ο Ie = ln(ax) τ ! Kam) τη ees 
T 2.21! 4.4! 6-6! 


oo 
st 
The definite integral — / -- dt is called the cosine integral (see 14.4.3.2, p. 756) and it is denoted by 
. m αἱ n 


T | 
2.2! 4.4! 6-6! 


Ci(r). The power series expansion is Ο1(α) = C+Ina +++ see 8.2.5, 2., p. 513; 


C denotes the Euler constant (see 8.2.5, 2., p. 513). 


323. / = PEEL af sin ax dx (see No. 283). 
x m x 
cos aa cos ax a sin ax dx 
324, [ as mnm RE [ZEE ολη (see No. 285). 
^ dx 1 1 a 1 
325. / ο ες Artanh(sin az) = — In tan (5 | 1) = - In(sec ax + tan ax). 
cosax a a 2 4 a 
dx 1 
326. / - = —tanaz. 
J coar a 
327 / dx sin ax 1 isi e τ 
à | a; | ; 
J coar  2acojaz 2a \4 3 
328. / de . 1 sin ax n—2 oe (n>1). 
J coar  a(n—1)cos-!ar  n—1J cos" ax 
329 / 2 dx 1 ((aa)? , (ax)!  δ(αα)᾽ , 61(ar)* , 1385(az)'? , Q Es(ar)"? | 
|j coar at \ 2 4.20 ^ 6.4 8.6 ^ 10.8 ^ Qn + 2)(2n!) | 
E,, denote the Euler numbers (see 7.2, p. 466). 
2 dax x 1 
330. / - = —tanaz + —lncosaz. 
cogar a a? 
331. / rde o xvsinax 1 n—2 gdr ΠΤ 
cos"ax  (n—l)acos"!axr | (n—1)(n—2)a?cos"?ar  n—1J cos" ax 
dx 1 ax 
332. / — —tan 
+cosaz a 2 
333. / um - cot 5 
— cos ax a 2 
P d. ] a x 2 " 
334. / ο = tan 27 + — ln cos -- 
J 1+cosaxr a 2 2 2 
335 / zdr T " ax Insi ax 
. = co + — insin ; 
Ρ — cos ax a 2 a? 2 
cos ax dx ax 
336. / =g — -tan 
+ cos ax a 2 
337. / cosaxdx — " 1 -- ax 
— cos ax a 2 
dex 1 t\ 1, ar 
338. / z = —]ntan ( I =) —tan ae . 
J cosax(1+cosaxr) a 4 2 a 2 
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d dax 1 T ax 1 ax 
339. = —]ntan ( } ) cot —. 
J cosar(l— cosar) a 4 2 α 2 
dx 1 ax 1 4 Qc 
340. / == tan t tan : 
l--cosar)? 2a 2 6a 
dx 1 ax 1 3 Qc 
341. / = t ot? 
J (1— cos aa)? 2a ^^ θα 
cos ax απ 1 ; 
942. / σας .-— tan tan? ad 
J (1--cosax)? 2a 2 6a 
cos ax da 1 ax 1 zat 
343. / == cot cot" 
l—cosax)? 2a 6a 2 
= ee τος 
344. / a ΞΕ arcsin rm SE A jy 
J 1+cos?ax — 2 2a 1 + cos? ax 
dx dx 1 
345. / ων = / - E = —- cot απ. 
J l—cosax J sin? ax a 
r: ,— br si bx 
346. [cosarcosbz dr = κ ως κος, ({α| 5: ||); (for |a| = |b| see No. 314). 


2(a — b) 2(a + b) 


dar J dx — (b — c) tanax/2 
b+ccosar ayb -- ᾧ b? — ὦ 
. 1 (c — b) tanaz/2 + Vc? — 9? 
avc? -b  (c—- b)tanaz/2 - Vc? — b? 
cos ax dx x b da: 


348. = - 
J b+ccosax ο εὐ b+ccosax 


349 / dx en! hiis (= A 7) J dx 
` J cosax(b+ccosaxr) ab PE A bJ b--ccosax 


350 ή dx csin ax b / dax 
|J (b-ccosax) ασ —U?)(b--ccosax) c@— b?) b+ccosaxr 


351 / cos ax dx bsinax c / dx 
|J (b-ccosax) a(b?—c?)(b+ccosar) 0? —c?/ b+ ccosax 


352 / dx 1 A btan aa (b> 0) 
: = arctan b , 
J ÜP-8cosg?ar aby +e v? ἠ- ὦ 
dx 1 btanax 
353. 7 2 arcta 
b2 — @ cos? ax ab /b? -- 2 η V- dà? 


1 1 btanax — Ve — b? 
B n 
2aby/c? — 0 btanax + να — b? 


21.7.3.3 Integrals with Sine and Cosine Function 


; Y. sa 
354. [sm ax cos ax dx = Ἐς sin? ax. 
ἢ α 


(for b? > c?) 


(for b? < ο). 


(see No. 347). 
(see No. 347). 


(see No. 347). 


(see No. 347). 
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x sindax 


355. [ sin? ax cos? ar da = = — 
8 32a 
356. fsi ax cos ax dx = ———~ sin"*! ax (n # —1). 
: a(n + 1) 
F T 
357. [ 3» ax cos" az dx = ——— —. cos"! ax (n # —1). 
à (n +1) 
η m+ q " 
, sin"! ax cos π-- 1 -- 
358. ή sin" ax cos" ax dx | J sin"? ax cos" ax dx 
à a(n 4- m) nam. 


(lowering the exponent n; m and n > 0), 


sin”! az cos™™ taz  m-1 


+ 
a(n + m) n+m 


fs» ax cos" ax da 


(lowering the exponent m; m and n > 0). 


1 
359. — In tan ax. 
a 


sin az cos ax 


Il T aL 1 
= — |Intan I - . 
sin?axcosax αἱ 4 2 sinar 


360. 


361. 


1 ax 1 
= Intan + . 
sin ax Lu ar a 2  cosar 


1 1 
—-|Intanar— nz) ; 


362. Dd 
sin?axcosaz a 2sin^ax 


363. 


= —(Intanaz + mm ; 
sinag coar a 2 cos? ax 


sin? ax ra ax 


pres 
E 
/ 
judo 2 
στ. 
364. [| ——À— - = cot 2az. 
E 
Ja 
/ 
/ 
/ 


l[ sinax 1 3 T aT 
365. 2 - + — Intan T—]J|. 
sin?^arcos?ar | a|2cosar sinar 2 4 2 
1 1 COS ax 3 ax 
366. = ( —3 + —Intan ) ; 
sin arcos?ar  a\cosax — 2si?ar 2 2 
1 dx 
367. = H (n #1) (see No.361 and 363). 
sinarcos?ar a(n — l)cos"-!ax sin az cos?—? ax 
1 dr 
368. ot } — a (n 4 1)(see No. 360 and 362). 
sin” ax cos ax a(n — 1)sin"™ ax sin"- ax cos ax 
1 1 n+m—2 dx 
369. = 'uun-l t n—2 
sin" ax cos" ax a(n— 1) sin" "az cos?-! ax n—1 sin"^* ax cos" ax 
E (lowering the exponent n; m > 0, n > 1), 
1 1 ERI dx 
a(m —1) sin""arcos"-lar  n—1 J sin" arcos”? ax 


(lowering the exponent m; n > 0, m > 1). 
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370. 


371. 


372. 


373. 


374. 


375. 


376. 


377. 


378. 


379. 


380. 


381. 


382. 


383. 


384. 


385. 


i 
/ 
/ 
/ 
/ 
/ 
/ 
ο 
/ 
/ 
/ 


/ 
/ 
/ 
/ 
/ 


Οι. 


T) 
ο)" 


sin ax dx 1 
= = ~- sec ar. 
os? ax a cos ax 
sin ax dx 1 1 2 
t {απ΄ ax +4 

cos? ax 2a cos? ax a 
sin ax dx 1 

cos"ax a(n — 1) cos" ax’ 
sin? aa dr 1. 1 π 

sinav + —Intan t 
cos ax a 4 
sin? ax dx 1 sin ax 1 Tur 
3 = 3 —Intan{—+—}}]. 

cos? ax 2cos‘ar 2 4 2 

sin? ax dx sin ax 


(nz 1) (see No. 325, 326, 328). 


n—1 / cos?-? ax 


cos” ax a(n — 1) cos?-! ax 
sin? ax da 1 (sin? ax 
= + Incos aa | . 
cos ax a 2 
sin? ax dr 
5 cos ax + 
cos? ax COS ax 


sinl 'ardr 1 1 1 
= i - 3 (n Z 1, n #3). 
cos” ax a|(n—1)cos!ax (n -— 3) cos"-3 ax 
sin” ax sin”! ax : / sin"? ax dx ( 1) 
T n n 
COS QT a(n — 1) COS ax 
μα 111. ay - 
sin” ax sin^*' ax n—m-2 / sin" ax , (m 4 1) 
- z m 
cos™ m a(m — 1) cos"-! ax m-—1 J cos"? ax ᾿ 
sin”! ax n—1 psin"? ax dx 
- = (m#n), 
a(n—m)cos"-lar n-m COS" ax 
sin”! ax π-- 1 sim" "a ax dx (m #1) 
= m : 
a(m —1)cos"-lar m—1/ cos™?ar 
cos ax dx 1 
= - E cosec ax. 
sin? ax a sin ax 
cos anda — 1 cot? ax 
ΠΝ; H = + Ci. 
sin? ax 2a sin^ ax 2a 
cos ax dx 1 
sin” ax a(n — 1) sin" aa 
cos?ardr 1 ax 
= cos ax + Intan : 
sinar a 2 
cos? ax dx 1 /cosar ax 
= m. ln tan ; 
sin? ax 2a Nsin^ ax 2 
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386. 


387. 


388. 


389. 


390. 


391. 


392. 


393. 


394. 


395. 


396. 


397. 


398. 


399. 


400. 


401. 


402. 


/ 


/ 
/ 
/ 
/ 
/ 


/ 
/ 
/ 
/ 
/ 


/ 
/ 
/ 
/ 


cos? ax dx 


COS AX 


sin" ax 


1 
(n — 1) (m ax 


/ dx 
Τ " = 
J sin"? 


cos? ax 


cos? ax dx Ἢ 


sin ax a 


+Insinaz |. 
2 


z) man 


(see No. 289). 


cos? ax dx Wf 1 
= —— (sinat +- y 
sin? ax a sin ax 
cos?ardr 1 1 1 
= aq aims -nI (n#1l,n #3). 
sin” ax a|(n—3)sin""ax πας 1)sin”™ ax 
cos” ax cos”! ax cos”? ax dx 
= + (n #1). 
sin ax a(n—1) . sin ax 
cos" ax da cost! q n—m-42 r cos" ax da: ( 1) 
= — — m 
sin" a(m — 1) qe lag m-—1 J sin™ ax ᾿ 
cos”! ax wed cos" ax dx ) 
= —mn— I 7 (m zn 
a(n —m)sin™ "ar m-l. sin" í 
cos”! ax n—1 [ου 2 ax dx idi 
= — — m : 
a(m — 1) sin"! m-—1J sin™ τα 
dx 1 1 ax 
s | Intan 
sin ax(1 + cos ax) 2a(1 +cosar) 2a 
dr 1 "a (242) 
5 =F : t ntan t : 
cos az(1 + sin ax) 2a(l-csinar) 2a 4 2 
sin az dx i 1 + cosas 
n ; 
cosar(l-ccosar) a COS ax 
cos ax dx 1 sinar 
sin az(1 + sin ax) a sinax ` 
sin ax dx 1 1 fee (24 27) 
- + ntan . 
cosar(l-c-sinax) 2a(1+sinaxr) 2α 4 2 
cos az dx 1 1 ist ax 
: = + ntan 
sin aa(1 + cos ax) 2a(1+cosax) 2a 2 
sin ax dx T 1 . 
- T — In(sinax + cos az). 
sinardcosaxr 2 a 
cos ax dx 1 1 : 
: EZ In(sin ax + cos ax). 
sinax + cosax 2 2a 
dx 1 ar T 
: = In tan + ; 
sinard-cosar αγ 2 δ 
dx 1 ax 
- In (1 tan ) j 
1+cosax + sin ax a 2 
dx 1 ax +6 : ! G ο 
- = - - ln tan with sin 0 = —— and {απ = --. 
ὑβίπαα + ccosax ayb? ὦ b+ 2 b 
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^ sinax dx 1 
403. κα In(b + ccos ax). 
J b+ccosax ac 


cos ax dx 1 


404. In(b + csin ax). 


J b-Fcsinar ας 


d (« + 3 
405 / da a 
: m 


+ ecosax + f sin ax νε vc? + [2 sin(ax + 0) 


with sind = —— —— and tan@ = F (see No. 306). 


νο f? 
406 / dx 1 " ( A ) 
: πο 3. = arctan | - tanaz }. 
b2 cos? ax + @sin?az — abc b 
dx 1 ctanax +b 
; : = n : 
J Pcostax—csin?ax —2abc  ctanax — b 


A07 


cos(a+b)a cos(a — b)a 


408. n ax cos bx da = 2(a 3) aad) (a? 4b’); fora=b (see No. 354). 
21.7.3.4 Integrals with Tangent Function 
1 
409. [tan ax dx = —- ln cos ax. 
a 
2 tanar 
410. [ tan απ dx = - 
; a 
3 1 2 1 
411. fton ax dx = — tan ax + — ln cos ax. 
; a 
n 1 n—1 n—2 
412. fion ax dx = ———~ tan” ax — f ov ax dx. 
a(n — 1) 
"m i i d ax? a3a? 20937 17a‘x? ος z 1) Βαζ lgt 
: rtanardx- | | | ενα] bees 
J ο = 3 +- i5 "105 5^ 2835 (2n +1)! 
B,, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 
414. / anax dx |— ΜΡ (az)? A 2(ax)? n lT(ax) xcd 22^ (2n — 1) B, (ασ) τὰ 
x 9 75 2205 (2n — 1)(2n!) 
an” ax 1 
415. dr = tan”t! ax | X -1). 
/ costar ^ a(n + 1) ''''' inge) 
d. a η" 1 
416. / PT = XT Ἐφ Ἐπ In(sin ax + cos ar). 
tan ax dx r1 
417. / ως In(sin az + cos ax). 
J tanatt1l 2 2a 


21.7.3.5 Integrals with Cotangent Function 


1 
418. i cot ax dx = — ln sin ax. 
: α 
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P cot ax 
419. [ cot? ax dx = ———— — T. 
; a 


: 1 3 1 
420. n ax dz = —— cot? ax — -lnsinaz. 
2a a 


1 , 
421. J cot" ax dx = “am =D cot"! ax — jot? axdx — (n#1). 


m / í j r aa aka? 2mp a lgt 
. | xcotardx = 2 
μας 9 225 Qn +1)! 
B,, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 
423 / cotardr 1 ax (ar? (ac) 2?n B, (ar)! 
E x ar 3 135 4725 (2n — 1)(2n)! 
t" 1 
424. {= 27 dx cot” ται (n £ —1). 
sin? ax ' a(n 4- 1) 
dx an ax dx 
425. / = -fz τοις (see No. 417). 
J 1+cotax tanar + 1 


21.7.4 Integrals of other Transcendental Functions 
21.7.4.1 Integrals with Hyperbolic Functions 


1 
426. E ax dx = — cosh ax. 
$ a 
1. 
427. / cosh ax dx = — sinh az. 
à α 
πια d. 1 
428. E ax dx = — sinh az coshax — =a. 
à 2a 2 
" κα. 1 
429. J cosh“ ax dz = — sinh az cosh ax + =a. 
é 2a 2 
430. J sinh” ax dx 


1. = n—1lf. = 
= — sinh"! ax cosh ax — ——— / sinh”? aa dx (for n > 0), 
an no. 


2 PIE 
= ——— sinh"*! ax cosh ax — ώμος [sim ax dx (for n < 0) (n # —1). 
a(n + 1) 1-1. 
431. [ cosh" ax dx 
Tou n-l n-l1 n-2 
= — sinh ax cosh™ ax + —— / cosh" ^ ax dx (for n > 0), 
an n. 


sinh ax cosh"*! aa + E ο. ax da (for n < 0) (n # —1). 
n+ 


~ a(n +1) 


432. / mt ος. nu 
a 2 


sinh ax 


dx 2 ^ 
433. / — e°? 
coshar a 
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^ 1 1 
434. / rsinhardr- —xcoshar-— a sinh ax. 
: α α 
1. 1 
435. l x cosh ax dx = —x sinhax — — cosh ax. 
à a? 
1 
436. [ann ax dx = — ln cosh az. 
i a 
1. 
437. / cothax dx = — ln sinh az. 
a 


tanh ax 


438. n ax dr = xt — 


a 
, oth ax 
439. [ xe ax de —z— LOAF, 
: a 
1 9 ς 
440. E ax sinh bx dx = — z (a sinh bx cosh ax — b cosh ba sinh ax) (a? b°). 
à a? — 
441. I cosh ax cosh bx dx = zp (a sinh ax cosh ba — b sinh bx cosh ax) (a? 4B’). 
i a? —b 
442. [cosh ax sinh bx dr = — Rp (a sinh bx sinh ax — b cosh bx cosh ax) (a? £ b?). 
a2 — 
, i 1 , : 
443. J sinh ax sin az dx = — (cosh ax sin ax — sinh ax cos ax). 
a 


pun ; 
444. / cosh ax cos ax dx = σα sinh ax cos ax + cosh ax sin ax). 
a 


; 1 i ; 
445. f sinh az cos ax dr = T (cosh ax cos ax + sinh ax sin ax). 
: d i ; 
446. | coshaxsinax dr = ad (sinh ax sin az — cosh ax cos ax). 
à a 


21.7.4.2 Integrals with Exponential Functions 


1 n EE 
450. [oe dr = —a"e"* — — [= leo? da. 
a 


NT vn ma mS 
451. J: duci ες, NO ὑπ 
"m 1-1 ^ 2.2! " 3.3l 
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T t 
e 
The definite integral / e dt is called the exponential function integral (see 8.2.5, 4., p. 514) and it is 
denoted by Ei(r). For x > 0 the integrand is divergent at t = 0; in this case we consider the principal 
value of the improper integral Ei(z) (see 8.2.5, 4., p. 514). 


^ οἱ "d 2 ; 
7 ut =C πα] +5 H3 t Heet Een 


—oo 


C denotes the Euler constant (see 8.2.5, 2., p. 513). 


452. [5 2 ud τα = dr) (n z 1). 
n-1\ gmt Jnd 


dx 1 p 
453. id So s 
lc-e* aq 1+e%* 


454. I; LL NE. In(b + οε΄“). 


tce b ab 


ge” 11 Ἦ il 
455. TES SS In(b + ce**). 
b4 " ac 


+ ce? 


dx 1 ax: O 
456. l= = an ( B (bc 0). 


1 1 c+ e** --ος 


= be < 0 
2avy —bc i c — e 4/ —bc ας ) 
xe dx en 
a57. f = . 
(11-αα)᾽ a(1+azr) 
. οαα 1 1. 1 pda 
458. Je Inzdz = ——* — = [= dz; (see No. 451). 
: a aJ mz 
459. T e sin br dz = 5 (asin ba — bcos bax). 
460. | ε΄" cos ba dx = aoe (a cos ba + bsin ba). 
à a? + 
ου. 
461. τον sin" zdr = ^ στ z z (asin x — n cos x) 
a@ +n 
n-1 " : 
es) e sin"? x dz ; (see No. 447 and 459). 
a^-Emn*. 
αα n—l, 
462. Je cos” z dz = Ê ra ‘(a cosa + nsin x) 
j a? +n? 
n(n—1) f a. n2 
AER E cos" x dz; (see No. 447 and 460). 
463. Ἰ ve^ sin bx dx = τ. m (a sin bx — b cos bx) Cae py [(a? — 03) sin bx — 2ab cos bz]. 
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peat paT 


a cos bx + bsin bz) ο i py [(a? — b’) cos ba + 2ab sin ba]. 
a 


464. J te cos ba dz = an ri 


21.7.4.3 Integrals with Logarithmic Functions 
465. [m zdr -—zlnz- rz. 


466. [m v) dx = z(In z)? — 25 1η: + 22. 


467. [x dx = x(Ina)* — 3a(In £)? + 6x ln x — 6x. 


468. [m x)” dx = x(Inz)" — n ο dx (n 4-1). 


(nz) (nz)? 
3a t go 


469. {= — InIn z 4- In z 4 
J Ina 


f dt 
The definite integral / iad is called the logarithm integral (see 8.2.5, p. 513) and it is denoted by Li(z). 
n 
0 


For x > 1 the integrand is divergent at t = 1. In this case we consider the principal value of the 
improper integral Li(x) (see 8.2.5, p. 513). 

The relation between the logarithm integral and the exponential function integral (see 8.2.5, p. 514) is: 
Li(r) = Ei(In x). 


z 

470. i = 1); (see No. 469). 

(nay (n — 1)(In z)n 2 n—1J (Iz)! frd α-- 

Ina 1 
471. [^ z"Inzdy = g” |22 - (m 1). 
j m+1 (m+1} 
gmt l n m 
472. E xz" (ln £)” dr = gaz) = [ nay dx (mz —1,mn 7 —1; (see No. 470). 
m-F1 m+1 
mu qnl 
πα. / (In x) dic: (In x) 
T n+1 

Ing Ing 1 

ani. [de = l 1). 
Jam (m—1)r"— (πι--1)λαη-! i 

Ina)? ina) 4 n— 'd 

475. J p i σπα ο πα (m#1); (see No. 474). 
qm (m—1)r"-! m-1 gm 

αν da εν ; 
476. = [-- dy with y = —(m+ 1) Ine; (see No. 451). 

Ina y 

x? da: gm m+1 x" dx 
am. f n 1). 

(In x)” (n — 1)(In z)n-! RU ric (In z)n-1 Dod 


dx 
478. {f= =InIng. 
vlna 


a L 4s 42 — Mfg 18 
479. / Oe Inlnz — (n — 1)In x 4 ΕΙΤΕ 
2.2! 3.3! 


απ Τη α 
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=] 
480. m. n £1). 
m x) ^ (n — D) In z)n-i aA 
-1 p-1 dx 
481. Im - nz 1). 
xP πα azP-l(n— l)(IlDnnz)-! π- 19 z*(Inz)n-i . 
. a 15 g2n-1p, gent 
482. / Insinzdr --α]πα-α m wp nn - 1) 


B, denote the Bernoulli numbers (see 7.2.4.2, p. 465). 


x? 15 Pu 92n-1(92n - 1)B, " 
483. p sa da ΠΣ ον, ΩΝ 
iios 60 315 nQn+i ^ 
po η 25 g2n 02η =f B, . 
484. ns =alnx-—at+ 2 t a bes ( ) gu, 
9 450 n(2n + 1)! 


485. [sin Inzdr = 5 (sin ln z — cosln x). 


486. Joos ln zdr = 5 (sinin 2 + cosIn x). 


487. Je lng dgr = l gar lng 1 E dz ; 
: a aj rz 


21.7.4.4 Integrals with Inverse Trigonometric Functions 


. T zr D 7 
488. / arcsin — dx = x arcsin = + Va? — 13, 
a α 


2 9 

Nu T α "m m & -- - 

489. [ vavesin dx = arcsin a? — 23, 
; a 2 4 a 


αἴ 
x ν 
490. [2 arcsin ~ τάν = 3 arcsin ~ aa je 2a?)v/a? — x?. 


arcsin 5 — dz ; 1 3 
α 


491 / T Y 1-3 oP 1-35 ac 
f x a 2-3-3a3 ' 2-4-5-5a5  2:4.6-Τ-Ταΐ 
arcsin T dx i E 
1 — $i 
492. / E arcsin = In a Ξ 
y? x a a T 
E g ; 
493. J arccos — dx = x arccos a? — 23. 
α 
£ a3? αἳ t £ 
494. E arccos ^ dx = ( 5 7 ) arccos - A a? — 23, 
495 15 arccos 7 σας = T arccos Ž La + 2a?) Va? — a? 
; z? $579 22, 
arc M » f F : 
496. / ar Coos " τ π ΠΝ m 1 z’ 1-3 2 1.3.5 a 
x a 2.3.3a? 2-4-5-5a5 3:4:6:Τ.Ταΐ 
arccos 7 dx z 
e 2 gr? 
497. / so = arccos Č + -In Ê Un οὑκ 
T x a a 1 


(see No. 451). 
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5 n(o + a”), 


ᾳ T 
498. / arctan — dx = x arctan 
a a 


x 1 T ax 
499. J arcte = dz = “) arctan — — — 
x arctan 7 da 3 Fa?) ar us 


xv x ar a , , 
500. fa arctan - τάν = — arctan | In(a? + 2”). 
3 à ee ) 
" n+l m pU lx 
501. J x” arctan ^ dx = — arctan — E ini (n 1). 
a n+1 a πη a? + a? 
P 
502. CM SLM OPNS NM iE (|2| < Jal). 
T a 3a? 524 Ta! 
arctan Š = 7 dz 3 a? +r? 
503. / arctan In xc 
2 α α £ 
.. arctan = - 7 dz 1 : x dis i 
= arctan - - n ; 
/ (n — 1)z"-i a n-—14 a? l(a? +2) ) 
r a : ; 
505. [ΝΕ dx = x arccot — 4 5 In(a? + 2”). 
a 
506. f: 2 τά een re ae 0 
1 arcco = arccot — + —. 
n c= 5 47 +a?) ax Aer 
φῦ απ. ar a 9 5 
507. Fe arccot - τἀν = > arccot — 4 In(a* + z*). 
3 a 6 6 
uas . qnl lx 
508. fe ' arccot - = de = arccot ^ 4 — [5 = (n 1). 
n+1 a n+l @+2? 
bos J arc cot τ - τάν - Es zo a "n x 
a 3a 5a Ta 
arccot = - -de 1 m 1. @42? 
510. f arccot — + In —. 
xv a 2a g2 
arccot 5 dz 1 
511. / o - arccot - Aj n 56 1). 
g^ (n — 1)z"-! à a mn-—14J rta? + 2?) (n ) 


21.7.4.5 Integrals with Inverse Hyperbolic Functions 


© z 
512. i Arsinh — dx = x Arsinh z? +a. 
a a 


513. J Arcosh T de = x Arcosh — x? — a?. 
: a a 


514. EST 7 dg = zx Artanh 5 + s In(a? — 2°). 
à a a 


515. J ^ot δ dy = z Arcoth~ + 2 ln(z? — a°). 
a a 2 


1098 21. Tables 


21.8 Definite Integrals 


21.8.1 Definite Integrals of Trigonometric Functions 


For natural numbers m,n: 


20 2π 2π 
1. [sin nrdr-—0. (21.1) 2. [cos nrdr-0. (21.2) 3. [sin nrcosmzrdr —0. (21.3) 
0 0 0 
2π 2π 
. | 0 form Z n, 0 form zn, 
4. [sin nx sin mz dx = (21.4) 5. [o na cosma dx = (21.5) 
à π form=n. à π [ου πι Ξ- n. 
z 24 -1 
2 a 2 5 Πιό for n odd, 
6. | sin" ede = STRE oat κ (n > 2). (21.6) 
0 2246 ^, Brneven 
Ῥ T(a-1I(841) 1 
Τα. i sin?**! y cos? H! gdr = TT : En Là 3B(a +1,8+4+1). (21.7a) 
r(x) 
B(z,y) = pot denotes the beta function or the Euler integral of the first kind, I' (x) denotes the 
T y 


gamma function or the Euler integral of the second kind (see 8.2.5, 6., p. 514). 


The formula (21.7a) is valid for arbitrary a and β; we use it, e.g., to determine the integrals 


1/2 71/2 


1 3f - 
J vsin £ dx, / Vsin a dz, / 
0 0 0 


For positive integer a, B: 


71/2 
da: 
3 "me 
v COS cr 


etc. 


1/2 
. " a8! 
Tb. / sinj^ ip cos?! ¢ dx — UP 0 (21.7b 
{ 2(α-Ε B 4- 1)! 
π 
pom z f ; 
A f sinat iy 2 ora > 0 (21.8 
) ες, fora «0. 
po rdr 
9. aS = (a arbitrary). (21.9 
0 3 
T 
BOT ΤΗ = for ; 
να . μοι (21.10 
x a fora < 0. 
P cos ax — cos bx b 
11. cos ax — cos ba dca. (21.11 
T a 


0 
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TR 5 for |a| < 1, 
sin x cos ax 
12. [ 5a = H for |a| = 1, (21.12 
: 0 for |a] > 1. 
T sinz COS X T 
13. / dx = dx = ; (21.13 
4 ντ ντ 2 
"Fa sir be 
14. J ee Ξ = Tg (the sign is the same as the sign of b) . (21.14 
$9 +2 2 
15. | 5 ds = Se (21.15 
16 T sin? ax d T | (21.16 
. 72 C= —|d|. . 
oo 7 ? 
Too +00 T 
17. J sin(z?) dx = J cos(a?) dz = "Ls (21.17 
sin z dx 1, 14+k 
18. — = In for |k] « 1. 21.18 
eee ee M ( 
π/2 ΠΕ 1 
19. J Moo — —arcsink for |k| <1. (21.19 
i Vl-KEsmz : 
π/2 29 
sin’ z dx 1 
20. ;— = πείς — E) for|k| <1. 21.20 
‘4 V1— k sin? z jl M ( 
Here, and in the following, E and K mean complete elliptic integrals (see 8.1.4.3, 2., p. 490): 
E=E (r, 5) , K=F (é. 5) (see also the table of elliptic integrals 21.9, p. 1103). 
cos? x dx 1 ͵ 
21. T = G[E- (1 -— k’)K]. 21.21 
Ü V1—k sin? x jl ( JK] ( ) 
22 LLL T forintegera > 0, BN ed (21.22) 
; = r integer a . ; 
" 1-—2bcost +b 1-7} iE 
21.8.2 Definite Integrals of Exponential Functions 
(partially combined with algebraic, trigonometric, and logarithmic functions) 
f i (n+l 
23. | z"e “dr = Pues fora > 0, n > —1, (21.23a) 
" anc 
| 
aw fora>0, n=0,1,2,.... (21.23b) 


anti 
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I (n) denotes the gamma function (see 8.2.5, 6., p. 514); see also the table of the gamma function 21.10, 
p. 1105). 


oo ] 2 Γ (5) 
24. [de dx = Lu fora-0,n»-1l, (21.24a 
0 2al 53) 
1-3- (2k = Tae 
= ISSN SSYD for n= 2k (k =1,2,...), a0, (21.24b 
k! 
Πτι for n 2 2k4-1 (kK=0,1,2,...), a>0. (21.24c 
25. fev dx = vn for a > 0. (21.25 
: 2a 
0 
26. Juge dx = vz for a > 0. (21.26 
4a? 
0 
27. | ει cosbr dx = v7 „eh? fora>0. (21.27 
0 . 
T ασ πλ 
28. mL 21.2 
ή gh] 6 nd 
0 
T xdr 2 
29. | = R (21.29 
Je™+1 12 
0 
e sin x 1 
30. l — — dx = arccot a = arctan for a > 0. (21.30 
0 * 
31. J e ^Inzdz = —C & —0, 5772 (21.31 
0 
C denotes the Euler constant (see 8.2.5, 2., p. 513). 
21.8.3 Definite Integrals of Logarithmic Functions 
(combined with algebraic and trigonometric functions) 
1 
32. n | na | dx = —C = —0,5772 (reduced to Nr. 21.31). (21.32) 
0 
C is the Euler constant (see 8.2.5, 2., p. 513). 
f Ine n? 
33. |- g dx = € (reduced to Nr. 21.28). (21.33) 
Qu 
f lng π᾽ 
94. iS zi da = πα (reduced to Nr. 21.29). (21.34) 
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f Ine 2 
35. [ eT (21.35) 
0 
In(1 
36. [ HO να id. (21.36) 
αἲ 1 δ 
0 
1 
(21.37) 


^ Jas 
37. Πο dr =I(a+1) for (--1 «ας oo). 
J NE 
0 


p. 1105). 
π/2 π/2 
38. J msimzdz E J incose de = -3 2. 
0 0 
fi 2in2 
39. fe ln sin z dz = m . ; 
0 


π/2 


40. / sin z lnsin z dx 112 — 1. 


y /q2 — 52 
41. [ra + bcosx) dx = mln a fora > b. 


0 
T 5 2π]πα for (a>b>0), 
42. [in — 2abcos x + b^) dx = 
4 2rlnb for (b>a> 0). 
π/2 
43. ΠΠ; = 0. 
0 


n/a 


44. / In(1 + tan x) dx = τα 2. 
0 


21.8.4 Definite Integrals of Algebraic Functions 


_ Πας ΟΓ(β-Ε 1) 
Γ(α 1- ϐ 4 2) 


1 1 
45. fea — zr)? απ = 2 [na 2?) dx 
0 0 


= B(a+1, 8--1), (reduced to Nr. 21.7a). 


DI'(x)1*(i 
Βία, υὺ) = ma denotes the beta function (see 21.8.1, p. 1098) or the 1 


(2 


(2 


(2 


(2 


(21.45) 


Euler integral of the first 


kind, ΙΓ (1) denotes the gamma function (see 8.2.5, 6., p. 514) or the Euler in 


46. Iz a for a < 1. 
0 


(l-cz)r"^ | sinam 


egral of the second kind. 


(21.46) 


38 


.39 


40 


Al 


42 


49 


44 
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ix 

47. J 07 —mcotam fora< 1. (21.47 
J (1— x)z* 
es get π . 

48. | d= m for0<a<b, (21.48 
0 b 

49 jd UE 5 (21.49 
/ MEET ar (= *) 

a 


T(x) denotes the gamma function (see 8.2.5, 6., p. 514; see also the table of the gamma function 21.10, 


p. 1105). 


1 
50. / 
J 14 
0 


dx a T 
= O0<a<—). 21.50 
F2rcosa--z? 2sina ( i 2 ) (21:90) 
dx a 
z= 0<a< 5) > 21.51 
F2xcosa--z? sing ( i 2 PL 


51. / 
J 14 
0 
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21.9 Elliptic Integrals 
21.9.1 Elliptic Integral of the First Kind F (y,k), k = sina 


 /? a? 


0 10 20 30 40 50 60 70 80 90 


0 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 
10 | 0.1745 | 0.1746 | 0.1746 | 0.1748 | 0.1749 | 0.1751 | 0.1752 | 0.1753 | 0.1754 | 0.1754 
20 | 0.3491 | 0.3493 | 0.3499 | 0.3508 | 0.3520 | 0.3533 | 0.3545 | 0.3555 | 0.3561 | 0.3564 
30 | 0.5236 | 0.5243 | 0.5263 | 0.5294 | 0.5334 | 0.5379 | 0.5422 | 0.5459 | 0.5484 | 0.5493 
40 | 0.6981 | 0.6997 | 0.7043 | 0.7116 | 0.7213 | 0.7323 | 0.7436 | 0.7535 | 0.7604 | 0.7629 
50 | 0.8727 | 0.8756 | 0.8842 | 0.8982 | 0.9173 | 0.9401 | 0.9647 | 0.9876 | 1.0044 | 1.0107 
60 | 1.0472 | 1.0519 | 1.0660 | 1.0896 | 1.1226 | 1.1643 | 1.2126 | 1.2619 | 1.3014 | 1.3170 
70 | 1.2217 | 1.2286 | 1.2495 | 1.2853 | 1.3372 | 1.4068 | 1.4944 | 1.5959 | 1.6918 | 1.7354 
80 | 1.3963 | 1.4056 | 1.4344 | 1.4846 | 1.5597 | 1.6660 | 1.8125 | 2.0119 | 2.2653 | 2.4362 
90 | 1.5708 | 1.5828 | 1.6200 | 1.6858 | 1.7868 | 1.9356 | 2.1565 | 2.5046 | 3.1534 oo 


21.9.2 Elliptic Integral ofthe Second Kind E (o, k), k = sina 


e /? er 


0 10 20 30 40 50 60 70 80 90 


0 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 
10 | 0.1745 | 0.1745 | 0.1744 | 0.1743 | 0.1742 | 0.1740 | 0.1739 | 0.1738 | 0.1737 | 0.1736 
20 | 0.3491 | 0.3489 | 0.3483 | 0.3473 | 0.3462 | 0.3450 | 0.3438 | 0.3429 | 0.3422 | 0.3420 
30 | 0.5236 | 0.5229 | 0.5209 | 0.5179 | 0.5141 | 0.5100 | 0.5061 | 0.5029 | 0.5007 | 0.5000 
40 | 0.6981 | 0.6966 | 0.6921 | 0.6851 | 0.6763 | 0.6667 | 0.6575 | 0.6497 | 0.6446 | 0.6428 
50 | 0.8727 | 0.8698 | 0.8614 | 0.8483 | 0.8317 | 0.8134 | 0.7954 | 0.7801 | 0.7697 | 0.7660 
60 | 1.0472 | 1.0426 | 1.0290 | 1.0076 | 0.9801 | 0.9493 | 0.9184 | 0.8914 | 0.8728 | 0.8660 
70 | 1.2217 | 1.2149 | 1.1949 | 1.1632 | 1.1221 | 1.0750 | 1.0266 | 0.9830 | 0.9514 | 0.9397 
80 | 1.3963 | 1.3870 | 1.3597 | 1.3161 | 1.2590 | 1.1926 | 1.1225 | 1.0565 | 1.0054 | 0.9848 
90 | 1.5708 | 1.5589 | 1.5238 | 1.4675 | 1.3931 | 1.3055 | 1.2111 | 1.1184 | 1.0401 | 1.0000 
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21.9.3 Complete Elliptic Integral, k = sina 


a /? K E a /? K E a /? K E 
0 5708 | 1.5708 || 30 | 1.6858 | 1.4675 | 60 | 2.1565 | 1.2111 
1 5709 | 1.5707 || 31 | 1.6941 | 1.4608 61 | 2.1842 | 1.2015 
2 5713 | 1.5703 || 32 | 1.7028 | 1.4539 62 | 2.2132 1920 
3 5719 | 1.5697 | 33 | 1.7119 | 1.4469 | 63 | 2.2435 1826 
4 5727 | 1.5689 | 34 | 1.7214 | 1.4397 | 64 | 2.2754 1732 
5 .5738 | 1.5678 | 35 19132 | 1.4323 | 65 | 2.3088 1638 
6 S751 | 1.5665 | 36 | 1.7415 | 1.4248 | 66 | 2.3439 1545 
7 -5767 | 1.5649 | 37 | 1.7522 | 1.4171 | 67 | 2.3809 1453 
8 9785 | 1.5632 | 38 | 1.7633 | 1.4092 | 68 | 2.4198 1362 
9 .9805 | 1.5611 | 39 | 1.7748 | 1.4013 | 69 | 2.4610 1272 
10 5828 | 1.5589 | 40 | 1.7868 | 1.3931 | 70 | 2.5046 | 1.1184 
1 5854 | 1.5564 || 41 1992 | 1.3849 | 71 | 2.5507 | 1.1096 
12 0882] 1.5537 | 42 | 1.8122 | 1.3765 | 72 | 2.5998 | 1.101 
13 .5913 | 1.5507 | 43 | 1.8256 | 1.3680 | 73 | 2.6521 927 
14 5946 | 1.5476 | 44 | 1.8396 | 1.3594 | 74 | 2.7081 844 
15 5981 | 1.5442 | 45 | 1.8541 | 1.3506 | 75 | 2.7681 764 
16 0020 | 1.5405 | 46 | 1.8691 | 1.3418 | 76 | 2.8327 | 1.0686 

17 «6061 | 1.5367 | 47 | 1.8848 | 1.3329 | 77 | 2.9026 | 1.061 
18 6105 | 1.5326 | 48 | 1.9011 | 1.3238 | 78 | 2.9786 | 1.0538 
19 6151 | 1.5283 | 49 | 1.9180 | 1.3147 | 79 | 3.0617 468 
20 .6200 | 1.5238 | 50 | 1.9356 | 1.3055 | 80 | 3.1534 40 
21 .6252 | 1.5191 | 51 | 1.9539 | 1.2963 | 81 | 3.2553 | 1.0338 
22 .6307 | 1.5141 | 52 | 1.9729 | 1.2870 | 82 | 3.3699 278 
23 .6365 | 1.5090 | 53 | 1.9927 | 1.2776 | 83 | 3.5004 223 
24 «6426 | 1.5037 | 54 | 2.0133 | 1.2681 | 84 | 3.6519 72 
25 0490 | 1.4981 || 55 | 2.0347 | 1.2587 δ | 3.8317 27 
26 6557 | 1.4924 | 56 | 2.0571 | 1.2492 | 86 | 4.0528 | 1.0080 
27 6627 | 1.4864 | 57 | 2.0804 | 1.2397 | 87 | 4.3387 | 1.0053 
28 6701 | 1.4803 | 58 | 2.1047 | 1.2301 | 88 | 4.7427 026 
29 6777 | 1.4740 | 59 | 2.1300 | 1.2206 | 89 | 5.4349 | 1.0008 
90 oo «0000 


21.10 Gamma Function 1105 


21.10 Gamma Function 


x I(x) x I(x) x T(x) cz I(x) 


1.00 | 1.00000 | 1.25 | 0.90640 | 1.50 | 0.88623 | 1.75 | 0.91906 
01 | 0.99433 26 | 0.9044 51 | 0.88659 76 | 0.92137 
02 | 0.98884 27 | 0.9025 52 | 0.88704 77 | 0.92376 
03 | 0.98355 28 | 0.90072 53 | 0.88757 78 | 0.92623 
04 | 0.97844 29 | 0.89904 54 | 0.88818 79 | 0.92877 


1.05 | 0.97350 | 1.30 | 0.89747 | 1.55 | 0.88887 | 1.80 | 0.93138 
06 | 0.96874 31 | 0.8960 56 | 0.88964 81 | 0.93408 
07 | 0.96415 32 | 0.89464 57 | 0.89049 82 | 0.93685 
08 | 0.95973 33 | 0.89338 58 | 0.89142 83 | 0.93969 
09 | 0.95546 34 | 0.89222 59 | 0.89243 84 | 0.94261 


1.10 | 0.95135 | 1.35 | 0.89115 | 1.60 | 0.89352 | 1.85 | 0.94561 
1 | 0.94740 36 | 0.89018 61 | 0.89468 86 | 0.94869 
2 | 0.94359 37 | 0.88931 62 | 0.89592 87 | 0.95184 
3 | 0.93993 38 | 0.88854 63 | 0.89724 88 | 0.95507 
4 | 0.93642 39 | 0.88785 64 | 0.89864 89 | 0.95838 


5 | 0.93304 | 1.40 | 0.88726 || 1.65 | 0.90012 | 1.90 | 0.96177 
6 | 0.92980 41 | 0.88676 66 | 0.90167 91 | 0.96523 
7 | 0.92670 42 | 0.88636 67 | 0.90330 92 | 0.96877 
8 | 0.92373 43 | 0.88604 68 | 0.90500 93 | 0.97240 
9 | 0.92089 44 | 0.88581 69 | 0.90678 94 | 0.97610 


1.20 | 0.91817 | 1.45 | 0.88566 | 1.70 | 0.90864 | 1.95 | 0.97988 
21 | 0.91558 46 | 0.88560 71 | 0.91057 96 | 0.98374 
22 | 0.91311 47 | 0.88563 72 | 0.91258 97 | 0.98768 
23 | 0.91075 48 | 0.88575 13 | 0.91467 98 | 0.99171 
24 | 0.90852 49 | 0.88592 74 | 0.91683 99 | 0.99581 


1.25 | 0.90640 | 1.50 | 0.88623 | 1.75 | 0.91906 | 2.00 | 1.00000 


The values of the gamma function for x < 1 (x # 0, —1, —2,...) and x > 2 can be calculated by the 
following formula: 


— I(r4l) 


I(x) , (a) ξ(α--1) Γ(α -- 1). 


1.) 0.90864 
E A: L(0.7) (n = 


WB:I(35)—2.5-I(2.5) 22.5- 1.5- (L5) = 2.5- 1.5 - 0.88623 = 3.32336. 


1.2981. 
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21.11 BesselFunctions (Cylindrical Functions) 


2 Jo(x) Ji (x) Yo(x) Yı (x) p(x) L(x) Ko(x) Κι(α) 
+1.0000 | +0.0000 —oo --οο +1.000 «0000 oo οο 
0.9975 .0499 | —1.5342 | —6.4590 1.003 | +0.0501 | 2.427 9.8538 
.9900 .0995 1.0181 3.3238 1.010 .1005 | 1.7527 4.7760 
9776 1483 0.8073 2.2931 1.023 .1517 | 1.3725 3.0560 
0.9604 .1960 0.6060 1.7809 1.040 .2040 | 1.1145 2.1844 
+0.9385 2428 0.4445 1.4715 1.063 .2579 | 0.9244 1.6564 
9120 .2867 0.3085 1.2604 1.092 .3137 | 0.7775 1.3028 
.8812 .3290 0.1907 1.1032 1.126 .3719 | 0.6605 1.0503 
0.8463 .3688 | —0.0868 .9781 1.167 .4329 | 0.5653 .8618 
0.8075 .4059 | 4-0.0056 .8731 1.213 .4971 | 0.4867 7165 
+0.7652 4401 | +0.0883 0.7812 1.266 5652 | 0.421 .6019 
.7196 A709 0.1622 0.6981 1.326 .6375 | 0.3656 5098 


4983 | 0.2281 6211 1.394 -T147 | 0.3185 «4946 
.5220 | 0.2865 .5485 1.469 -7973 | 0.2782 3725 
.5419 | 0.3379 0.4791 1.553 3861 | 0. 


1 

1 

9 

δ .5579 | +0.3824 .4123 1.647 9817 

4 .5699 | 0.4204 3476 1.750 1.085 

0 5778 | 0.4520 .2847 | 1.864 1.196 
.34 i .5815 | 0.4774 .2237 | 1.990 
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F(x) | Yæ) | Yi) | (x) | Læ) | Κο(α) | Ks (v) 


.00 
0.3276 .3085 | +0.1479 | 27.24 | 24.34 | 3691 4045 
0.3371 .3216 | 0.1137 | 29.79 | 206.68 | 3308 3619 
0.3432 .3313 | 0.0792 | 32.58 | 29.25 | 2966 3239 
0.3460 3374 | 0.0445 | 35.65 | 32.08 | 2659 2900 
0.3453 .3402 | +0.0101 | 39.01 | 35.18 | 2385 2597 


0.3414 «9995 | —0.0238 | 42.69 | 38.59 | 2139 2326 


0.3343 .3354 | 0.0568 | 46.74 | 42.33 | 1918 2083 
0.3241 3282 | 0.0887) 51.17 | 46.44 | 1721 1866 
0.3110 3177) 0.1192 | 56.04} 50.95 | 1544 1673 
0.2951 3044) 0.1481) 61.38 | 55.90 | 1386 1499 


0.2767 .2882 | —0.1750 | 67.23 | 61.34 | 1244 1344 


0.2559 .2694 | 0.1998 | 73.66 | 67.32 | 1117 1205 
0.2329 .2483 | 0.2223 | 80.72 73.89 | 1003 1081 
0.2081 .2251 | 0.2422 | $88.46 | 81.10 | 09001 | 09691 
0.1816 .1999 | 0.2596 | 96.96 | 89.03 | 08083 8693 


0.1538 1732 | —0.274 


0.1250 1452 | 0.285 
0.0953 1162] 0.294: 
0.0652 0.0864 | 0.30 


0.0047 «0259 | —0.302 


+0.0252 | +0.0042 | 0.299: 
0.0543 0.0339 | 0.293 
0.0826 0.0628 | 0.284 


1 

7 

2 
0.0349 0.0563 | 0.3029 | 153.7 | 142.1 | 04728 5059 

7 

5 

6 : : S 
0.1096 .0907 | 0.2731 | 244.3 | 227.2 | 02772 | 02953 


F0.1352 173 0.2591 | 268.2 | 249.6 | 02492 | 02653 
0.1592 ).1424 0.2428 38: 
0.1813 .1658 0.2243 ; : | 
0.2014 1872 0.2039 | 354.7 | 331.1 | 01811 | 01924 
0.2192 0.2065 0.1817 ) 


F0.2346 0.2235 | —0.1581 | 427.6 | 399.9 | 01465 1554 


0.2476 .2381 | 0.1331 | 469.5 | 439.5 | 01317 | 01396 
0.2580 .2501 | 0.1072 | 515.6 | 483.0 | 01185 | 01255 
0.2657 0.2595 | 0.0806 | 566.3 | 531.0 | 01066 1128 
0.2708 0.2662 | 0.0535 | 621.9 | 583.7 | 009588 | 01014 


+0.2731 .2702 | —0.0262 | 683.2 | 641.6 | 008626 | 009120 


0.2728 .2715 | +0.0011 | 750.5 | 705.4 | 007761 | 008200 
0.2697 0.2700 | 0.0280 | 824.4 | 775.5 | 006983 7374 
0.2641 0.2659 | 0.0544 | 905.8 | 852.7 | 006283 6631 
0.2559 .2592 | 0.0799 | 995.2 | 937.5 | 005654 


F0.2453 .2499 | +0.1043 | 1094 1031 005088 


0.2174 2240 0.1491 | 1321 1247 004121 
0.2004 .2086 0.1691 | 1451 1371 003710 
0.1816 0.1907 | 0.1871 | 1595 1508 003339 3512 


0 

0 
0.2324 0.2383 | 0.1275 | 1202 1134 004579 MED 

0 


5 doo; GOOOO 10000000 0600000000 ANNAN NNNNN OO O00 oo0oo09 ANNANIN σισισισισι 
5 CONRAN BRWHHS CONN BWHHO COND RWHHLOS CONN RWHHO CONN Polio 


mn 


F0.1613 0.1712 | +0.2032 | 1753 1658 003036 3160 
0.1395 1502] 0.2171 | 1927 1824 002706 | 002843 
0.1166 1279 | 0.2287 | 2119 2006 002436 | 0 
0.0928 0.1045 | 0.2379 | 2329 2207 002193 2302 
0.0684 0804 | 0.2447 | 2561 2428 001975 2072 


F0.0435 .0557 | +0.2490 | 2816 2671 001778 | 001865 


1108 21. Tables 


21.12 Legendre Polynomials of the First Kind 


P(x) = 1; P(x) =a; 
ΠΠ τὴ 
Py) = σ(8α” =); P3(x) = 307 - 32); 
1 T. os 
Ῥι(α) = g Sie" — 30x? + 3); P3(x) = 3 (632” — 102? + 15x); 


1 ; 1 Ν : : 
P(x) = το 231 315z* 10545 —5) | Pr(x) = 1g 42977 — 693a? + 315a? — 35x). 


x = Pi(x)| P(x) P(x) P,(x) P(x) P(x) P(x) 
.00 —0.3000 .00 0.3750 0.0000 | —0.3125 .0000 
06 —0.4962 | —0.0747 0.3657 0.0927 | —0.2962 | —0.1069 
4 —0.4850 | —0.1475 0.3379 0.1788 | —0.2488 | —0.1995 
15 —0.4662 | —0.2166 0.2928 0.2523 | —0.1746 | —0.2649 
2 —0.4400 | —0.28 0.2320 0.3075 | —0.0806 | —0.2935 
26 —0.4062 | —0.3359 0.1577 0.3397 | +0.0243 | —0.2799 
30 —0.3650 | —0.3825 | +0.0729 0.3454 1292 | —0.2241 
.35 —0.3162 | —0.4178 | —0.0187 0.3225 .2225 | —0.1318 
E! —0.2600 | —0.44l —0.1130 0.2706 «2926 | —0.0146 
45 —0.1962 | —0.4472 | —0.2050 0.1917 .3290 | 40.1106 
.50 —0.1250 | —0.4375 | —0.2891 | +0.0898 3232 2231 
.55 —0.0462 | —0.4091 | —0.3590 | —0.0282 .2708 -3007 
.6 +0.0400 | —0.36 —0.4080 | —0.1526 ΟΙ 0.3226 
65 0.1338 | —0.2884 | —0.4284 | —0.2705 | +0.0347 0.2737 
xí 0.2350 | —0.1925 | —0.4121 | —0.3652 | —0.1253 | +0.1502 
τὸ 0.3438 | —0.0703 | —0.3501 | —0.4164| —0.2808 | -0.0342 
80 0.4600 | +0.08 —0.2330 | —0.3995 | —0.3918 | —0.2397 
85 0.5838 .2603 | —0.0506 | —0.2857 | —0.4030 | —0.3913 
9 0.7150 A725 | +0.2079 | —0.0411 | —0.2412 | —0.3678 
95 0.8538 7184 0.5541 | 40.3727 | 40.1875 | +0.0112 
1.00 1.0000 1.00 1.0000 1.0000 1.0000 1.0000 
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21.13 Laplace Transformation 
(see 15.2.1.1, p. 770) 


C is the Euler constant: 


oo 


F(p) = [ero dt, f(t) =O fort <0. 


0 


C = 0.577216 (see 8.2.5, 2., p. 513). 


No. F(p) f(t) 

1 0 0 
2 = 1 

p 
3 1 pd 

p^ (n — 1)! 

1 pl 

4 LU gat 

(p — a)" (n — 1)! 
: 1 eft — e% 
5 αι ο 

(p — a)(p — 8) β--α 
6 p Be — ae 

(p — o)(p — 8) B-o 

1 ο-αί . : " 

7 P+ Dap +B pra —3 sin \/ 62 — a? t 
8 D i sinat 

pP +a? 

o cos 3 + psin B 
9 a cos : + peni sin(at + 8) 

p ta 
p 33 2 « E a: i ατα 

0 ΕΤΕ] (s VB — o?t Poe sin 8? — a? ) ect 
1 72 p EO cos at 

pcos B — asin B 
2 ee cos(at + ϱ) 
3 s sinh at 
4 z £ P cosh at 
: 1 (B — ye + (y — a)e® + (a — B)e™ 
5 - 

(p — a)(p — B)(p — η) (a — 8)(8 — Αα — α) 
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No. F(p) f(t) 
16 1 e% — [1 + (a — 8)t] οὔ 
p—a)(p— BP (a- Bf 
i p a e?! — [o + βία — B)t]e?' 
p — a)(p — By (a — By? 
18 p ae% — [ρα — β + βία — βγῆ 8e?! 
p — o)(p — 8} (a — By? 
19 1 a sin Bt — B sin at 
p? + o2)(p? + 8?) aB(a? — 8?) 
20 p cos Bt — cos at 
p? + a?) (p? + 8?) (a? — B?) 
2 2 2 . 
21 4 = 7 5 cos? at 
p(p? + 4o? 
2a? 222 
22 np? + dod) sin’ at 
m c 2 2 . 
23 4 m 7 5 cosh? at 
p(p? — 4o? 
2a? , 
24 5 E Ja?) sinh? at 
p(p? — 4o? 
202) 
25 oP sin at - sinh at 
p! + Δα! 
2 2 2 
26 olp L = ) sin at - cosh at 
p* 4c 4a 
2..9 2 
27 alp Ἴ ) cos αἱ: sinh at 
pt + 4a‘ 
E 
28 £ cos at + cosh at 
pt +4at 
ap x 
29 TETE 5 sinat 
ap t. 
30 TETO 5 sinh at 
31 aß B sinh at — a sinh Bt 
(pi af)? A) TE 


21.13 Laplace Transformation 


1111 


No F(p) f(t) 
32 p cosh at — cosh Bt 
(p? — a?)(p? — 8?) a? — f? 
1 1 
33 | — 
p πί 
1 t 
34 | — 24|— 
PVP π 
1 nl 45 21 , "MS 
35 ΓΝ πηγη 2 (n 0, integer) 
1 1 
36 p 
ρα Ti 
1 
37 Ίνρτα-νρεβ tog αἱ 
f iii Qt at ( ) 
38 7 - sin at 
opm MED 
P P ty 2nt 
/2 
39 — sin at 
Ti 
/2 
40 — cos at 
Tt 
2 
41 — sinh at 
πί 
vp? — a 2 
42 ΞΡ — cosh at 
p -— a? vt 
1 2 vii 2 
43 | ——— . / e dr 
pyp+ta Jan " 
(8—a)t 
1 2e 9 2 
44 [TUIS rg —————: / e" dr 
(p+a)vp + i \/7(B — a) 3 
vat 
ο τη —at , 
45 pepe CADR ΕΙ e] edr 
p Tt Toa 
0 
46 : Jo(ot) (Bessel function of order 0, p. 562) 
4 ————á Jo(a essel function of order 0, p. 
Vp x ai o »” 
47 = ; lo(ot) (modified Bessel function of order 0, p. 562) 
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No. F(p) f(t) 
J a+8 a — B 
48 | —— et. Ty (555) 
(p + o)(p + 8) 2 
1 
49 | —————— e% . Jy (a? — B?t 
VE + 2ap + f? i ( S ) 
el/P sinh 2 /t 
50 ———— 
p/P vm 
z a sin at 
51 | arctan — 
] t 
2a 2 
52 | arctan — ua 2 — sin at - cos Bt 
p — a? + 83 t 
We em E N B sat 
53 | arctan = sin Bt 
aj 
Inp 
54 | — —C -lnt 
p 
[ n 1 1 
55 a —[v(n) -Int], v(m) =14+=4+---+—--C 
prt n! 2 n 
1 2 : 2 
56 (np) (Int-- C? - — 
p 6 
E p-a 1 Bt _ aat 
57 |In D-5 " (e —e ) 
ν 2 
58 | In T = 2artanh — sinh at 
p—c p t 
"NES po? _ cos Bt — cos at 
pe - 03 t 
2 2 f 
p-a cosh fit — cosh at 
60 |I Ὀ λατ ἲς ees 
n Poe 7 
, ga-o?/At 
a e 
61 ον, Rea>0 — — 
2/m tyt 
„—a?/4t 
62 | Lev, Rea>0 : 
VP Tt 
63 rre Rev > —1 | a’J,(at) (see Bessel functi 562) 
-------------, Rev > -1 | a’ ναί) (see Bessel function, p. 562 
Pre 
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No. F(p) f(t) 
(p- vz-«y 
64 πι Rev > —1 o" ναί) (see Bessel function, p. 563) 
=f 0 fort« B 
65 e^? (850, reell) 1 fori β 
. e PV PP +e? 0 for t < £ 
66 Jv 3 od Jo (αντ = BP) fort > f 
" e BV 5’ -α” 0 fort « B 
67 rm Ip (av? — PP) fort > B 
eV OFAN PB) ος 
68 | ———————— -(e«8 p, (9 — 8 fm g 
(p -- o)(p + B) e 2 Ío 5 t? — 62) fort > B 
eb PP +e? 1 ) fort < B 
ν΄ 
63 pP +a? (5 Vp + z) —— J, [ovt — s) fort > 8 
a 
eB p02 1 ij fort < 8 
70 (5 : ; ) Vt? — p? ; : 
p? — a? VP — a2 —, A oa t? — ϱ)) fort» 
τας fort < B 
—e^ +a? Ba 
us i e JB ay/t? — s) fort > 8 
0) fort « B 
79 —B/ p? —o? —Bp Ba z z 
if M : yea" a/t? — 8) fort > f 
73 l-e 0 fort >a 
᾿ D l for0<t<a 
ο-αρ Bp 0 for0<t<a 
74 lfora<t< fp 
p 0 fort > B 
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21.14 Fourier Transformation 
The symbols in the table are defined in the following way: 
C: Euler constant (C — 0.577215...) 


I(z) -= Í etd, Rez>0 (Gamma function, see 8.2.5, 6., p. 514), 
Jo 
ἠνία) = » oa (Bessel functions, see 9.1.2.6, 2., p. 562) 
΄ e nl Γ(ν -ε-τ- 1) oe LAN 3 ᾿ 
K,(z) = 57 (sin(x ν)) (2) —L(z) with (z) = e7 Σπ J,(ze2'") 
(modified Bessel functions, see 9.1.2.6, 3., p. 563), 
1 z cost 
lr = { cost y 
1" s Yt (Fresnel integrals, see 14.4.3.2, 5., p. 757), 
δία) = — | —-dt 
(2) vV2mJo Vt 
μα sint 
Sir) = J d 
Jo d sint (Integral sine, see 14.4.3.2, 2., p. 756), 
sir) = / — dt = Si(x) -- 
Je t 2 
. 9ο cost g 
Ciz) = - / i dt (Integral cosine, see 14.4.3.2, 2., p. 756). 


T 
The abbreviations for functions occurring in the table correspond to those introduced in the corre- 
sponding chapters. 


21.14.1 Fourier Cosine Transformation 


No. | f(t) Έ.(ω) = T f(t) cos(tw) dt 
0 
1 1, 0<t<a | sin(aw) 
i 0, t>a ω 
t, 0<t<l1 D 
2. 2-t, l«t«2 4 (coswsin? 2) w? 
0, t>2 2 
0, 0ct«a 
3. 1 —Ci(aw) 
lc ta 
"E 
4 1 m1 
: 0<t< 2C (aw) 
E ον α T 2 aw 
t mp = 
a 2 Ju 
0, t>a 
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No. | f(t) Ε.(ω) =f f(t) cos(tw) dt 
0 
n yi 0<t<a dent t 
` 7 t>a 2 Jw 
T: (a+t) a>0 ||[-si (ac) sin (aw) — Ci (aw) cos (aw)] 
8. (α-- t) a0 [eos (aw) Ci (aw) + sin (aw) (5 οἱ (ω))) 
9. 24 42)-1 π ew 
(ere) 2 a 
2 gni T sin (aw) 
udi ) 2 ω 
b b bo 
11. Pt + Prete π e™ cos (aw) 
at+t a—t p 
12. P+ (tt? + P+ 7 €^ sin(aw) 
3. | (αὐ {ΘΓ} Ko (aw) 
P gy, 0<t< 
14. la »*. b i Jo (aw) 
9 t>a | 2 
15. |, a, 0<Rev<1 | sin Cz) Γ{-ν)ω”-! 
SST a 
6. |é PET 
_ | et erat 1 adig 
TT. 7 ^ In Paw 
18. | View ER (a? +w?) * cos € arctan (2) 
1\4 
9. e ie a+ (+w?) |? 
2 a? + w? 
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No. | f(t) Fw) = J f(t) cos(tw) dt 
o 
—at 414,2 2|—(n4-1) n+1\ (om 
20. de? πι! α"ΤΊ(α΄ +w?) D (-1)" (2) 
0<2m<ntl 2m a 
: me w 
21. (E tet T (v) (a? + w*)~? cos (v arctan (=)) 
a 
1,1 1 1 1 i 
ad. mcs —llnnü- εστω 
τς a zt ý ) 
j 1 w? 
23. | ea? VT các la 
2 
24 peg T LA 2w (cos /2au — sin v/2au) 
; z 2 Jae 
3 
25. |t 3e t (zem cos y 2aw 
a 
96 Int, 0<t<l _ Siw) 
: 0, t>1 ω 
Int T T 
aa -JZ (c+ zm as) 
Vi ΕΝ T 2 + In 4w 
t T 3 ; : 
28. | (t — a°)! In (9) DS (sin (aw) Ci (aw) — cos (aw) si (aw)) 
a 2a 
29. | (t? — a2)! In (bt) 5 — {sin (aw) [Ci (aw) — In (ab)] — cos (aw) si(aw) } 
2a 
. 1 ΤΙ... fw? ο (wry? 
30. P In (14 t) 3 (o (2)) + (s (2) | 
1 
; = {5 [cos (bw) — cos (aw) 
ω 
31. | In : + -| + cos (bw) Si (bw) + cos (aw) Si (aw) 
— sin (aw) Ci (aw) — sin (bw) Ci (bw) } 
Er 1 N^ 2,24 p w 
32. |e" Int Paw [ec t3 In (a^ + w^) +w arctan (5) 


21.14 Fourier Transformation 


1117 


No. | F(t) Fw) = T f(t) cos(t w) dt 
0 
k a? hA È T —bw -αω 
33. | In (Sc) - (et — ecc) 
e+e? T" 
34. | In Pp m (cos (bw) — ε-“”) 
1 iN? 
35. | = ¢ is ) — 2m si (aw) 
t C= 
In (a? + t? ΄ 2w . 
36. : - = = IG +1n (=)) Ko (ω)] 
2 _ p-aw 
37. |n (14% aire 
g zj 
2 =F 2 
35. dn. i 1 — cos (aw) 
t2 ω 
π 
. 2^ w<a 
39. sin (at) T "n" 
t 1) 
0, w>a 
T — a 5 
t sin (at) δέ © cosh (bw) , w<a 
40. -=a - 
Poe ^ e7™™ sinh (ab) , w>a 
i Τρ (1 — e7% cosh (bw)) w<a 
ar. || Se 2 di , 
t (t2 +b?) d e= sinh (abi, ism 
1 Ru") α-ω 
42. e^" gi t a 9 | 
6 sin (a ) 2 li ΜΞ (α +w)? b2 + (a = w)? 
e™t sin t 2 
43. dioe = arctan ( >) 
t w2 
US 2 
4, 8 (at) bu. n 
i ως 
ts, | ο. mo 


t 


NI + 
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No. | f(t) Fw) = f f(t) cos(tw) dt 
o 
in? 5 (a- 57) ω « 2a 
4g, | 52 (at) 2 2 
F 0, w > 2a 
1 ; 
EF að 8 {(w + 3a) In (w + 3a) 
AT. B +(w — 3a) In |w — 3a| — (w + a) In (w + a) 
—(w — a) In |ω- al} 
a (Ba? — u?) 0<w<a 
in? (at T w^ w=a 
48. "n ) 1 
το 9» a « uw «3a 
0, ω > 3a 
49. 1 — cos (at) lal- g 
t 2 
π 
1 — cos (at) πία-ω), w<a 
50. E 2 
0. wa 
m e~% cosh (bw) 
51, | 008 (at) 9 b ? ad 
3 [ΕΝ m e~” cosh (ab) " 
> 5 w>a 
b 1 1 
52. | e^" cos (at 5 H 
᾽ bc 2 li (a-w)? b+(a+w)? 
1 a? +w? 
53. | e™™ cos (at) Ξ [re 4b cosh (=) 
2Vb 
54 f tan (at) osh (bw) (1 + e?%) =! 
δά. | ggg ne π cosh (bw eran) 
ΞΕ t 7 t( t) " 3i (b. ) (εαν. 1) 1 
55. | aa cot (a π cosh (bw) (e 
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No. | f(t) Fw) = f f(t) cos(tw) dt 
D 
"o 2 
56. | sin (at?) Va c (=) sin ( )) 
l mw row? 
5 5] ο 72 a4} 88 2s 
57. | sin [α(1 — 17)] 5 "H cos (« *t4* <) 
5 sin (at?) π w? w? UT. a? 
58. B z7” S Ja σ Ta + να sin Ἴ + da 
sin (at?) πι ANE a 
‘s e e G0] -Ρ(ώ] 
E (a 4- gy e-i aw? (a? + b?)-1 
60. | e-"* sin (bt?) g [nl " b bu? 
sin |5 arctan |z Tae +) 
1 [m- w? w? 
: c (nd 1 m m 
61. | cos (at?) 2 V 2g feos (5) + sin (=) 
62. | cos [a(1 — 09) τω (a+ 242 
: s [la(1 -- ¢? =4/— sin [a+ —+— 
2 4 4a 
ve (a? + gi e-i aw? (a? +°) 
63. | e-"* cos (bt?) bu? M b 
* COS FIT η τη arctan a 
y a T 
64. ; sin (2) 3 Jo (2 Vaw) 
1 a 1 fm 5 
5 --- = Μος — η... o-2Vauw 
65. Ji sin (2) 5 " 5 [sin (2 vaw) + cos (2 Vaw) — e ] 
- 3 p (5) Tu [s (2 Va) + : iliud. ο πο] 
: Vi sin (7 9 Vg Sin aw) + cos aw) + € 
67. J cos (2) 5 VŠ [cos (2 Vaw) — sin (2 Vaw) + ge 
68 lY cos (2) ΠΗ [cos (2 Vaw) — sin (2 Vaw) + po 
i vt BV 2 V 2a i i Í 


1120 21. Tables 

No. | f(t) Fw) = πο cos(t w) dt 

1 7T a? a? ? a? 
0 --- si A 2 7 si , 208 
69. Vi sin (a vi) 2 le (5) sin (s ) E ) cos ο] 

5 Ja (a? + E e eb (uh) 
70. | e^" sin (avt) m a2w 4 ω 
-£ COS IPFA 2 arctan (2) 

sin (avt) [eG a? 
τι. dd π E (ο +C ym 
- 1 π T dg 
(2. Vi cos (a Vt) E sin G + is) 

E Vf (a? 4- why e-i ab? (a? + ὑ2 Γ᾽ 

e 
73. Wr 9 (ὑντ) bw 1 a 

cos | ——— — — arctan | — 
4(a?+w?) 2 a 
ὃ a? 

74. | e-* V! cos (a vt) VT a (2w) 2 e Σω 

e avt Ji Vi . a2 
75 5 — si σε w 
75. JA [cos (a Vt) — sin (av t)] PT 


21.14.2 Fourier Sine Transformation 


oo 


No. | f(t) Fi(w) f f (t) sin(t w) dt 
ο 
1 1, O0<t<a 1 — cos (aw) 
` ] t>a "m 
Es 0<t<l w 
2. 2-t 1«t«2 dw? sin w sin? (5) 
0, t>2 2 
1 π 
3. |— ee 
t 2 
: Oat 
4| εἴτα. | $i (aw) 
Oi, t>a 
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No. | f(t) Fw) = T f(t) sin(tw) dt 
o 
0, 0<t<a 
5. 1 — si (aw) 
Erg t>a 
i 
6 1 π1 
BE 2 yw 
s 0<t< 2 S (aw) 
-> a T aw 
Ti t = 
E 2^ fa 
0, t>a 
8 : ας, T 1 -- 25 (αω) 
f W t>a 2 Jw 
3 
9. (=) πλω 
10. (at)! (a > 0) [sin (aw) Ci (aw) — cos (aw) si (αω)] 
11. | (a= t)! (a > 0) sin (aw) Ci(aw) — cos (aw) (5 + Si(aw)) 
2: E ων 
a? + t? 
PO Iz. ; ; 
3. | (@- t) - [sin (aw) Ci(aw) — cos (aw) Si(aw) 
a 
b b ETE 
14. Pit PFs π e™ sin (aw) 
z üt α-τ ΚΝ 
15. | x1 (+02 B (a-t v e™ cos (aw) 
. i T 
16. Zg Zz 908 (aw) 
7 1 π 1-— cos (aw) 
| [t(a? —8) 2 a? 
8 1 m Sg 
τ | t(a? +4?) 2 æ 
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No. | f(t) Fi(w) = f f(t) sin(tw) dt 
0 
19. | t^", 0« Rer «2 cos Cz) DLüu-v)w"! 
ω 
90. | e-*t J 
a? +w? 
;—at 
31. | arctan (=) 
t a 
—at „—bt 1 b? + w? L 
22. š 8 : ls ω ln PUE = + b arctan (2) — a arctan (2) 
; "E 3 r 
23. | Vte-*t M (a? γω”) 4 sin l5 arctan ©) 
2 2 a 
1 
j p 2 
nis e7% (a? acco —a 
vt a? +w? 

25. | f^ g-at 1a? (a? + w2) 0+ E (-1)" n+1 en 
5. e n la?*!(a? +w? — = 
i m=0 2m+1 a 
1 v—1 ,—at 2 21-4 a: ; ω 

26. |t" te Γ(ν) (a? +w?)? sin L arctan (8) 

a 
ET 1 
27. |e 2" (1— e)! -5 tanh (πω) 
" πω uw 
28. | te ^^ = ^ da 
: a 4a ; 
l a T z 
29. |t 2e t o ὃν 204 (cos V/2aw + sin ν΄2αω] 
2ω 
3 a 
30. |t 2e t "E sin v 2auw 
ω 
3i Int, 0<t<l Οἴ(ω) -- C —Inw 
l 0, t>1 ω 
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No. | f(t) F,(w) = T f(t) sin(tw) dt 
o 
32. = - (C 4- In w) 
. Int T [π 
3a | z|g-C-m4 
34. | t(? — a?) In (bt) 5 [cos (aw) (In (ab) — Ci(aw)) -- sin(aw) : si(aw)] 
35. t(? —a?)! In (4) -5 [cos (aw) Ci(aw) + sin (aw) si (aw)| 
a 2 
1 ω 1 ς ‘ 
> | ρτα adin d εως η 2 1 9 
36. |e“ Int "XT |a arctan (2) Cw 5 In (af + w^) 
1 a : | 
, — {in (5) + cos (bw) Ci(bw) — cos (aw) Ci(aw) 

TN w 

37. | In s ; + sin (bw) Si(bw) — sin (aw) Si(aw) 
+5 [sin (bw) + sin (az)]) 
. at+t p 
38. | In αι — sin (aw) 
2 2 
39. : fe E τ) = [1 — cos (aw) — aw si(aw)| 
p a—t 
αἰ. {2 9 ENS 1 r 

40. | In E ==) TT NVP- 1 sin (2) 

a? +t -- ω 2 

a? 2 : ς : 
41. |In|1 - 5 — [C + In (aw) — cos (aw) Ci(aw) — sin (aw) Si(aw)] 
ω 

a? +(b+t)? 2T = = bi, 
42. | In (5 πα ἘΝῚ uw sin (bw) 
43. | = In [1 — αι -r Ci (2) 

a 
1 a? : 

44. : ln 1 z π [C + In (aw) — Ci(aw)] 
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No. | f(t) Fi(w) = f f(t) sin(tw) dt 
0 
45. sin (at) 1 In w+a 
t 2 wa 
π 
sin (at) χω, O<w<a 
46. ; A 
12 π 
σα, w>a 
47 sin (zt) sin w O<S<w<T 
“| T-# 
0, um 
π e . 
sin (at) = sinh (bw) , O<w<a 
4g, | 9n wan) 2 
Doe me | 
uta sinh (ab) , wa 
1 1 1 
49. | e^" sin (at zb 
ΗΝ. 2 E *t(a—w)? BP+(a+ 3 
50 c^" sin (at) Lu 03 (oa? 
: t 4 02 + (w—a)? 
. 1 1 α΄ γω: 
51. | εδ sin (at) * DE 1 b sinh (5) 
π 
4 , 0 « w « 2a 
sin? (at) : 
52. n s ; w = 2a 
0, w > 2α 
0, 0cwca-b 
sin (at) sin (bt 
53. a 2 a-b<w<a+tb 
0, w>atb 
; [0+ 2a) nw +20) 
" — |(w + 2a) 1n (w + 2a 
54, | 9m ing 4 
15 1 
(w — 2a) In |w — 2α| gv nw 
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No. | f(t) F,(w) = | f(t) sin(tw) dt 
o 
π ω 
sin? (at Tu (21-2), 0 « w « 2a 
55. = ) 4 2 
5 a’, ω > 2a 
0, O<w<a 
56. | «08 (at) T ess 
t 4 
π 
σα w>a 
2 
- t cos (at) m 5 e sinh (bw) , O<w<a 
57. | —— 
b? + ¢? 
5 e™ cosh(ab) , w>a 
; π w? w? w? w? 
58. | sin (at? 8 + si 
58. | sin (at*) 2" eos (=) σ (=) sin (=) S (5) 
: 2 2 2 
κα | Sin (at^) 7 |o (€ ω 
id t zl im 5 us 
60. | cos (at?) " ara C 2 cos 2 S 2 
τ 2a | Να) ^ Ma S Ma 4a 
cos (at?) T w?\ w? 
9h t 2 le e s 4a 
62. | e-*V* sin (avt) /5 NET e^ 


21.14.3 Fourier Transformation 


Although F(w) can be represented by the Fourier cosine transformation F, and the Fourier sine trans- 
formation F, according to (15.75a), here we give some direct transforms F(w). 


No. | f(t) F(w) = f eM F(t) dt 
1. | 6(t) (Dirac 6 function) 1 
2. | 6 (9 Go)" 
3. | ó0)(t — α) (iw)"e* = (n =0,1,2,...) 
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No. | f(t) F(w) = f e f(t) dt 
4. | 1 2πδ(ω) 
5. pu ani" (w) (n-—1,2,...) 
H(t) 2-1 fo ἰ»0 1 F 
6. — + πδ(ω) 
H(t)=0 for t<0 iw 
(Heaviside unit step function) 
n! 
7T. | R(t) fap + mi"ó0(u) (n=1,2,...) 
e* H(t) e^ for t»0 
8. . - (a > 0) 
e H(t) 20 for t«0 a+iw 
9 1 e /(4a) e% (α»0) 
© Μπα ! 
1 1 
10. | z-e- 50 
2a. w*-ra c ) 
1 1 παω 
‘ —e 
Pra a 
t 
12. Pag —ire—“lsign w 
" H(t--a)—- H(t-a) 21 for |t| «a 2sinaw 
^| H(tra)—-H(t-a)-0 for |t| 5 a ω 
14. | eiat 2πδ(ω — a) 
15. | cosat π]δ(ω + a) + δ(ω — a)] 
16. | sinat im[d(w + a) — δ(ω — a)] 
1 π 
τ cosht eos 
2 
1 πω 
.|— --ἶπ tanh — 
i sinht e d 
9. sinat? - e + z) (a > 0) 
20. | cosat? a e - 7 (a » 0) 
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21.14.4 Exponential Fourier Transformation 


Although the exponential Fourier transformation F,(w) can be represented by the Fourier transforma- 


1 
tion F(w) according to (15.77), i.e., Γι(ω) = gf (~o) here we give some direct transforms. 


No. | f(t) Fw) =} F FO εἶ αι 
f(t) 2A for a<t<b dd 22m 48 
1. . == (ee E E) 
f(t) 20 otherwise 2w 
f(t) 2t" for O<t<b 1 n al 
2. f(t) — otherwise 2 nm! (iw) — ev EE (—iw)" p 
(1:24 sa) m=o T^: 
1 π v—1 ,—aw fc " 0 
3. Tc XGA Rev >0 τω” i: Or Gh > 
(a+it) for w <0 
1 0 for w 0 
4. τς Πεν >0 ‘a 
(a —it) τω (ωδή te for w <0 
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21.15 Z Transformation 


For definition see 15.4.1.2, p. 794, for rules of calculations see 15.4.1.3, p. 795, for inverses see p. 797 
No. | Original Sequence fn Transform Convergence Region 
F (z) = Z (fn) 
1 1 À 2] » 
g=] 
2 | (-1)" - PES 
pu 21 
3 |n G-I z| > 
4 z(z+1) x 
4 [mn -I z| > 
5 [p νε... ne 
(2-1) 
à 2 
G nee" z| > |e*| 
ge 
7 |a? Z z| > |α| 
z—a 
8 = ež z|>0 
n 
9 |na" G T z| > |a] 
, glz + 
10 |n?a" 1 " z| > |α| 
z-a) 
n z 
11 (;) G-p De 2151 
k 1\* 
12 ( ) (1 + =) 250 
n z 
z sin b 
13 | sin b TS z| 1 
3 | sin bn iz ozsos Bu] p 
— cos b 
14 | cos bn απο ην z|>1 
z? — 2z cos b+ 1 
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No. | Original Sequence fn Transform Convergence Region 
F (z) = Z (fn) 
o ze" sin b τ " 
15 | e^" sin bn FCMPCR WELT: z| > |e*| 
- z(z — ο” cos b) j 
16 | ε΄" cos bn A Ize cos D eh z| > |e*| 
zsinh b 
17 | sinh bn B M z| > max(]e*|, |e-*|) 
z(z — cosh b) 
18 sh bi > oS z| > max(|e?|, |e7? 
cosh bn Bore Bl max(|e?|, |e~®|) 
zasinh b 
19 | a” sinh bn Ses τα z| > max(|ae?|, |ae-*|) 
: z(z — a cosh b 
20 |a" cosh bn "e z| > max(|ae?|, |ae-*|) 
n=O f'ur nk, 
21 a = | >0 
k= z 
« 2z 
22 fan = 0, fona =2 amd z| > 
23 fon = 0, 22(2? + 1) FT 
fon41 = 2(2n + 1) (2? — 1)? 
2n = 0, z-—1 
24 f x In mE z| > 
fona = 2n 4 1 T 
nT z2 
25 | cos — z| > 
5 | cos = 241 
2 
26 | (n+ 1) e” "EE z| > |e 
(z= e) 
ch) _ ean) 22 b 
2T P Gee e) z| > max(|e*], |e?|), a zz b 
ï E 
28 6 (n — 1) n(n t 1) z 1 
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oar Transform : 
No. | Original Sequence fn Convergence Region 
F(z) = Z (fn) 

1 Zz 

29 | fo=0, τς, n21 In jz| >1 
n p=] 

: (-1" sod 

30 Qn I Jz sin "- |z| >0 

"I 1 

1 OS —— z| > 0 
3 (ny! cos τα |z| >0 
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21.16 Poisson Distribution 


For the formula of the Poisson distribution see 16.2.3.3, p. 817. 


λ 

k 01 0.2 0.3 0.4 5 0.6 

0 0.904837 | 0.818731 | 0.740818 | 0.670320 | 0.606531 | 0.548812 
1 0.090484 | 0.163746 | 0.222245 | 0.268128 | 0.303265 | 0.329287 
2 0.004524 | 0.016375 | 0.033337 | 0.053626 | 0.075816 | 0.098786 
3 0.000151 | 0.001091 | 0.003334 | 0.007150 | 0.012636 | 0.019757 
4 0.000004 | 0.000055 | 0.000250 | 0.000715 | 0.001580 | 0.002964 
5 0.000002 | 0.000015 | 0.000057 | 0.000158 | 0.000356 
6 0.000001 | 0.000004 | 0.000013 | 0.000035 
7 0.000001 | 0.000003 

λ 

k 0.7 0.8 0.9 1.0 2.0 3.0 

0 0.496585 | 0.449329 | 0.406570 | 0.367879 | 0.135335 | 0.049787 
1 0.847610 | 0.359463 | 0.365913 | 0.367879 | 0.270671 | 0.149361 
2 0.121663 | 0.143785 | 0.164661 | 0.183940 | 0.270671 | 0.224042 
3 0.028388 | 0.038343 | 0.049398 | 0.061313 | 0.180447 | 0.224042 
4 0.004968 | 0.007669 | 0.011115 | 0.015328 | 0.090224 | 0.168031 
5 0.000696 | 0.001227 | 0.002001 | 0.003066 | 0.036089 | 0.100819 
6 0.000081 | 0.000164 | 0.000300 | 0.000511 | 0.012030 | 0.050409 
7 0.000008 | 0.000019 | 0.000039 | 0.000073 | 0.003437 | 0.021604 
8 0.000001 | 0.000002 | 0.000004 | 0.000009 | 0.000859 | 0.008102 
9 ).000001 | 0.000191 | 0.002701 
10 0.000038 | 0.000810 
11 0.000007 | 0.000221 
12 0.000001 | 0.000055 
13 000013 
14 000003 
15 000001 
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(continuation) 
A 

k 4.0 5.0 6.0 7.0 8.0 9.0 

0 0.018316 .006738 0.002479 000912 0.000335 .000123 
1 0.073263 .033690 0.014873 .006383 0.002684 .001111 
2 0.146525 .084224 0.044618 .022341 .010735 .004998 
3 0.195367 .140374 0.089235 .052129 .028626 .014994 
4 0.195367 .175467 0.133853 .091126 .057252 .033737 
5 0.156293 .175467 0.160623 127717 0.091604 .060727 
6 0.104194 1460229 0.160623 «149005 0.122138 .091090 
7 0.059540 104445 0.137677 0.149003 .139587 .117116 
8 0.029770 .065278 0.103258 0.130377 .139587 .131756 
9 0.013231 .036266 0.068838 0.101405 .124077 .131756 
0 0.005292 015195 0.041303 .070983 0.099262 .118580 
1 0.001925 8242 0.022529 .045171 0.072190 .097020 
2 0.000642 3434 0.011264 0.026350 .048127 .072765 
3 0.000197 1321 0.005199 0.014188 029616 .050376 
14 0.000056 0472 0.002228 0.007094 016924 .032384 
5 0.000015 0157 0.000891 3311 0.009026 .019431 
6 0.000004 0049 0.000334 1448 0.004513 .010930 
7 0.000001 0014 0.000118 0596 002124 5786 
8 0004 0.000039 0232 .000944 2893 
9 0001 0.000012 0085 .000397 1370 
20 0.000004 0030 0.000159 617 
21 0.000001 .000010 0.000061 264 
22 0.000003 .000022 108 
23 0.000001 000008 042 
24 «000005 016 
25 0.000001 006 
26 002 
27 001 
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21.17 Standard Normal Distribution 


For the formula of the standard normal distribution see 16.2.4.2, p. 819. 
21.17.1 Standard Normal Distribution for 0.00 € x < 1.99 

x P(x) cz P(x) x P(x) x P(x) z | B(x) 
0.00 | 0.5000 | 0.20 | 0.5793 | 0.40 | 0.6554 | 0.60 | 0.7257 || 0.80 | 0.7881 
0.01 | 0.5040 .21 | 0.5832 | 0.4 0.6591 | 0.61 | 0.7291 .81 | 0.7910 
0.02 | 0.5080 22 5871 A 0.6628 || 0.62 1324 .82 | 0.7939 
0.03 | 0.5120 .23 | 0.5910 4 0.6664 | 0.63 | 0.7357 .83 | 0.7967 
0.04 | 0.5160 .24 | 0.5948 A 0.6700 | 0.64 | 0.7389 .84 | 0.7995 
0.05 | 0.5199 .25 | 0.5987 4 0.6736 | 0.65 | 0.7422 .85 | 0.8023 
0.06 | 0.5239 .26 | 0.6026 E 0.6772 | 0.66 | 0.7454 .86 | 0.8051 
0.07 | 0.5279 27 | 0.6064 4 0.6808 | 0.67 | 0.7486 .87 | 0.8079 
0.08 | 0.5319 .28 | 0.6103 4 0.6844 | 0.68 | 0.7517 .88 | 0.8106 
0.09 | 0.5359 29 | 0.6141 4 0.6879 | 0.69 | 0.7549 .89 | 0.8133 
0.10 | 0.5398 | 0.30 | 0.6179 | 0.50 | 0.6915 || 0.70 | 0.7580 || 0.90 | 0.8159 
0.11 | 0.5438 E .6217 51 | 0.6950 | 0.71 7611 91 .8186 
0.12 | 0.5478 92 .6255 .52 | 0.6985 | 0.72 .1642 .92 | 0.8212 
0.13 | 0.5517 .33 | 0.6293 .53 | 0.7019 | 0.73 | 0.7673 .93 | 0.8238 
0.14 | 0.5557 .34 | 0.6331 54 | 0.7054 | 0.74 | 0.7704 .94 | 0.8264 
0.15 | 0.5596 .35 | 0.6368 .55 | 0.7088 | 0.75 | 0.7734 .95 | 0.8289 
0.16 | 0.5636 .36 | 0.6406 .56 | 0.7123 | 0.76 | 0.7764 96 | 0.8315 
0 
0 
0 


COBNOOKWNH 


AT | 0.5675 -37 | 0.6443 ΟΙ | 0.7157 | 0.77 | 0.7794 97 | 0.8340 
18 | 0.5714 38 | 0.6480 .58 | 0.7190 | 0.78 | 0.7823 .98 | 0.8365 
19 | 0.5753 39 | 0.6517 .59 | 0.7224 | 0.79 | 0.7852 99 | 0.8389 


x P(x) x P(x) x P(x) x P(x) cz P(x) 
1.00 | 0.8413 | 1.20 | 0.8849 | 1.40 | 0.9192 | 1.60 | 0.9452 | 1.80 | 0.9641 
1.01 | 0.8438 .21 | 0.8869 Al | 0.9207 | 1.61 | 0.9463 | 1.81 9649 
1.02 | 0.8461 .22 | 0.8888 .42 | 0.9222 || 1.62 | 0.9474 | 1.82 .9656 
1.03 | 0.8485 .23 | 0.8907 .43 | 0.9236 | 1.63 | 0.9484 | 1.83 9664 
1.04 | 0.8508 24 .8925 .44 | 0.9251 | 1.64 | 0.9495 | 1.84 .9671 
1.05 | 0.8531 .25 .8944 .45 | 0.9265 | 1.65 | 0.9505 | 1.85 9678 
1.06 | 0.8554 26 .8962 .46 | 0.9279 | 1.66 | 0.9515 | 1.86 9686 
1.07 | 0.8577 27 | 0.8980 AT | 0.9292 | 1.67 | 0.9525 | 1.87 | 0.9693 
1.08 | 0.8599 .28 | 0.8997 .48 | 0.9306 | 1.68 | 0.9535 | 1.88 | 0.9699 
1.09 | 0.8621 .29 .9015 49 | 0.9319 | 1.69 | 0.9545 | 1.89 9706 
1.10 | 0.8643 | 1.30 | 0.9032 | 1.50 | 0.9332 || 1.70 | 0.9554 | 1.90 | 0.9713 
1.11 | 0.8665 31 | 0.9049 51 | 0.9345 | 1.71 | 0.9564 | 1.91 9719 
1.12 | 0.8686 .32 | 0.9066 .52 | 0.9357 | 1.72 | 0.9573 | 1.92 9726 
1.13 | 0.8708 -33 | 0.9082 .53 | 0.9370 | 1.73 | 0.9582 | 1.93 9732 
1.14 | 0.8729 34 9099 .54 | 0.9382 14 | 0.9591 | 1.94 9738 
1.15 | 0.8749 35 9115 .55 | 0.9394 15 | 0.9599 | 1.95 9744 
1.16 | 0.8770 -36 | 0.9131 .56 | 0.9406 | 1.76 | 0.9608 | 1.96 9750 
1.17 | 0.8790 -37 | 0.9147 ΟΙ | 0.9418 | 1.77 | 0.9616 | 1.97 | 0.9756 
1.18 | 0.8810 38 | 0.9162 .58 | 0.9429 | 1.78 | 0.9625 | 1.98 | 0.9761 
1.19 | 0.8830 99 9177 .59 | 0.9441 | 1.79 | 0.9633 | 1.99 9767 
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21.17.2 Standard Normal Distribution for 2.00 € α < 3.90 


x P(x) cz P(x) x P(x) x P(x) cz P(x) 
2.00 | 0.9773 | 2.20 | 0.9861 | 2.40 | 0.9918 | 2.60 | 0.9953 || 2.80 | 0.9974 
2.01 | 0.9778 | 2.21 .9864 | 2.41 .9920 || 2.61 | 0.9955 | 2.81 .9975 
2.02 | 0.9783 | 2.22 .9868 | 2.42 .9922 || 2.62 | 0.9956 | 2.82 .9976 
2.03 | 0.9788 | 2.23 9871 | 2.43 .9925 | 2.63 | 0.9957 | 2.83 9977 
2.04 | 0.9793 | 2.24 | 0.9875 | 2.44 | 0.9927 | 2.64 | 0.9959 | 2.84 | 0.9977 
2.05 | 0.9798 | 2.25 | 0.9878 | 2.45 | 0.9929 | 2.65 | 0.9960 | 2.85 | 0.9978 
2.06 | 0.9803 | 2.26 .9881 | 2.46 .9931 | 2.66 | 0.9961 | 2.86 9979 
2.07 | 0.9808 | 2.27 9884 | 2.47 .9932 | 2.67 | 0.9962 | 2.87 | 0.9979 
2.08 | 0.9812 | 2.28 .9887 | 2.48 .9934 | 2.68 | 0.9963 | 2.88 9980 
2.09 | 0.9817 | 2.29 | 0.9890 | 2.49 | 0.9936 | 2.69 | 0.9964 | 2.89 | 0.9981 
2.10 | 0.9821 | 2.30 | 0.9893 | 2.50 | 0.9938 | 2.70 | 0.9965 || 2.90 | 0.9981 
2.11 | 0.9826 | 2.31 .9896 | 2.51 .9940 || 2.71 | 0.9966 | 2.91 .9982 
2.12 | 0.9830 | 2.32 .9894 | 2.52 .9941 || 2.72 | 0.9967 || 2.92 .9983 
2.13 | 0.9834 | 2.33 9901 | 2.53 .9943 | 2.73 | 0.9968 | 2.93 9989 
2.14 | 0.9838 | 2.34 | 0.9904 | 2.54 | 0.9945 | 2.74 | 0.9969 | 2.94 | 0.9984 
2.15 | 0.9842 | 2.35 | 0.9906 | 2.55 | 0.9946 | 2.75 | 0.9970 | 2.95 | 0.9984 
2.16 | 0.9846 | 2.36 .9909 | 2.56 9948 | 2.76 | 0.9971 | 2.96 9985 
2.17 | 0.9850 | 2.37 | 0.9911 | 2.57 9949 | 2.77 | 0.9972 | 2.97 | 0.9985 
2.18 | 0.9854 | 2.38 9919 | 2.58 9951 | 2.78 | 0.9973 | 2.98 9986 
2.19 | 0.9857 | 2.39 | 0.9916 | 2.59 .9952 | 2.79 | 0.9974 | 2.99 | 0.9986 

x P(x) cz P(x) x P(x) x P(x) cz P(x) 
3.00 | 0.9987 | 3.20 | 0.9993 | 3.40 | 0.9997 || 3.60 | 0.9998 | 3.80 | 0.9999 
3.10 | 0.9990 | 3.30 9995 | 3.50 | 0.9998 | 3.70 | 0.9999 || 3.90 .9999 
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21.18 x? Distribution 
For the formula of the x? distribution see 16.2.4.6, p. 822. 


x? Distribution: Quantile 2 ,, 


Degree 
of Probability a 
Freedom 
m 0.99 0.975 0.95 005 0025 001 
1 0.00016 0.00098 . 0.0039 3.8 5.0 6.6 
2 0.020 0.051 0.103 6.0 7.4 9.2 
3 0.115 0.216 0.352 7.8 94 11.3 
4 0.297 0.484 0.711 95 111 13.3 
5 0.554 0.831 1.15 TI 128 151 
6 0.872 1.24 1.64 126 144 16.8 
7 1.24 1.69 2.17 141 16.0 185 
8 1.65 2.18 2.73 155 175 20.1 
9 2.09 2.70 3.33 169 19.0 217 
10 2.56 3.25 3.94 183 205 232 
11 3.05 3.82 4.57 19.7 21.9 24.7 
12 3.57 4.40 5.23 21.0 233 26.2 
13 4.1 5.01 5.89 224 24.7 27.7 
14 4.66 5.63 6.57 23.7 26.1 29.1 
15 5.23 6.26 7.26 25.0 27.5 30.6 
16 5.8 6.91 7.96 26.3 28.8 320 
17 6.41 7.56 8.67 27.0 90.2 334 
18 7.0 8.23 9.39 28.9 31.5 34.8 
19 7.63 8.91 0.1 30.1 32.9 36.2 
20 8.26 9.59 0.9 314 34.2 37.6 
21 8.90 0.3 1.6 32.7 35.5 38.9 
22 9.54 1.0 2.3 33.9 36.8 40.3 
23 0.2 IT 3.1 352 381 41.6 
24 0.9 2.4 3.8 36.4 394. 43.0 
25 1.5 3.1 4.6 37.7 40.6 443 
26 2.2 3.8 5.4 38.9 41.9 45.6 
26 2.9 4.6 6.2 40.1 43.2 447.0 
28 3.6 5.3 6.9 41.3 445 48.3 
29 4.9 6.0 τπτ 42.6 45.7 49.6 
30 5.0 6.8 8.5 43.8 47.0 50.9 
40 22.2 24.4 26.5 55.8 59.3 63.7 
50 29.7 32.4 34.8 67.5 714 76.2 
60 37.5 40.5 43.2 79.1 83.3 884 
70 45.4 48.8 51.7 90.5 950 100.4 
80 53.5 57.2 60.4 101.9 106.6 112.3 
90 61.8 65.6 69.1 113.1 118.1 1241 
100 70.1 74.2 77.9 124.8 129.6 135.8 
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21.19 Fisher F Distribution 
For the formula of the Fisher F distribution see 16.2.4.7, p. 823. 


Fisher F Distribution: Quantile [πιο for a = 0.05 


ma; 
m2 
1 2 3 4 5 6 8 12 24 30 40 oo 

1 |161.4 | 199.5 | 215.7 | 224.6| 230.2 | 234.0 | 238.9 | 243.9 | 249.0 | 250.0 | 251.0 | 254.3 
2 | 18.51) 19.00] 19.16 |19.25 | 19.30 | 19.33 | 19.37 | 19.41 | 19.45 | 19.46 | 19.47 | 19.50 
3 |10.13| 9.55} 9.28 | 9.121 9.01] 8.94] 8.85| 8.74] 8.64| 8.62| 8.59| 8.53 
4 7.71] 6.94| 6.59 | 6.39| 6.26 6.16} 6.04| 5.91] 5.77| 5.75) 5.72| 5.63 
5 6.61) 5.79| 5.41 519] 5.05) 4.95 | 4.82) 468) 4.53) 4.50| 4.46] 4.36 
6 5.99| 5.14| 4.76 | 4.53| 4.39| 4.28) 4.15 4.00} 3.84) 3.81] 3.77] 3.67 
7 5.59) 4.74) 4.35 | 4.12) 3.97] 3.87] 3.73] 3.57] 3.41) 3.38} 3.34] 3.23 
8 5.32] 4.46) 4.07 3.84] 3.69} 3.58) 3.44) 3.28 3.12) 3.08} 3.05) 2.93 
9 5.12) 4.26) 3.86 | 3.63) 3.48] 3.37] 3.23} 3.07] 2.90) 2.86] 2.83] 2.71 
0 496] 4.1 3.71 3.48| 3.33, 3.22| 3.07) 2.01] 2.74| 2.0. 2.66) 2.54 
il 4.84) 3.98) 3.59 | 3.36) 3.20] 9.091 2.95 | 2.79} 2.61) 2.57, 2.53} 2.40 
2 4.75] 3.89] 3.49 3.26) 3.11} 3.00) 2.85) 2.698 2.51) 247 2.43) 2.30 
3 4.67) 3.81] 3.41 3.18| 3.08. 2.92) 2.77) 2.60) 2.42) 2.388) 2.34) 2.21 
4 4.60] 3.74] 3.34 811) 2.96 2.85) 2:70) 2.53) 2.35) 2.381) 2.27) 2.13 
5 4.54] 3.68) 3.29 3.06] 2.90] 2.79) 2.64) 2.48) 2.29) 2.25} 2.20) 2.07 
6 4.49) 3.63] 3.24 801) 2.85 2.74) 2.50) 2.42 2.24) 2.19) 2.15) 2.01 
7 4.45) 3.59) 3.20 2.90 2.81) 2.70] 2.55} 2.38) 2.19] 2.15) 2.10] 1.96 
18 4441, 3.55] 3.16 2.98 2.77) 2.66] 2.51] 2.34) 2.15) 2.11) 2.06] 1.92 
9 4.38| 3.52) 3.13 2.90 2.74) 2.63] 2.48) 2.31) 211, 2.07) 2.03] 1.88 
20 4.35] 3.49} 3.10 2.87| 2.71) 2.60] 2.45} 2.28) 2.08) 2.04); 1.99] 1.84 
21 4.32| 3.47] 3.07 3.84) 2.68) 2.57, 2.442) 2.25) 2.05) 2.01) 1.96] 1.81 
22 4.30] 3.44} 3.05 282) 2.66) 2.55) 2.40) 2.23) 2.03) 1.98) 1.94] 1.78 
23 4.28) 3.42} 3.03 2.80] 2.64) 2.53] 2.87] 2.20) 2.00, 1.96 91] 1.76 
24 4.200| 3.40} 3.01 2.78| 2.62) 2.51| 2.36 | 2.18| 1.98] 1.94| 1.89] 1.73 
25 4.24| 3.39 | 2.99 2.16. 2.60) 2.49| 2.34 2.16| 1.96] 1.92) 1.87] 1.71 
26 4.383, 3.37| 2.98 2.14) 2.59) 2.47, 23832) 2.15) 1.95) 1.90) 1.85, 1.69 
27 4.211 3.35] 2.96 2.738| 2.57) 2.46) 2.31 2.13) 1.83) 1.88) 1.84, 1.67 
28 4.20) 3.84] 2.95 2.71| 2.56) 2.45) 2.29) 2.12) 1.91) 1.87) 1.82] 1.65 
29 4.18] 3.33] 2.93 2.10. 2.55) 2.43) 2.28) 2.10); 1.90) 1.85) 1.80] 1.64 
30 4.17| 8.32| 2.92 2.69] 2.53) 2.42) 2.27} 2.09) 1.89] 1.84) 1.79] 1.62 
40 4.08] 3.23] 2.84 2.61] 2.45) 2.34) 2.18) 2.00) 1.79] 1.74) 1.69] 1.51 
60 4.001 3.15) 2.76 2.93 | 2.37) 2.25) 210) 1.92) 1.70) 1.65) 1.59] 1.39 
125 3.92] 3.07] 2.68 2.44} 2.29) 2.17) 2.01] 1.83) 1.60) 1.55) 1.49) 1.25 
oo 3.84] 3.00} 2.60 2.397 | 2.21) 2.10) 1.94 1.75) 1.52) 1.46) 1.39] 1.00 
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Fisher F Distribution: Quantile ώμο for a = 0,01 
my 
m2 
1 2 3 4 5 6 8 12 24 90 40 oo 

1 | 40521 4999} 5403| 5625| 5764| 5859| 5981| 6106| 6235 | 6261 | 6287] 6366 
2 | 98.50 | 99.00 | 99.17 | 99.25 | 99.30 | 99.33 | 99.37 | 99.42 | 99.46 | 99.47 | 99.47 | 99.50 
3 | 34.12 | 30.82 | 29.46 | 28.71 | 28.24 | 27.91 | 27.49 | 27.05 | 26.60 | 26.50 | 26.41 | 26.12 
4 | 21.20) 18.00 | 16.69 | 15.98 | 15.52 | 15.21 | 14.80 | 14.37 | 13.93 | 13.84 | 13.74 | 13.46 
5 | 16.26 | 13.27 | 12.06 | 11.39 | 10.97 | 10.67 | 10.29 | 9.89 | 9.47| 9.38] 9.29| 9.02 
6 |13.74|10.92| 9.78| 9.15| 8.75) 8.47) 8.10] 7.72] 7.381] 7.23| 7.14] 6.88 
7 |12.25| 9.55} 8.45] 7.85) 7.46) 7.19) 6.84] 6.47] 6.07] 5.99] 5.91] 5.65 
8 11126 8.65] 7.59] 7.01) 6.63) 6.37] 6.03] 5.67] 5.28] 5.2 5.12] 4.86 
9 | 10.56} 8.02] 6.99] 6.42) 6.06) 5.80) 5.47] 5.11] 4.73] 4.651 4.57] 4.31 
10 | 10.04} 7.56] 6.55] 5.99] 5.64] 5.39] 5.06] 4.71) 4.33] 4.25} 4.17] 3.91 
11 965] 7.21} 0.22] 5.67) 5.32) 5.07) 4.74] 4.40] 4.02] 3.94] 3.86] 3.60 
12 9.33] 6.93} 5.05] 5.41) 5.06) 4.82) 4.50] 4.16] 3.78] 3.7 3.02] 3.36 
13 9.07| 6.70} 5.74] 5.21) 4.86) 4.62) 4.30] 3.96] 3.59] 3.51] 3.43] 3.16 
14 δ.δ6 6.51] 5.56] 5.04) 4.70) 4.46) 4.14] 3.80] 3.43] 3.85] 3.27] 3.00 
15 8.68 | 6.36] 5.42] 4.89) 4.56) 4.32) 4.00] 3.67] 3.29] 3.21] 3.13] 2.87 
16 8.53] 0.23] 5.20] 4.77) 4.44) 4.20] 3.89] 3.55] 3.18] 3.1 3.02| 2.75 
17 8.40| 6.111 5.18] 4.67] 4.34) 4.10] 3.79} 3.46] 3.08) 3.0 2.92} 2.65 
18 8.29] 6.011 5.09} 4.58] 4.25) 4.01] 3.71] 3.37] 3.00) 2.92] 2.84] 2.57 
19 8.18} 5.93] 5.01] 4.50] 4.17) 3.94] 3.63] 3.30] 2.92) 2.84] 2.76] 2.49 
20 8.10] 5.85} 4.94] 4.43) 4.10) 3.87) 3.56] 3.23] 2.86] 2.78] 2.69} 2.42 
21 8.02] 5.78) 4.87] 4.37] 4.04) 3.81] 83.51] 3.17] 2.80) 2.72] 2.64] 2.36 
22 7.95| 5.72} 4.82] 4.31) 3.99) 3.76) 3.45] 3.12] 2.75] 2.67) 2.58} 2.31 
23 7.88| 5.060 | 4.76] 4.26) 3.94) 3.71) 3.41] 3.07] 2.70] 2.62] 2.54] 2.26 
24 7.82} 5.61) 4.72) 4.22) 3.90] 3.67] 3.36] 3.03] 2.66) 2.58] 2.49] 2.21 
25 7.77| 5.57) 4.68] 4.18] 3.86] 3.63] 3.32. 2.991 2.62) 2.54] 2.45] 2.17 
26 7.72| 5.53| 4.64] 4.14] 3.82] 3.59] 3.29 | 2.96] 2.58) 2.50] 2.42} 2.13 
27 7.68| 5.49| 4.60} 4.11] 3.78) 3.56] 3.26] 2.93} 2.55) 2.47] 2.38} 2.10 
28 7.64} 5.45} 4.57! 4.07) 3.76) 3.53) 3.23] 2.90] 2.52] 2.44] 2.35] 2.06 
29 7.60) 5.42) 4.54] 4.04] 3.73] 3.50] 3.20 2.87] 2.49) 2.41] 2.33} 2.03 
30 7.56| 5.39] 451] 4.02) 3.70) 3.47) 3.17] 2.84] 2.47] 2.38] 2.30} 2.01 
40 7.31| 5.18) 4.31] 3.83] 3.51] 3.29] 2.99} 2.66] 2.29) 2.20] 2.11] 1.80 
60 7.08| 4.98| 4.13] 3.65] 3.34] 3.12] 2.82} 2.501 2.12) 2.03] 1.94] 1.60 
125 6.84] 4.78} 3.94] 3.48] 3.17) 2.95) 2.66) 2.33] 1.94] 1.85] 1.75] 1.37 
oo 6.68| 4.60| 3.78| 3.32| 3.02) 2.80] 2.51| 2.18| 1.79) 1.70] 1.59| 1.00 
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21.20 Student t Distribution 


For the formula o 


the Student t distribution see 16.2.4.8, p. 824. 


Student t Distribution: Quantile tam or ta/2,m 


Degree 
of Probability o for Two-Sided Problem 
Freedom 
m 0.10 0.05 0.02 0.01 0.002 0.001 
1 6.31 12.7 31.82 603.7 318.3 637.0 
2 2.02 4.30 6.97 9.92 22.33 31.6 
3 2.85 3.18 4.54 5.84 10.22 12.9 
4 2.3 2.78 3.75 4.60 7.17 8.61 
5 2.01 2.57 3.97 4.03 5.89 6.86 
6 1.94 2.45 3.14 3.71 5.21 5.96 
i 1.89 2.36 3.00 9.00 4.79 5.40 
8 186 2.31 2.90 3.36 4.50 5.04 
9 1.83 2.26 2.82 3.25 4.30 4.78 
0 181 2.23 276 3.17 4.14 4.59 
1 1.80 2.20 2./0 3.11 4.03 4.44 
2 1.78 2.18 2.68 3.05 3.93 4.32 
3 1.77 216 2.65 3.01 3.85 4.22 
4 1.76 214 2.02 2.98 3.79 4.14 
5 1.75 2.13 2.60 2.95 3.73 4.07 
16 15 232 2.58 292 3.69 4.01 
7 174 2.11 2.57 2.90 3.65 3.96 
8 1.73 210 2.55 2.88 3.6 3.92 
9 1.73 2.09 2.54 2.86 3.58 3.88 
20 1.73 2.09 2.53 2.85 3.55 3.85 
21 1.72 2.08 2.52 2.83 3.53 3.82 
22 1.72 2.07 251 2.82 3.5 3.79 
23 1.71 2.07 2.50 2.81 3.49 3.77 
24 1.71 2.06 2.49 2.80 3.47 3.74 
25 1.71 2.06 2.49 2.79 3.45 3.72 
26 1.71 2.06 2.48 2.78 3.44 3.71 
27 1.71 2.05 247 2.77 3.42 3.69 
28 1.70 2.05 2.46 2.76 3.40 3.66 
29 1.70 2.05 2.46 2.76 3.40 3.66 
30 1.70 2.04 2.46 2.75 3.39 3.65 
40 1.68 2.02 242 2.70 3.31 3.55 
60 1.67 2.00 2.39 2.66 3.23 3.46 
120 1.66 1.98 2.06 2.62 3.17 3.37 
oo 164 1.96 2.83 258 3.09 3.29 
0.05 0.025 0.01 0.005 0.001 0.0005 


Probability o for One-Sided Problem 


21.21 Random Numbers 1199 


21.21 Random Numbers 


For the meaning of random numbers see 16.3.5.2, p. 843. 


4730 1530 [8004 |7993 |3141 [0103 |4528 |7988 |4635 |8478 |9094 |9077 |5306 |4357 |8353 
0612 |2278 |8634 |2549 |3737 |7686 [0723 |4505 |6841 |1379 |6460 |1869 |5700 |5339 |6862 
0285 |1888 |9284 |3672 |7033 |4844 [0149 |7412 |6370 |1884 |0717 |5740 |8477 |6583 |0717 
7768 |9078 |3428 |2217 |0293 |3978 [5933 |1032 |5192 |1732 |2137 |9357 |5941 16564 |2171 
4450 [8085 [8931 |3162 |9968 |6369 |1256 [0416 |4326 |7840 |6525 [2608 |5255 |4811 |3763 


7332 |6563 |4013 [7406 |4439 [5683 [6877 [2920 |9588 [3002 [2869 |3746 [3690 169591 |1230 
4044 1643 [9005 |5969 |9442 |7696 |7510 |1620 |4973 |1911 11255 [6160 |9797 |8755 |6120 
0067 |7697 |9278 |4765 [9647 |4364 |1037 |4975 |1998 |1359 |1346 |6125 |5078 |6742 |3443 
5358 |5256 |7574 [3219 |2532 |7577 [2815 |8696 |9248 |9410 |9282 |6572 |3940 |6655 |9014 
0038 |4772 [0449 [6906 |8859 |5044 [8826 |6218 |3206 [9034 [0843 |9832 |2703 |8514 |4124 


8344 |2271 |4689 [3835 |2938 [2671 14691 |0559 |8382 |2825 |4928 |5379 |8635 |8135 |7299 
7164 |7492 |5157 |8731 |4980 |8674 |4506 |7262 |8127 |2022 |2178 |7463 |4842 |4414 |0127 
7454 |7616 |8021 [2995 |7868 [0683 [3768 [0625 |9887 |7060 [0514 |0034 |8600 |3727 |5056 
3454 |6292 [0067 |5579 [9028 |5660 [5006 |8325 |9677 |2169 [3196 |0357 |7811 |5434 |0314 
0401 |7414 |3186 [3081 |5876 |8150 |1360 |1868 |9265 |3277 |8465 |7502 |6458 |7195 |9869 


2 
2 
7 
2 
3 
6202 |0195 |1077 [7406 |4439 [5683 16877 2920 |9588 [3002 |2869 |3746 13690 |2705 |625 
8284 |0338 |4286 |5969 9442 |7696 |7510 |1620 |6973 |1911 |1288 |6160 |9797 |1547 |4972 
1 
9 
9 
4 
5 
9 


9056 |0151 |7260 |4765 [9647 |4364 |1037 |4975 |1998 |1359 |1346 |6125 |5078 |3424 |1354 
9747 |3840 |7921 [3219 |2532 |7577 |2815 [8696 |9248 |9410 |9282 |6572 |3940 |8969 |3659 
2992 |8836 |3342 [6906 |8859 |5044 |8826 |6218 |3206 |9034 |0843 |9832 |2703 |5225 |8898 


6170 |4595 |2539 [7592 |1339 |48 922 [8877 [9530 [6499 |6432 11510 
3265 |8619 |0814 [5133 |7995 |8030 |7408 |2186 |0725 |5554 |5664 |6791 [9677 |3085 |8319 
0179 |3949 [6995 |3170 9915 [6960 12621 [6718 |4059 9919 |1007 |6469 |5410 10246 |3687 
1839 |6042 9650 |3024 |0680 |1127 |8088 |0200 |5868 |0084 |6362 |6808 |3727 |8710 |6065 
2276 |8078 |9973 |4398 |3121 |7749 |8191 [2087 |8270 |5233 |3980 |6774 |8522 |5736 |3132 


4146 [9952 |7945 |5207 |1967 [7325 |7584 |3485 [5832 |8118 [8433 10606 |2719 |2889 |2765 
3809 |5523 [0648 |3326 |1933 |6265 [0649 |6177 |2139 |7236 |0441 |1352 |1499 |3068 
7825 [7012 [9934 |7022 [2260 [0190 |1816 |7933 |2906 |3030 16032 |1685 |3100 |1929 
j [9286 |5051 14051 |1580 [5004 |8981 |1950 |2201 [3852 |6855 |5489 [6386 |3736 |0498 
7959 |5983 |0204 |4325 |5039 |7342 |7252 |2800 |4706 |6881 |8828 |2785 |8375 |7232 |2483 


6 
0 
6 
9 
8245 |9611 [0641 [7024 |3899 [8981 11250 |5678 |8096 |7010 |1435 |7631 [7361 [8903 |8684 
1 
3 
1 
5 


5751 |3785 |7125 


4915 |2913 [9031 |9735 |7820 |2478 19200 |7269 |6284 |9861 |2849 |2208 |8616 |5865 
2744 |7318 |7614 |5999 |1246 |9759 [6565 |1012 |0059 |2419 |0036 |2027 |5467 |5577 
4521 |5070 |4150 |5059 |5178 |7130 |2641 |7812 |1381 [6158 |9539 |3356 |5861 |9371 
3305 |3814 [0973 |4958 |4830 |6297 [0575 |4843 |3437 |5629 |3496 |5406 |4790 |9734 
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oscillations, 84 
sine function, 77 
spectrum, 787 
analysis 
unctional, 654 
harmonic, 982, 992 
multi-scale, 803 
numerical, 949 
statistical, 832 
ime frequency, 803 

vector, 701 
angle, 129 

acute angle, 130 


heorem for the Euclidean algorithm, 374 
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between vectors, 190 Anosov diffeomorphism, 889 
central, 131, 141 anticommutativity, vector product, 184 
chord and tangent, 140 antiderivative, 480 
circumference, 140 antikink soliton, 607 
convex angle, 130 antilog, 10 
curves, plane, 246 antisoliton, 606 
curves, space, 264 Apollonius 
directional, 146 circle, 743 
escribed circle, 141 theorem, 200 
exterior, 140 apothem, 133 
full angle, 130 applicate, 210 
geodesy, 146 approximate equation, 549 
inclination, 432 approximate formula, empirical curve, 108 
interior, 140 approximation, 982 
lines, plane, 130, 224 asymptotic, polynomial part, 15 
lines, space, 223 best, Hilbert space, 675 
measure in degrees, 131 bicubic, 1000 
notion, 129 Chebyshev, 988 
nutation, 215 ó function, 777 
obtuse angle, 130 formulas, series expansion, 472 
perigon, 131 in mean, 456, 984 
plane, 152 Liouville theorem, 4 
plane, notion, 129 numbers, 4 
planes, space, 220 partial differential equations, 977 
precession, 215 problem, 674 
radian measure, 131 solution by extreme value, 456 
reduction, 175 successive 
right angle, 130 Banach space, 679 
rotation angle, 215 differential equations, ordinary, 549 
round angle, 130 integral equation, 625 
secant and tangent, 140 uniform, 988 
slope, 432 using given functions, 974 
angent, 245 Weierstrass theorem, 665 
solid angle, 152 arc, 131, 161 
straight angle, 130 ellipse, 201 
tilt angle, 144 graph, 401 
vertical angle, 144 chain, 410 
angles hyperbola, 204 
adjacent, 130 intersection, 150 
alternate, 130 length 
at parallels, 130 circular segment, 141 
Cardan, 214, 295 line integral, first type, 517 
complementary, 130 parabola, 206 
corresponding, 130 plane curve, 141, 502 
Euler, 215, 296 space curve, 264, 517 
exterior—interior, 130 sequences, 410 
names, 129 Archimedean spiral, 105 
opposite, 130 area 
sum annulus, 141 
plane triangle, 133 sector, 141 
spherical triangle, 165 circle, 140 
supplementary, 130 circular sector, 141 
vertex, 130 circular segment, 141 
angular cosine, 93 
coefficient, curve third degree, 68 cotangent, 94 
coefficient, plane, 195 curved surface, 535 
frequency, 84 curvilinear bounded, 501 
annuity, 23, 24 curvilinear sector, 501 
calculation, 25 double integral, 527 
payment, 25 ellipse, 201 
perpetual, 25 formula, Heron's, 144 
annulator, 682 function, 93 


annulus, 141 hyperbola, 204 
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parabola, 206 
parallelogram, 135 

with vectors, 190 
planar figures, 501 
polyeder with vectors, 190 
polygon, 194 
rectangle, square, 136 
rhombus, 136 
similar plane figures, 134 
sine, 93 
subset, 693 
surface patch, 265 
tangent, 94 
triangle, 194 

plane, 142, 144 


open set, 664 
pseudonorm, 682 
scalar product, 673 
vector space, 654 
ordered, 659 
axis 
abscissae 
plane coordinates, 190 
space coordinates, 210 
ordinates 
plane coordinates, 190 
space coordinates, 210 
parabola, 204 
azimuth, 161 
azimuthal equation, 599 


spherical, 165, 169 
argument backward substitution, 956 
function of one variable, 48 Baire, second Baire category, 874 
function of several variables, 118 Bairstow method, 954 


arithmetic, 1 ball, metric space, 663 
sequence, 18 Banach 
Arnold tongue, 907 fixed-point theorem, 689 
arrangement, 805, 806 space, 670 
with repetition, 806 example, 670 
without repetition, 806 series, 670 
arrow heorem, continuity, inverse operator, 679 
diagram, 331 band structure, coefficient matrix, 960 
function, 331 barrel, 159 
article number, European, 384 circular, 159 
ASCII (American Standard Code for Information Inter- arabolic, 159 
change), 1001 base, 7 
associative law ower, 7 
Boolean algebra, 396 vector, 187 
generalized imaginary units, 290 reciprocal, 186, 187 
matrices, 272 vector space, 366 
missing, 356 basis, 657 
scalar product, 185 Clifford algebra, 289 
vector product, 185 contravariant, 285 
propositional logic, 324 covariant, 284 
quaternions, 290 Lie Algebra, 357 
sets, 330 orthonormal, 358 
tensors, 282 vector 
astroid, 104, 527 contravariant, 284 
asymptote covariant, 284 
curve, 249, 252 vector space, 657 
definition, 252 Bayes theorem, 810 
attractor, 860 B-B representation 
chaotic, 888 curve, 1000 
examples, 887 surface, 1000 
fractal, 888 bending, 237 
Hénon, 886, 888 Berge theorem, 409 
hyperbolic, 888 Bernoulli 
Lorenz, 887, 892 inequality, 30 
quantitative description, 876 numbers, 465 
solenoid, 888 shift mapping, 876 
strange, 888 shift, chaotic, 889 
autocorrelation function, 878 Bernoulli-l’ Hospital rule, 56 
axiomatization, probability theory, 809 Bernstein polynomial, 1000 
axioms Bessel 
algebra, 672 differential equation, 562 
closed set, 664 differential equation, linear, zero order, 783 


metric spaces, 662 function, 562 
normed space, 669 imaginary variables, 563 
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modified, 563 
functions 
spherical, 564 
inequality, 676 
beta function, 1098 
biangle, spherical, 163 
bifurcation, 604 
Andronov-Hopf, 894 
Bogdanov-Takens, 896 
codimension, 892 
cusp, 895 


double semistable periodic orbits, 898 


flip, 898 

global, 892, 901 
homoclinic, 902 

Hopf, 894 
generalized, 897 

local, 892 


mappings, subcritical saddle node, 898 


pitchfork, 899 
supercritical, 896 
saddle node, 893 
transcritical, 894 
bilinear, 356 
bilinear form, 368 
antisymmetric, 368 
positive definite, 368 
symmetric, 368 
binary number, 876, 1002 
binary system, 1002 
binomial, 71 
coefficient, 13 
distribution, 815 
formula, 12 
linear, 71 
quadratic, 71 
theorem, 12 
inormal, space curve, 257, 259 
iquaternion, 290 
normalized, 306 
biquaternions, 306 
multiplication, 306 
rigid-body motion, 306 
irth process, 829 
bisection method, 319 
bisector, 143 
triangle, 133 
it, 1001 
reversing the order, 995 
block, 153 


Bolzano theorem 
one variable, 61 
several variables, 124 
Bolzano-Weierstrass property, 686 
Boolean 
algebra, 395, 807 
finite, 397 
expression, 397 
function, 324, 397 
n-ary, 397 
variable, 397 
Borel 


body of revolution, lateral surface, 502 


set, 694 

c algebra, 694 
bound 

function, 51 

sequence, 457 
boundary 

collocation, 978 

condition, 610 


uncertain for some variables (fuzzy), 422 


boundary value 
conditions, 540, 569 
problem, 540, 569, 973 
Hilbert, 651 
Hilbert, homogeneous, 651 
Hilbert, inhomogeneous, 652 
homogeneous, 569 
inhomogeneous, 569 
linear, 569 
bounded set (order-bounded), 660 
boundedness of a function 
one variable, 61 
several variables, 124 
Box-Müller method, 934 
brackets, Lie, 356, 596 
Bravais lattice, 349 
break of symmetry, bifurcation, 897 
Breit-Wigner curve, 791 
Brodetsky-Smeal method, 45 
Brouwer fixed-point theorem, 691 
business mathematics, 21 
byte, 1001 


calculation 
complex numbers, 36 


coordinates, polar and rectangular, 146 


determinants, 278 
numerical 
accuracy, 1006 
basic operations, 1005 
polynomials, shift register, 364 
rule 
irredicible polynomials, 363 
quaternions, 292 
tensors, 282 
triangle, oblique plane, 142 
triangle, plane, 144 
triangle, right-angled plane, 142 
triangle, spherical, 171 
calculus 
differentiation, 432 
errors, 848 
integral, 480 


observations, measurement error, 848 


propositional, 323 
variations, 610 


canonical form, circle-mapping, 905, 907 


Cantor function, 907 

Cantor set, 882, 883, 887 
cap, spherical, 158 

capacity, dimension, 883 
capacity, edge, 411 
Caratheodory conditions, 689 
Cardan angles, 214, 295 
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definition, 214 

direction cosines, 215 

quaternions, 298 

rotation matrix, 215, 295 
Cardano formula, 41 
cardinal number, 328, 335 
cardinality, set, 335 
cardioid, 99 
carrier function, 633 
Carson transformation, 769 
Cartesian 

coordinates 

plane, 190 
space, 210 

folium, 96 
cartography scale, 145 
cascade, period doubling, 901, 905 
Cassinian curve, 100 
category, second Baire category, 874 
catenary curve, 89, 107, 613 
catenoid, 89 
Cauchy 

integral, 650 

integral formula, 748 

integral formulas, application, 754 


method, differential equations of higher order, 555 


principal value, improper integral, 507, 510 
principle, 666 
problem, 572 
sequence, 665 
theorem, 443 
Cauchy-Riemann differential equations, partial, 732 
Cauchy-Riemann operator, 306 
Cayley 
table, 337 
theorem, 339, 408 
center 
circle, 198 
curvature, 248 
spherical, 176 
center manifold theorem 
differential equations, 892 
mappings, 897 
center of area method, 427 
center of gravity, 216 
arbitrary planar figure, 506 
arc segment, 505 
closed curve, 506 
double integral, 527 
line integral, first type, 517 
method, 426 
generalized, 427 
parametrized, 427 
trapezoid, 506 
triangle, 133 
triple integral, 532 
center of mass, 193, 216 
central 
angle, 13 
curves, 207 
field, 702 
surface, 224 
central limit theorem, Lindeberg—Levy, 825 


chain, 334, 404 
directed, elementary, 410 
graph, 410 
elementary, 410 
Markov, 826 
stationary, 826 
time-homogeneous, 826 
rule, 702 
composite function, 435 
stochastic, 825, 826 
chaos, 857, 904 
attractor, strange, 888 
from torus to chaos, 904 


one-dimensional mapping, 889 


routes to chaos, 892 
through intermittence, 905 
transition to chaos, 901 
character 
group element, 343 
representation of groups, 344 
characteristic (logarithm), 10 
characteristic strip, 573 
Chebyshev 
approximation, 988 
continuous, 988 
discrete, 991 
inequality, 31 
polynomial formula, 88 
polynomials, 989 
theorem, 489 
Chinese postman problem, 406 
Chinese remainder theorem, 379 
x? distribution, 822 
X? test, 835, 836 
Cholesky 
decomposition, 958 
method, 314, 958 
chord, 141 
theorem, 139 
circle 
Apollonius, 742 
area, 139 
center, 198 
chord, 139 
circumference, 139 
convergence, 750 
curvature, 248 
plane curve, 246 
dangerous, 150 
definition, 139, 198 
equation 


Cartesian coordinates, 198 


polar coordinates, 199 
great circle, 160, 174 
intersection, 160 
mapping, 906, 9077 


parametric representation, 199 


periphery, 139 

plane, 139, 198 

radius, 139, 198 

small circle, 160, 176 

tangent, 139, 199 
circuit 
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directed, graph, 410 combination, 805 
Euler circuit, 405 with repetition, 805 
Hamilton circuit, 406 without repetition, 805 
circuit integral, 521 combinatorics, 805 
being zero, 523 commensurability, 4 
circular commutative law 
field, 705 Boolean algebra, 396 
point, 267 matrices, 272, 273 
sector, 141 missing, quaternion multiplication, 290 
segment, 141 propositional logic, 324 
circumcircle sets, 330 
quadrangle, 136 vectors, 273 
triangle, 133, 143 commutativity 
circumscribing groups, 353 
quadrangle, 137 scalar product, 184 
triangle, 133 commutator, 356, 367, 596 
cissoid, 96 comparable function, 617 
Clairaut complement, 329 
differential equation, ordinary, 546 algebraic, 278 
differential equation, partial, 574 fuzzy, 421 
class fuzzy set, 418 
defined by identities, 395 orthogonal, 674, 682 
equivalence class, 334 sets, 328 
midpoint, 834 Sugeno complement, 421 
statistics, 832 Yager complement, 421 
Clebsch-Gordan complementary angles formulas, 80 
coefficient, 345 completeness relation, 676 
series, 345 completion, metric space, 668 
theorem, 345 complex analysis, 731 
Clifford algebra, 289 complex function, 48 
Clifford numbers, 290 pole, 733 
closure complex numbers, 34 
closed linear, 674 generalized, 290 
linear, 655 mapping, plane, 745 
set, 665 plane, 34 
transitive, 332 complex-valued function, 48 
clothoid, 107 complexification, 659 
code, 383 composition, 424 
ASCII (American Standard Code for Information max-average, 424 
Interchange), 1001 max-(t norm), 424 
BCH, 386 compound interest, 22 
cyclic, 386 calculation, 22 
error correcting, 385 computation of adjustment, 982 
linear, 385 computer 
public key, 392 basic operations, 1005 
RSA Code, 392 error of the method, 1007 
codimension, 892 error types, 1004, 1006 
coding, 388, 392 internal number representation, 1003 
coefficient, 11 internal symbol representation, 1001 
Clebsch-Gordan, 345 numerical accuracy, 1006 
Fourier, 474 numerical problems in calculation, 1004 
leading, 38 use of computers, 1001 
matrix, extended, 957 computer algebra systems, 1023 
metric, 186, 187 differential and integral calculus, 1042 
vector decomposition, 183 graphics, 1045 
collinearity, vectors, 185 manipulation of algebraic expressions, 1036 
collocation purpose, 1023 
boundary, 978 computer graphic 
domain, 978 Mathematica, 1045 
method, 634, 974, 978 quaternions, 302 
points, 974, 978 concave, curve, 246 
column pivoting, 957 conchoid 


column sum criterion, 960 Nicomedes, 97 
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general, 98 

of the circle, 98 

of the line, 98 
conclusion, 425 
concurrent expressions, 398 
condition 


boundary value, ordinary differential equation, 540 


Caratheodory conditions, 689 
Cauchy (convergence), 123 
Dirichlet (convergence), 475 
initial value, ordinary differential equation, 540 
Kuhn-Tucker 
global, 925 
local, 925 


Lipschitz, higher-order differential equations, 551 


Lipschitz, ordinary differential equation, 541 
number, 959 
regularity condition, 941 
cone, 157, 225, 671 
central surface, 228 
circular, 157 
generating, 660 
imaginary, 228 
normal, 671 
regular, 671 
solid, 671 
truncated, 157 
vector space, 657 
confidence 
interval, 841 
regression coefficient, 841 
variance, 838 
limit 
for the mean, 837 
prescription, 853 
probability, 837 
region, 841 
congruence 
algebraic, 377 
corners, 152 
linear, 378 
method, 844 
plane figures, 134 
polynomial congruence, 380 
quadratic, 379 
relation, 394 
kernel, 394 
simultaneous linear, 379 
system simultaneous linear, 379 
theorems, 134 
congruent 
directly, 133 
indirectly, 134 
mapping, 133, 134 
conic section, 158, 206, 207 
singular, 207 
conjugate complex number, 36 
conjunction, 323 
elementary, 399 
consistency 
integration of differential equation, 972 
order p, 972 
constant 


Euler, 513 
polynomials, 62 
propositional, 323 
term, 308 
constants 
important natural, table, 1053 
mathematical, frequently used, 1053 
continuation, analytic, 751 
continued fractions, 3 
continuity 
composite functions, 61 
elementary function, 60 
from below, 694 
function 
one variable, 58 
several variables, 124 
Holder, 649 
continuous, absolutely, 697 
continuum, 335 
contour integral, vector field, 721 
contracting principle, 666 
contraction, 282 
tensor, 287 
transformation (geometric), 236 
contradiction, Boolean function, 397 
control by step size, 935 
control digit, 383 
convergence 
absolute, 462, 469 
complex terms, 750 
alternating series test of Leibniz, 463 
Banach space, 670 
circle of convergence, 750 
condition of Cauchy, 53 
conditional, 462 
complex terms, 750 
dominated, 697 
in mean, 475 
infinite series, complex terms, 749 
integration of differential equation, 972 
non-uniform, 468 
order p, 972 
sequence of numbers, 45 
complex terms, 749 
series, 460, 462 
complex terms, 749 
uniform, 468, 469 
uniformly, function sequences, 664 
weak, 687 
Weierstrass criterion, 468 
convergence criterion 
alternating series test of Leibniz, 463 
comparison criterion, 460 
D’Alembert’s ratio test, 461 
integral test of Cauchy, 462 
necessary, 460 
root test of Cauchy, 461 
sufficient, 460 
uniformely convergence, Weierstrass, 468 
convergence theorem, 459 
measurable function, 697 
conversion, number systems, 1002 
convex, curve, 246 
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convolution 
Fourier transformation, 789 
Laplace transformation, 773 
one-sided, 789 
two-sided, 789 
Z-transformation, 796 
coordinate inversion, 287 
coordinate line, 210, 284 
coordinate surface, 210, 284 
coordinate system 
cartesian, 210 
curvelinear, 210 
cylindrical polar, 211 
double logarithmic, 117 
Gauss-Krueger, 144 
left-hand, 209 
local, 234, 238 
object, 229, 234 
orientation, 209 
orthogonal, 181 
orthonormal, 181 
plane, 190 
projection, 238 
right-hand, 209 
semilogarithmic, 116 
Soldner, 144 
spatial, 209 
spherical polar, 211 
transformation, 280 
world-, 234 
coordinate transformation, 229, 286, 706 
2-dimensional, 231 
3-dimensional, 234 
equation 
central curves second order, 207 
quadratic curve (parabolic), 208 
coordinates 
affine, 186 
axis, 190 
backward, 890 
barycentric, 982 
Cartesian, 187 
plane, 190 
space, 210 
contravariant, 188, 287 
covariant, 188, 287 
curvilinear, 191, 261, 284 
three dimensional, 210 
cylindrical, 211 
vector field, 706 
Descartes, 190 
equation, space curve, 259 
forward, 890 
Gauss, 261 
Gauss-Kriiger, 162 
geodetic, 144 
geographical, 162 
homogeneous, 231, 355 
mixed, 286 
point, 190 
polar, plane, 191 
polar, spherical, 211 
representation with scalar product, 188 


Soldner, 162 
spherical, 211 
vector field, 706 
ransformation, 191 
between orthogonal coordinates, 212 
Cartesian to polar, 192 
riangle coordinates, 982 
vector, 183 
coprime, 5 
corner 
convex, 152 
figure, 152 
symmetric, 152 
rihedral, 152 
correction form, 961 
corrector, 971 
correlation, 839 
analysis, 839 
coefficient, 840 
empirical, 840 
cosecant 
hyperbolic, 89 
trigonometric, 78 
geometric definition, 131 


coset 
left, 338 
right, 338 
cosine 


hyperbolic, 89 
geometric definition, 132 
trigonometric, 77 
cosine law, 143 
geometric definition, 131 
law for sides, 166 
rule, spherical triangle, 166 
cosine integral, 513 
cotangent 
hyperbolic, 89 
trigonometric, 77 
geometric definition, 131 
Coulomb field, point-like charge, 705, 727 
counterpoint, 160 
course angle, 161 
covariance, two-dimensional distribution, 840 
covering transformation, 336 
covering, open, 860 
Cramer rule, 311 
credit, 22 
criterion 
convergence 
sequence of numbers, 458 
series, 459 
series with positive terms, 460 
uniform, Weierstrass, 468 
divisibility, 373 
subspace, 367 
cross product, 184 
cryptanalysis, classical, methods, 389 
Kasiski-Friedman test, 390 
statistical analysis, 390 
cryptography, conventional 
methods, 388 
linear substitution ciphers, 389 
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substitution, 388 curve (continued I) 
monoalphabetic, 388 area tangent, 94 
monographic, 388 astroid, 104 
polyalphabetic, 388 asymptote, 249, 252 
polygraphic, 388 asymptotic point, 250 

transposition, 388 B-B representation, 1000 

cryptology, 386 cardioid, 99 

classical Cartesian folium, 96 
Hill cypher method, 389 Cassinian, 100 
matrix substitution method, 389 catenary, 107 
Vigenere cypher method, 389 cissoid, 96 

cryptosystem, 387 clothoid, 107 

DES algorithm, 393 concave, 246 

Diffie-Hellman key exchange, 391 conchoid of Nicomedes, 97 

encryption convex, 246 
context free, 387 corner point, 250 
context sensitive, 387 cosecant, 78 

IDEA algorithm, 393 cosine, 77 

mathematical foundation, 387 cotangent, 77 

one-time pad, 390 curvature, 246 

one-way function, 391 cuspidal point, 250 

public key methods, 391 cycloid, 101, 102 

RSA method, 392 damped oscillation, 85 

security of cryptosystems, 388 directing, 156 

subject, 386 double point, 250 

crystal class, 350 empirical, 108 

crystal system, 350 envelope, 254 

crystallography epicycloid, 102 
lattice, 348 epitrochoid, 104 
symmetry group, 348 equation 

cube, 154 complex form, 760 

curl, 727 plane, 195, 243 

density, 728 second degree, 206 

field second order, 206 
pure, 727 space, 218 
zero-divergence field, 727 error curve, 73 

line, 714 evolute, 254 

curvature evolvent, 254, 255 

center, 248 of the circle, 106 

circle, 248 exponential, 72 

curves on a surface, 265 ourth order, 97 

Gauss surface, 267 Gauss error curve, 819 

mean, surface, 267 general discussion, 253 

minimal total curvature, 997 hyperbolic cosine, 89 

plane curve, 246 hyperbolic cotangent, 90 

radius, 248 hyperbolic sine, 89 
curve on a surface, 265 hyperbolic tangent, 90 
principal, 265 hyperbolic type, 70, 72 

space curve, 258 hyperbolice spiral, 105 

surface, 265, 267 hypocycloid, 103 
constant curvature, 267 hypotrochoid, 104 

total, 261 imaginary, 195 

curve inflection point, 249 

algebraic, 95, 195 involute, 255 
n-th order, 252 isolated point, 250 

arc cosine, 86 Koch curve, 883 

arc cotangent, 86 emniscate, 101 

arc sine, 86 ength, line integral, first type, 517 

arc tangent, 86 ogarithmic, 73 

Archimedean spiral, 105 ogarithmic spiral, 106 

area cosine, 93 Lorentz curve, 95 

area cotangent, 94 multiple point, 251 


area sine, 93 normal curve, plane, 244 
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curve (continued IT) d'Alembert formula, 591 

n-th degree, 65, 195 damping parameter, 962 
n-th order, 65, 195 damping, oscillations, 85 
parabolic type, 65 Darboux vector, 261 
Pascal limaçon, 98 data type, 394 
plane, 243 De Morgan 

direction, 243 law, 330 

vertex, 250 rule, 324 
quadratic, 206 Boolean algebra, 396 
radius of curvature, 246, 247 death process, 829 
representation with splines, 996 debt, 23 
secant, 78 decay, radioactive, 829 
second degree, 206 decima 
second order, 206 number, 1002 
semicubic parabola, 95 normalized, 1005 
sine, 7T representation, 1002 
space, 256 system, 1002 
spherical, 160, 174, 702 decoding, 392 
spiral, 105 decomposition 
strophoide, 97 orthogonal, 674 
tacnode, 250 partial fraction, 778 
tangent, 77 partial fractions, 15 

plane, 244 QR, matrix, 959 
terminal point, 250 singular value, 321 
third degree, 67 theorem, higher order differential equations, 554 


third order, 95 
tractrix, 108 
transcendent, 195 
trochoid, 102 
witch of Agnesi, 95 


curves 
family of, envelope, 255 
plane 
angle, 246 


second order 
central curves, 207 
parabolic curves, 208 
polar equation, 208 
spherical, intersection point, 180 
cut, 329 
Dedekind's, 334 
fuzzy sets, 417 
set, 329 
cutting plane method, 942 
cycle, 404 
chain, 410 
limit, 895 
cycloid, 101 
basis, 101 
common, 101 
congruent, 101 
curtate, 102 
prolate, 102 
cylinder, 156, 227 
circular, 156 
elliptic, 227 
hollow, 157 
hyperbolic, 227 
invariant signs, 229 
parabolic, 227 
cylindrical coordinates, 211 
cylindrical function, 562 
cylindrical surface, 156, 218 


triangle, matrix, 955 
vectors, 183 
decyphering, 392 
Dedekind cut, 334 
defect, 974, 978 
vector space, 367 
definite 
negative, 317 
positive, 317, 958 
defuzzification, 426 
degeneracy of states, 598 
degree 
curve, second degree, 206 
curve, n-th degree, 195 
homogeneity, 122 
in-degree, 401 
matrix, 408 
measure in degrees, 131 
out-degree, 401 
Delambre equations, 168 
6 distribution, 699 
ó function, 694, 699 
application, 777 
approximation, 777 
Dirac, 774 
ó functional, 681 
density function, 812 
multidimensional, 814 
dependence, linear, 308, 366 
deposit 
in the course of the year, 22 
regular, 22 
single, 22 
depreciation, 26 
arithmetically declining, 26 
digital, 27 
geometrically declining, 27 
straight-line, 26 
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derivative 
complex function, 731 
constant, 433 
directional, 708 
scalar field, 708 
vector field, 708 
distribution, 700 
exterior, 435 
fraction, 435 
Fréchet, 690 
function 
composite, 435 
elementary, 433 
implicit, 436 
inverse, 436 
parametric form, 437 
several variable, 445 
generalized, 699 
higher order, 438, 448 
inverse function, 440 
parametric form, 440 
interior, 435 
left-hand, 433 
logarithmic, 435 
mixed, 448 
one variable, 432 
partial, 445 
product, 434 
quotient, 435 
right-hand, 433 
scalar multiple, 433 
Sobolev sense, 699 
space, 708 
sum, 433 
table, 434 
vector function, 701 
volume differentiation, 709 


Derive (computer algebra system), 1023 


Descartes rule, 45 
descendant (offspring), 933 
descent method, 931 
determinant, 278 
differentiation, 279 
evaluation, 279 
functional, 123, 691 
Jacobian, 123, 691 
multiplication, 279 
reflection, 279 


rotation, transformation properties, 214 


rules of calculation, 278 
Wronskian, 553, 862 
zero value, 279 
determination of extrema 
absolute extremum, 445 
implicit function, 445 
deviation, standard deviation, 813 
devil's staircase, 907 
diagonal matrix, 270 
diagonal method, Maxwell, 744 
diagonal strategy, 957 
diameter 
parabola, 205 
circle, 140 


conjugate 
ellipse, 200 
hyperbola, 203 

ellipse, 200 

hyperbola, 203 


diffeomorphism, 858 


Anosov, 889 
orientation-preserving, 906, 907 


difference 


di 


di 


di 
di 
di 


di 


di 


bounded, 419, 420 
finite expression, 973 
sets, 330 
symmetric, 330 
significant, 839 
Z-transformation, 796 
erence equation, 794 
boundary value, 800 
inear, 798 
artial differential equations, 976 
second-order, 799, 800 
erence method, 973 
artial differential equations, 976 
erence quotient, 963 
erence schema, 18 
erentiability 
complex function, 731 
unction of one variable, 432 
unction of several variables, 447 


erentiable 
continuously, 690 
Fréchet, 690 
erential 
arc 
plane, 243 
surface, 263 
complete, 447 
first-order, 433 
higher order, 447 
integrability, 521 
notion, 446 
artial, 447 
rincipal properties, 447 
quotient (see also derivative), 432 
second-order, 449 
otal, 447, 448 
n-th order, 449 
second-order, 449 


differential calculus, 432 


undamental theorem, 441 


differential equation, 540 


boundary value problem, 569 
eigenfunction, 569 
eigenvalue, 569 

eigenvalues, 597 

ow, 857 

Fourier transformation, 791 
Laplace transformation, 781 
numerical solution, 550 
operational notation, 555 
order, 540 

orthogonality, 570 

Riccati 


with respect to the initial conditions, 857 
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normal form, 544 
ordinary, 544 
self-adjoint, 569 
singular solution, 542 
stiff, 973 
topological equivalence, 870 
Weber, 601 
differential equation, linear 
autonomous, on the torus, 863 
constant coefficients, 555 
first-order, 861 
fundamental theorem, 861 
homogeneous, 861 
inhomogeneous, 861 
matrix-differential equation, 861 
non-autonomous, on the torus, 906 
periodic coefficients, 863 
second order, 560 
Bessel, 562 
Hermite, 568 
hypergeometric, 567 
Laguerre, 568 
Legendre, 565 
method of unknown coefficients, 560 
second-order system, 560 
differential equation, ordinary, 540 
approximate integration, 969 
autonomous, 860 
Bernoulli, 544 
boundary value conditions, 540 
boundary value problem, 540 
center, 548 
central point, 548 
Clairaut, 546 
direction field, 540, 541 
element, singular, 546 
exact, 542 
existence theorem, 540 
explicit, 540 
first-order, 540 
approximation methods, 549 
important solution methods, 542 
flow, 857 
fraction of linear functions, 547 
general solution, 540 
general, n-th order, 540 
graphical solution, 550 
homogeneous, 542 
implicit, 540, 545 
initial value conditions, 540 
initial value problem, 540 
integral, 540 
integral curve, 541 
integral, singular, 546 
integrating factor, 543 
Lagrange, 545 
linear, 792 
constant coefficients, 781 
first-order, 543 
variable coefficient, 782 
linear, planar, 857, 871 
multiplier, 543 
non-autonomous, 860 


notion, 540 
particular solution, 540 
point, singular, 546 
radicals, 546 
ratio of arbitrary functions, 548 
separation of variables, 542 
series expansion, 549 
singular point, 547 
solution, 540 
successive approximation, 549 
van der Pol, 895 
differential equation, partial, 571 
approximate integration, 976 
Cauchy-Riemann, 584, 732 
characteristic system, 572 
Clairaut, 574 
completely integrable, 576 
eigenfunction, 598 
electric circuit, 585 
elliptic type, 576 
constant coefficients, 578 
Euler, calculus of variations, 612 
field theory, 729 
first-order, 571 
canonical systems, 573 
characteristic strip, 573 
characteristic system, 571 
inear, 571 
non-linear, 573 
non-linear, complete integral, 573 
otal differentials, 576 
wo variables, 575 
Fourier transformation, 792 
Hamilton, 861, 903 
heat conduction equation 
one-dimensional, 583, 585 
hree-dimensional, 591 
Helmholtz, 594 
hyperbolic type, 576, 578 
integral surface, 572 
Laplace, 729 
Laplace transformation, 783 
inear, 571 
ongitudinal vibrational bar, 580 
normal system, 574 
notion, 540 
arabolic type, 576 
constant coefficients, 578 
Poisson, 592, 726, 729 
quasilinear, 571 
reduced, 893 
reduced, normal form, 893 
second-order, 576 
constant coefficients, 578 
separation of variables, 597 


ultra-hyperbolic type, constant coefficients, 578 


vibrating membrane, 581 
vibrating string, 579 

differential equations, higher order, 550 
constant coefficients, 553 
decomposition theorem, 554 
Euler, n-th order, 557 
fundamental system, 553 
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linear, n-th order, 553 dihedral angle, 152 
lowering the order, 552, 554 dihedral group, 336 
quadrature, 554 dimension, 882 
reduction to a system of differential equations, 550 capacity, 883 
superposition principle, 554 correlation, 885 
system of solutions, 551 defined by invariant measures, 884 
variation of constants, 554 Douady-Oesterlé, 886 
differential equations, linear embedding, 890 
second order ormula, 367 
defining equation, 561 generalized, 885 
differential equations, partial, non-linear, 603 Hausdorff, 882, 886 
first-order, 573 information, 884 
non-linear waves, 604 Lie algebra, 353 
pattern, periodic, 604 ower point-wise, 884 
Schroedinger, 606 Lyapunov, 885 
differential operation matrix-Lie group, 353 
review, 717 measure, 884 
vector components, 718 metric, 882 
differential operator Renyi, 885 
divergence, 712, 719 upper point-wise, 884 
gradient, 710, 719 vector space, 366, 657 
Laplace, 716, 719 Dirac 
nabla, 715, 719 distribution, 699 
non-linear, 690 matrices, 290 
relations, 719 measure, 694, 876 
rotation, 713, 719 operator, 306 
rules of calculations, 717 heorem, 406 
space, 708, 715 directing curve, 156 
vector gradient, 711 direction 
differential transformation, affine, 734 cosines, space, 212 
differentiation, 432 lane curve, 243 
complex function, 731 rojection, 237 
composite function, 450 space, 212 
function space curve, 256 
elementary, 433 direction cosines 
implicit, 436 Cardan angles, 215 
inverse, 436 Euler angles, 216 
one variable, 432 directional 
parametric form, 437 angle, 146 
several variables, 445 erivative, 710 
graphical, 437 directrix 
higher order ellipse, 200 
inverse function, 440 hyperbola, 202 
parametric form, 440 arabola, 204 
implicit function, 451 Dirichlet 
logarithmic, 435 condition, 475 
matrix, 276 roblem, 582, 729 
several variables, 450 discontinuity, 58 
under the integration sign, 512 unction, 58 
volume, 709 removable, 59 
differentiation rules discount, 21 
basic rules, 433 discretization error 
derivative of higher order, 438 global, 972 
function ocal, 972 
one variable, 433 discretization step interval, 963 
several variables, 433, 448 scriminant, 576 
table, 439 isjoint, 329 
vector function, 701 tion, 323 
vectors, 701 elementary, 399 
diffusion coefficient, 592 dispersion, 813 
diffusion equation, 609 dissolving torus, 904 
diffusion equation, three-dimensional, 591 distance, 504 


digon, spherical, 163 Hamming distance, 662 
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line-point, 196 
metric space, 662 
planes, 221 
parallel, 221 
point-line, space, 222 
point-plane, space, 220 
pherical, 160 
wo lines, space, 223 
wo points, 192 
space, 217 
distribution, 698, 699, 777 
binomial, 815 
x?, 822 
x?, table, 1135 
continuous, 818 
erivative, 699 
Dirac, 699 
discrete, 814 
exponential, 820 
Fisher, 823 
frequency, 833 
function, 811 
function, continuous, 812 
hypergeometric, 814, 816 
logarithmic normal, 819 
lognormal distribution, 819 
marginal, 814 
measurement error density, 848 
normal, 818 
Poisson, 817 
regular, 699 
standard normal, 819 
Student, 824 
t distribution, 824 
telephon-call, 829 
theory, 774 
Weibull, 821 
distributive law 
Boolean algebra, 396 
matrices, 272 
propositional logic, 324 
ring, field, 361 
sets, 330 
tensors, 282 
distributivity 
vector product, 185 
left sided, 368 
right sided, 368 
divergence, 726 
central field, 713 
definition, 712 
different coordinates, 712 
improper, 458 
proper, 458 
remark, 709 
sequence of numbers, 458 
series, 462 
theorem, 724 
vector components, 718 
vector field, 712 
divisibility, 370 
criteria, 372, 373 
division 
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complex numbers, 37 
computer calculation, 1005 
external, 193 
golden section, 194 
harmonic, 193 
in extreme and mean ratio, 194 
internal, 193 
line segment, plane, 193 
polynomial, 14 
ring, 363 
quaternions, 293 
rational numbers, 1 
segment, space, 217 
divisor, 370 
greatest common (gcd) 
integer numbers, 373 
inear combination, 374 
polynomials, 14 
positive, 372 
zero, 361 
dodecahedron, 156 
domain, 119 
closed, 119 
convergence, function series, 467 
doubly-connected, 119 
function, 48 
image, set, 49 
integrity, 361 
multiply-connected, 119 
non-connected, 119 
of attraction, 860 
of individuals, predicate calculus, 326 
open, 119 
operator, 658 
set, 49 
simply-connected, 119, 747 
hree or multidimensional, 120 
wo-dimensional, 119 
values, set, 49 
dot product, 184 
double integral, 524 
application, 527 
notion, 524 
double line, 207 
duality, 396 
linear programming, 919 
non-linear optimization, 926 
principle, Boolean algebra, 396 
heorem, strong, 926 
dualization, 396 
Duhamel formula, 782 
dynamical system, 857 
continuous, 857 
discrete, 858, 871 
reconstruction space, 889 
time discrete, 857 
volume contracting, 858 
volume pressing, 858 


eccentricity, numerical 
curve second order, 207 
ellipse, 199 
hyperbola, 202 
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parabola, 204 
edge 
angle, 152 
figure, 152 
graph, 401 
length, 403 
valuation, 403 
multiple, 401 
sequence, 404 
cycle, 404 
directed circuit, 404 
isolated edge, 404 
open, 404 
path, 404 
effective rate, 22 
eigenfunction, 598 
differential equation, 569 
integral equation, 623, 628 
normalized, 570 
eigenvalue, 314, 597 
differential equation, 569 
integral equation, 623, 628 
numerical calculation, 319 
operator, 680 
eigenvalue problem 
general, 314 
matrices, 314 
special, 314 
eigenvector, 283, 314, 680 
Einstein’s summation convention, 280 
element, 327 
finite, 979 
generic, 874 
linearly independent, 656 
neutral, 335 
positive, 659 
lement of area, plane, table, 527 
ement of surface, curved, table, 534 
ement of volume, table, 532 
ementary cell 
crystal lattice, 348 
non-primitive, 349 
primitive, 349 
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lementary volume 
arbitrary coordinates, 531 
Cartesian coordinates, 529 
cylindrical coordinates, 530 
spherical coordinates, 530 
elements of curved surfaces, 534 
elements of plane surfaces, 527 
elimination method, Gauss, 312, 955 
e 
e 


imination step, 956 
lipse, 199 
arc, 201 
area, 201 
diameter, 200 
equation, 199 
focal properties, 200 
focus, 199 
perimeter, 201 
radius of curvature, 200 


ementary formula, predicate logic, 326 
ementary surface, parametric form, 534 


semifocal chord, 199 
tangent, 200 
transformation, 207 
vertex, 199 
ellipsoid, 224 
central surface, 228 
cigar form, 224 
imaginary, 228 
lens form, 224 
of revolution, 224 
surface second order, 228 
embedding, 890 
canonical, 684 
dimension, 890 
encoding schemes, 381 
encoding, RSA code, 392 
encyphering, 386 
endomorphism, 659 
endomorphism, linear operators, 659 
endpoint, 401 
energy 
particle, 594 
spectrum, 594 
system, 593 
zero-point translational energy, 598 
zero-point vibration energy, 603 
entropy, 879, 880 
generalized, 885 
metric, 879 
opological, 879, 889 
envelope, 254, 255 
epicycloid, 102 
curtate, 104 
rolate, 104 
epitrochoid, 104 
equality 
asymptotic, 472 
complex numbers, 35 
matrices, 272 
equality relation, 10 
equation, 10 
algebraic, 38 
iquadratic, 42 
Boussinesq, 609 
Burgers, 609 
characteristic, 315, 547 
cubic, 64 
curve 
plane, 195, 243 
second degree, 206 
second order, 206 
defining, 561 
degree, 38 
degree one, 39 
diffusion, three-dimensional, 592 
Diophantine, 375 
inear, 376 
ellipse, 199 
evolution, 605 
heat conduction 
hree-dimensional, 591 
Hirota, 609 
homogeneous, 687 
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equation (continued) 
hyperbola, 202 
inhomogeneous, 687 
irrational, 39 
Kadomzev-Pedviashwili, 609 
Korteweg de Vries (KdV), 604, 605 
modfied, 608 
Kuramoto-Sivashinsky (KS), 604 
line in a plane, 195 
line in space, 221 
logistic, 858, 899 
non-linear 
fixed point, 949 
numerical solution, 949 
non-linear evolution, 605 
non-linear Schroedinger (NLS), 606 
normal form, 38 
operator equation, 687 
parabola, 205 
Parseval, 475, 570, 676 
pendulum, 904 
plane 
curve, 195 
general, 218 
Hessian normal form, 219 
intercept form, 219 
polynomial 
numerical solution, 952 
quadratic, 64 
root, definition, 38 
Schroedinger 
inear, 592 
non-linear (NLS), 604 
sine-Gordon (SG), 607 
Sinh-Gordon, 609 
solution, general, 38 
space curve, 218, 256 
vector form, 256 
sphere, 261 
surface 
normal form, 224 
second order, 228 
space, 217, 261 
surface normal, 264 
tangent plane, 264 
term algebra, 395 


vector, 188 
wave, three-dimensional, 590 
equations 


algebraic, 38 

general properties, 43 
degree four, 42 
degree one (linear), 39 
degree three (cubic), 40 
degree two (quadratic), 40 
degree n, 43 

solution, 45 
Delambre, 168 
exponential, solution, 46 
hyperbolic functions, solution, 47 
L'Huilier, 169 
logarithmic, solution, 46 
Mollweide, 143 


Neper, 169 
lane, 218 
real coefficients, 44 
reducing transcendental to algebraic, 45 
ranscendental, 38 
rigonometric, solution, 46 
equilibrium point, 857 
hyperbolic, 864 
equivalence, 323 
class, 334 
ogic, 324 
roof, 5 
relation, 333 
Eratosthenes sieve, 370 
ergodic system, Birkhoff theorem, 877 
ergodic theory, dynamical systems, 877 
error 
absolute, 852, 1004 
absolute maximum error, 852 
apparent, 851 
average, 849-852 
computer calculation, 1006 
defined, 853 
density function, 848 
discretization, 1007 
equation, 958 
estimation, iteration method, 962 
input error, 1006 
least mean squares, 475 
mean, 849, 851 
square, 851, 852 
normally distributed, 849 
percentage, 852 
probable, 849-852 
propagation, 854 
relations between error types, 850 
relative, 852, 1004 
relative maximum error, 853 
round-off, 1007 
single measurement, 850 
standard, 851 
true, 850 
truncation error, 1007 
type 1 error rate, 813 
type, measurement errors, 848 
error analysis, 856 
error calculus 
direct problem, 1006 
inverse problem, 1006 
error curve, 73 
error estimation, mean value theorem, 442 
error function, 514 
Gauss, 819 
error integral, Gauss, 514 
error orthogonality, 974 
error propagation law, Gauss, 855 
error types, computer calculation, 1006 
ess. sup (essential supremum), 697 
estimate, 831 
Euclidean 
algorithm, 3, 14 
polynomial rings, 363 
group, 354 
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scaled, 354 
norm, 367 


R? (3 dimensional vector space), 289 


ΓΕ’ (4 dimensional vector space), 289 


vector norm, 276 
vector space, 367 
Euler 

angles, 215 

broken line method, 969 

circuit, 405 

constant, 513 

differential equation 
n-th order, 557 
variational calculus, 612 


formula (curvature of a surface), 265 
formulas (Fourier representation), 474 


function (theory of numbers), 381 
graph, 405 
integral, first kind, 1098 
integral, second kind, 512, 1098 
numbers, 466 
polygonal method, 969 
relation, 36 
complex numbers, 758 
theorem (polyeder), 155 
trail, 405 
open, 405 
Euler angles, 296 
definition, 215 
direction cosines, 216 
rotation matrix, 216, 296 
Euler-Hierholzer theorem, 405 
event, 807 
certain, 807 
complete system of, 808, 810 
elementary, 807 
impossible, 807 
independent, 810 
random, 807 
set of events, 807 
simple, 807 
evolute, 254 
evolution 
equation, 605 
function, 605 
evolution algorithms, 933 
evolution principles, 933 
evolution strategies, 933 
algorithms, 933 
application, 934 
classification, 934 
from some populations, 936 
(u + A) strategy, 935 
(u, A) strategy, 935 
mutation, 933 
mutation-selection 
mutation step, 934 
selection step, 935 
step size determination, 935 
strategy, 934 
populations, 935 
recombination, 933 


selection, 933 


wi 


pressure, 936 
th recombination, 936 


evolvent, 254, 255 


of 


circle, 106 


excess, spherical triangle, 165 
exchange theorem, 448 
exchange, cyclic, sides and angles, 142 
excluded middle, 324 
existential quantifier, 326 
expansion 
Fourier expansion, forms, 477 
Laplace expansion, 278 
Laurent expansion, 752 
Maclaurin, 472 
Taylor, 442, 471 
expectation, 813 
expectation value (quantum mechanical), 595 
pected value, 813 
bivariate distribution, 840 
two-dimensional distribution, 840 
exponent, 7 
exponential distribution, 820 
ponential function, 72 
complex, 739 
general, 759 
natural, 758 
quaternions, 293 
onential integral, 514 
onential sum, 74 


ex] 


ex] 


ex] 
ex] 
ex] 


pression 


algebraic, 10 
analytic, 49 


ex] 


in 


ra 


ver 


domain, 49 

explicit form, 49 
implicit form, 49 
parametric form, 50 


Boolean, 397 
concurrent, 398 


plicit form, 49 


finite, 973 


partial differential equations, 976 


implicit form, 49 


egral rational, 11 


irrational, 11, 17 
arametric form, 50 
ropositional logic, 323 


ional, 11, 14 


semantically equivalent, 398 
autology, 325 
ranscendent, 11 


ctor analysis, 718 


extension field, 362 

extension principle, 421 

extension theorem of Hahnium, 683 
extension, algebraic, 364 
extension, linear functional, 682 
extraction of the root 

complex numbers, 38 

real numbers, δ 
extrapolation principle, 967 
extremal, radius of curvature, 616 
extreme value, 51, 443 
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absolute, 443 

determination, 444 
higher derivatives, 444 
side conditions, 456 
sign change, 444 

function, 454 

relative, 443 


ace, corner, 152 
actor algebra, 394 
actor group, 339 
actor ring, 363 
actorial, 13 
generalization of the notion, 515 
actoring out, 11 
Falk scheme, 273 
Feigenbaum constant, 900, 901 
FEM (finite element method), 978 
FFT (fast Fourier transformation), 993 
Fibonacci 
explicite formula, 375 
numbers, 375, 908 
recursion formula, 375 
sequence, 375 
field, 361 
algebraically closed, 364 
axial field, 703 
central symmetric, 702 
circular, 705 
complex numbers, 364 
conservative, 721 
Coulomb field, point-like charge, 705, 727 
cyclotomic, 364 
cylinder symmetric, 703 
extension field, 363 
extensions, 362 
finite, 363 
flow, 723 
function, 741 
Galois, 363 
gravitational field, point mass, 728 
Newton field, point-like mass, 705 
over-field, 362 
pole, 703 
potential, 721 
scalar field, 702 
kew field, 361 
rings, 361 
source field, 726 
spherical field, 702 
splitting 
polynomial, 363 
vector field, 701 
field theory 
basic notions, 701 
ifferential equations, partial, 729 
fields, superposition, 728 
finite difference method, 588 
finite element method, 588, 978 
fitting problem 
ifferent versions, 456 
linear, 958 
non-linear, 109 


ορ 


fixed point 
conformal mapping, 735, 736 
flip bifurcation, 899 
fixed-point number, 1003 
floating-point number, 1003 
IEEE standard, 1004 
Mathematica, 1025 
semilogarithmic form, 1003 
Floquet 
representation, 865 
theorem, 863 
flow 
differential equation, 857 
edge, 411 
scalar field, 723 
vector field 
scalar flow, 723 
vector flow, 723 
fluctuation 
random, 825 
focal line, tractrix, 108 
focus, 869 
compound, 894 
ellipse, 199 
hyperbola, 201 
arabola, 204 
saddle, 864 
saddle focus, 869 
stable, 864 
form 
ilinear, 368 
negative definite, 317 
normal 
generating, 318 
Jordan, 319 
ositive definite, 317 
quadratic, 958 
real positive definite, 318 
transformation, 317 
transformation, principal axis, 317 
real quadratic, 317 
saddle, 267 
semidefinite, 317 
sesquilinear, 369 
formula 
inomial, 12 
Cardano, 41 
closed, predicate logic, 326 
d'Alembert, 591 
Duhamel, 782 
Euler (curvature of a surface), 265 
Heron's, 144 
interpretation, predicate logic, 326 
Kasterinian, 565 
Kirchhoff, 590 
Leibniz, 438 
Liouville, 554, 862 
manipulation, 1023 
de Moivre 
complex number, 38 
generalized, 293 
hyperbolic functions, 92 
quaternions, 293 
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formula (continued) 
trigonometric functions, 81 
Parseval, 789 
Pesin, 881, 888 
Plemelj and Sochozki, 651 
Poisson, 591 
predicate logic, 326 
Rayleigh, 565 
rectangular, 964 
Riemann, 584 
Simpson, 965 
Stirling, 515 
tangent, 143 
tautology, 327 
Taylor 
one variable, 442 
several variables, 450 
trapezoidal, 964 
formulas 
basic, plane trigonometry, 142 
Darboux, 261 
Euler (Fourier representation), 474 
Frenet, 261 
half-angle (plane trigonometry), 143 


half-angle (spherical trigonometry), 166 


predicate logic, 326 
four group, Klein's, 339 
four vector 
homogeneous coordinates, 355 
quaternions, 290 
Fourier analysis, 474 
Fourier coefficient, 474, 992 
asymptotic behavior, 475 
determination, 456 
numerical methods, 477 
Fourier expansion, 474 
complex functions, 479 
forms, 477 
symmetries, 476 
Fourier integral, 478, 784 
complex representation, 784 
equivalent representations, 784 
Fourier series, 474 
best approximation, 675 
complex representation, 475 
Hilbert space, 675 
Bessel inequality, 676 
Fourier sum, 475, 992 
complex representation, 993 
Fourier transformation, 784 
addition law, 788 


comparing to Laplace transformation, 790 


convolution, 789 
two-sided, 773 
definition, 785 
differentiation 
image space, 788 
original space, 788 
discrete complex, 993 
fast, 993 
Fourier cosine transformation, 786 


Fourier exponential transformation, 786 


Fourier sine transformation, 786 


requency-shift theorem, 788 
integration 
image space, 788 
original space, 789 
inverse, 785 
inearity law, 788 
ocalization property, no, 801 
shifting theorem, 788 
similarity law, 788 
spectral interpretation, 786 
survey, 769 
ables, 786 
ransform, 790 
fractal, 882 
fractile, 812 
fraction 
continued, 3 
decimal, 1 
improper, 15 
proper, 15 
fractional part of z, 50 
frames (wavelet transformations), 803 
Fréchet 
derivative, 690 
differential, 690 
Fredholm 
alternative, 687 
alternative theorem, 629 
integral equation, 621, 667 
first kind, 635 
solution method, 627 
theorems, 627 
Frenet formulas, 261 
frequency, 77 
angular/radial, 84 
distribution, 833 
locking, 907 
relative, 808 
sine, 84 
spectrum, 787 
continuous, 478 
discrete, 478 
statistics, 808, 833 
cumulative, 833 
Fresnel integral, 757 
frustum 
cone, 157 
pyramid, 154 
function, 48 
algebraic, 62 
complex, 758 
amplitude function, elliptic, 763 
analytic, 732 
analytic representation, 49 
explicite form, 49 
implicit form, 49 
parametric form, 50 
area, 93 
arrow, 331 
autocorrelation, 878 
Bessel, 562 
modified, 563 
beta function, 1098 
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function (continued I) 


Boolean, 324, 397 
bounded, 51 
circular, 131 
comparable, 617 
complement, 421 
complex, 48, 731 
algebraic, 758 
bounded, 733 
linear, 735 
linear fractional, 736 
quadratic, 737 
square root, 737 
complex-valued, 48 
complex-variable, 731 
composite, 63 
derivative, 435 
intermediate variable, 435 
continuity 
in interval, 58 
one-sided, 58 
piecewise, 58 
continuous, complex, 731 
cosecant, 78 
cosine, 77 
cotangent, 77 
cyclometric, 85 
cylindrical, 562 
density, 812 
dependent, 122 
discontinuity, 58 
removable, 59 
discrete, 794 
distribution, 811 
double periodic, 763 
elementary, 62 
elementary, transcendental, 758 
elliptic, 490, 753, 763 
entire rational, 62 
error function, 514 
Euler (theory of numbers), 381 
even, 51 
exponential, 62, 72 
complex, 739 
natural, 758 
quaternions, 293 
extreme value, 51 
function series, 467 
gamma, 514 
generalized, 698, 699, 777 
Green, 586 
Hamilton, 574, 861 
harmonic, 729, 732 
Heaviside, 700 
Hermite, 674 
holomorphic, 732 
homogeneous, 122 
homographic, 62, 66 
hyperbolic, 89, 759 
geometric definition, 131, 132 
quaternions, 294 
impulse, 774 
increment, 972 


function (continued II) 


independent, 122 
integrable, 494 
absolutely, 508, 511 
integral rational, 62 
first degree, 63 
n-th degree, 65 
second degree, 64 
third degree, 64 
inverse, 52 
complex, hyperbolic, 759 
complex, trigonometric, 759 
derivative, 436 
derivative of higher order, 440 
existence, 61 
hyperbolic, 93 
rigonometric, 63, 85 
irrational, 62, 71 
Jacobian, 763 
Lagrangian, 925 
Laguerre, 674 
Laplace, 729 
limit, 53 
at infinity, 54 
infinity, 53 
iterated, 124 
eft-hand, 54 
right-hand, 54 
Taylor expansion, 57 
heorems, 55 
linear, 62, 63 
linear fractional, 62, 66 
local summable, 698 
logarithm, complex, 738 
logarithmic, 63, 73 
quaternions, 294 
MacDonald, 563 
matrix-exponential, 352, 862 
mean value, 497 
measurable, 695 
measuring, 889 
meromorphic, 753, 763, 779 
monotone 
decreasing, 50 
increasing, 50 
strictly, 50 
non-elementary, 62 
notion, 48 
objective 
linear programming, 909 
selection, 933 
odd, 51 
of angle, 76 
one variable, 48 
order of magnitude, 57 
parametric form 
derivative, 437 
derivative of higher order, 440 
periodic, 52, 776 
piecewise continuous, 58 
point of discontinuity, 58 
finite jump, 59 
tending to infinity, 59 
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function (continued III) 
positive homogeneous, 616 
power, 71 
primitive, 480 
pure, 1031 
quadratic, 62 
random variable, 811 
rational, 62, 66 
real, 48 
regular, 732 
Riemann, 584 
sample function, 831 
secant, 78 
several variables, 48, 118 
sign of, 50 
sine, 76 
special fractional linear fractional, 66 
state, 594 
statistic, 831 
step, 794 
stream, 741 


sum of linear and fractional linear functions, 738 


summable, 696 
switch, 399 
system 
orthogonal, 985 
orthonormal, 985 
tangent, 77 
theta, 764 
transcendental, 62 
trigonometric, 63, 76, 759 
geometric definition, 131 
quaternions, 293 
truth, 323, 324 
wave, statistical interpretation, 594 
Weber, 562 
Weierstrass, 765 
function theory, 731 
functional, 611, 677 
definition, 48 
linear, 366, 658, 659 
linear continuous, 681 
L? space, 682 
functional determinant, 123, 285, 526 
functions 
analytic, 732 


Bessel 
1. kind, 564 
2. kind, 564 


table, 1106 
Bessel, complex argument, 564 
Bessel, spherical, 564 
complex argument, 564 
Neumann, spherical, 564, 565 
spherical, 564 
fundamental form 
first quadratic, of a surface, 263 
second quadratic, of a surface, 266 
fundamental formulas 
spherical trigonometry, 165 
fundamental laws, set algebra, 329 
fundamental matrix, 862-864 
fundamental problem 


first, triangulation, 148 
second, triangulation, 149 
fundamental space, 699 
fundamental system 
differential equation, higher order, 553 
fundamental theorem 
Abelian groups, 339 
algebra, 43 
elementary number theory, 371 
integral calculus, 495 
future value, 25 
fuzzy 
control, 427 
inference, 425 
linguistics, 414 
logic, 413 
logical inferences, 425 
product relation, 424 
relation, 422 
relation matrix, 423 
system, 430 
systems, applications, 427 
valuation, 422 
fuzzy set 
aggregation, 418 
aggregation operator, 423 
complement, 418 
composition, 424 
cut (representation theorem), 417 
degree, 417 
empty, 416 
intersection, 418 
intersection set, 419 
level set, 417 
normal, 417 
peak, 416 
similarity, 417 
subnormal, 417 
subset, 416 
support, 413 
tolerance interval, 416 
union, 418 
universal, 416 
fuzzy sets 
cut, 417 
intersection, 419 
union, 419 


Gabor transformation, 803 
Galerkin method, 974 
Galois field, 363 
gamma function, 512, 514 
Gauss 
algorithm, 312, 956 
coordinates, 261 
curvature, surface, 267 
elimination method, 312, 955 
error curve, 819 
error function, 819 
error integral, 514 
error propagation law, 855 
integral formula, 725 
integral theorem, 724 
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least squares method, 456, 959, 984 
plane, 34 
step, 312 
transformation, 313, 843, 959 
Gauss-Kriiger coordinates, 162 
Gauss-Newton method, 962 
derivative free, 963 
Gauss-Seidel method, 960 
ged (greatest common divisor), 374 
gcd and lem, relation between, 375 
generating line, 156 
generator, 156 
ruled surface, 226 
generizity 
metric, 891 
geodesic line, 268 
geodesy 
angle, 146 
coordinates, 144 
polar coordinates, 144 
geometric sequence, 19 
geometry, 129 
analytical, 181 
plane, 190 
projections, 237 
space, 209 
ransformatians, 229 
vector algebra, 181 
differential, 243 
plane curves, 243 
space curves, 256 
surfaces, 261 
plane, 129 
plane trigonometry, 142 
spherical trigonometry, 160 
stereometry, 151 
Gimbal Lock case, 295 
Girard theorem, 165 
golden section, 2, 4, 194, 908 
gon, 146 
grades, 131 
gradient 
definition, 710 
different coordinates, 710 
remark, 709 
scalar field, 710 
vector components, 718 
vector gradient, 711, 716 
Graeffe method, 954 
Gram-Schmidt, orthogonalization process, 316 
graph 
alternating way, 409 
arc, 40 
bipartite, 402 
complete, 402 
complete bipartite, 402 
components, 404 
connected, 404, 410 
cycle, 410 
directed, 401 
edge, 401 
circuit, 410 
edge, 401 


Euler, 405 
ow, 41 
increasing way, 409 
infinite, 402 
isomorphism, 402 
oop, 40 
mixed, 401 
non-planar, 410 
artial, 402 
lanar, 410 
lane, 402 
regular, 402 
simple, 401 
special classes, 402 
strongly connected, 410 
subdivision, 410 
subgraph, 402 
transport network, 402 
tree, 402 
undirected, 401 
vertex, 401 
weighted, 403 
graph paper 
double logarithmic, 117 
log-log paper, 117 
notion, 116 
reciprocal scale, 117 
semilogarithmic, 116 
graph theory, algorithm, 401 
gravitational field, point mass, 728 
great circle, 160, 174 
greatest common divisor (gcd), 373 
linear combination, 374 
polynomials, 14 
Green 
function, 586 
integral theorem, 725 
method, 586, 587 
group, 336 
Abelian, 336 
cyclic, 340 
Ds (dihedral), 341 
point group, 340 
affine transformations, 355 
complete, linear 
GL(n, R), 353 
continuous 
dimension, 354 
dihedral, 336, 341 
element, 336 
character, 343 
inverse, 336 
Euclidean, 354 
scaled, 354 
factor group, 339 
four group (Klein’s), 339 
GA(2), 355 
general linear, 352 
GL(n), 353 
Lie group, 351 
definition, 354 
L(n, R), 353 


matrix-Lie, 351 
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group (continued) 
dimension, 353 
multiplicative, cyclic, 364 
O(n), 351 
permutation, 338 
point group, 346 
real, special, linear SL(n , R), 354 
representation, 340 
irreducible, 344 
reducible, 344 
SE(2), 355 
SE(3), 355 
S E(n), 354 
SL(2), 355 
SO(2), 354 
SO(n), 351 
special, affine, 354 
subgroup, 337 
symmetry group, 346 
table, 337 
Cayley table, 337 
grouping, 11 
groups 
applications, 345 
continuous, 353 
definition, 353 
product, 353 
convergence, 353 
direct product, 338 
general linear, 353 
homomorphism, 339 
theorem, 339 
isomorphism, 339 
Lie groups, 351 
matrix-Lie, 351-353 
definition, 353 
special, 354 
growth factor, 22 
Guldin’s first rule, 506 
Guldin’s second rule, 506 


half-angle formulas 
plane trigonometry, 143 
herical trigonometry, 166 
half-line, notion, 129 
half-side formulas, 167 
Hamel basis, 657 
Hamilton 
circuit, 406 
ifferential equation, 875 
ifferential equation, partial, 861, 903 
function, 574, 861 
system, 875 
Hamiltonian, 593 
Hamming distance, 662 
Hankel transformation, 768 
harmonic analysis, 982 
harmonics, spherical (Legendre) 
first kind, 566 
second kind, 567 
Hasse diagram, 334 
heat conduction equation 
one-dimensional, 583, 783 


[^] 


three-dimensional, 591 
Heaviside 
expansion theorem, 779 
function, 700, 757 
unit step function, 774, 1126 
helix, 260 
Helmholtz, differential equation, 594 
Hénon mapping, 858, 872 
Hermite 
differential equation, 568 
polynomial, 568 
polynomials, 602 
trapezoidal formula, 965 
Hessian 
matrix, 931 
normal form 
line equation, plane, 196 
plane equation, space, 219 
hexadecimal 
number, 1002 
system, 1002 
Hilbert 
boundary value problem, 651 
homogeneous, 651 
inhomogeneous, 652 
matrix, 1041 
space, 673 
isomorphic, 676 
pre-Hilbert space, 673 
scalar product, 673 
histogram, 833 
hodograph, 701 
Hólder 
condition, 649 
continuity, 649 
inequality, 32 
integrals, 32 
series, 32 
Holladay, theorem, 997 
holohedry, 350 
homeomorphism 
and topological equivalence, 870 
conjugation, 873 
orientation preserving, 906 
homomorphism, 339, 394 
algebra, 672 
groups, 339 
theorem, 339 
linear operators, 658 
natural, 339, 363 
ring, 362 
theorem, 394 
ring, 362 
vector lattice, 661 
Hopf bifurcation, 894 
Hopf-Landau model, turbulence, 904 
Horner 
rule, 1003 
scheme, 952, 1003 
two rows, 953 
horseshoe mapping, 887 
Householder 
method, 314, 985 
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tridiagonalization, 319 
V’Huilier equations, 169 
hull 

convex, 657 

linear, 655 
hyper surface, 119 
hyperbola, 201 

arc, 204 

area, 204 

asymptote, 202 

binomial, 71 

conjugate, 203 

diameter, 203 

equation, 202 

equilateral, 66, 204 

focal properties, 202 

focus, 201 


radius of curvature, 204 


segment, 204 
semifocal chord, 201 
tangent, 202 
transformation, 207 
vertex, 201 
hyperbolic function, 759 
cosecant, 89 
cosine, 89 
cotangent, 89 


geometric definition, 132 


inverse, complex, 759 
secant, 89 
sine, 89 
tangent, 89 
hyperboloid, 225 
one sheet, 225, 226 
central surface, 228 
two sheets, 225 
central surface, 228 
hyperplane, 683 
of support, 684 
hypersubspace, 683 
hypocycloid, 103 
curtate, 104 
prolate, 104 
hypotenuse, 131 
hypothesis testing, 839 
hypotrochoid, 104 


icosahedron, 156 


ideal, 362 
principal ideal, 362 
idempotence law 
Boolean algebra, 396 


sets, 330 
identically valid, 10 
identity, 10 
Boolean function, 397 
Jacobian, 356 


entity matrix, 271 
IEEE standard, 1004 
IF-THEN rule, 425 
iff (if and only if), 370 


propositional logic, 324 


representation of groups, 342 


image 

function, 49 

set, 49 

space, 49, 767 

subspace, 367 
imaginary 

number, 34 

part, 34 

unit, 34 

units, generalized i, j, k, 290 
immersion, 890 
implication, 323 

proof, 5 
impulse function, 774 
incidence function, 401 
incidence matrix, 403 
incircle 

quadrangle, 137 

triangle, 133, 144 
incommensurability, 4, 866 
increment, 21, 447 
independence 

linear, 308, 656 

path of integration, 521, 747 

potential field, 721 

induction step, 5 
inequality, 28 


arithmetic and geometric mean, 30 


arithmetic and quadratic mean, 30 
Bernoulli, 30 
Bessel, 676 
inomial, 31 
Cauchy-Schwarz, 31 
Chebyshev, 31, 814 
generalized, 32 
different means, 30 
first degree, 33 
Holder, 32 
inear, 33 
first degree, 33 
solution, 33 
Minkowski, 32 
roduct of scalars, 31 
ure, 28 
quadratic, 33 
solution, 33 
Schwarz-Buniakowski, 673 
second degree, 33 
solution, 28 
special, 30 
triangle, 182 
norm, 669 
triangle inequality 
complex numbers, 30 
real numbers, 30 
infimum, 660 
infinite 
denumerable, 335 
non-denumerable, 335 
infinitesimal quantity, 494 
infinity, 1 
infix form, 335 
inflection point, 249, 443, 444 
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initial conditions, 540 
initial value problem, 
inner product, 673 
inscribed circle 
quadrangle, 137 
triangle, 133 


540, 969 


inscribed pentagram, 139 


instability, round-off e 


rror 


numerical calculation, 1007 


insurance mathematic: 
integer 
non-negative, 1 
part of x, 50 
integrability 
complete, 576 
condition, 521 
conditions, 521, 7: 
differential, 521 
quadratic, 637 
integral 


s, 21 


21 


absolutely convergent, 511 
antiderivative, 480 


basic 

notion, 481 
calculus, 480 
Cauchy type, 650 
circuit, 515, 521 
complex 

definite, 745 

indefinite, 746 
complex function 


measurable function, 696 
cosine integral, 513 
elementary functions, 481 
error integral, 514 
Euler integral, second kind, 512, 514 
exponential integral, 514 


Fourier integral, 4 


78, 784 


Fresnel integral, 757 
integral logarithm, 484, 513 
interval of integration, 494 


Lebesgue integral, 


comparison wit. 


507, 696 
h Riemann integral, 506 


limits of integration, 494 


line integral, 515 


first type, applications, 517 


general type, 5 
second type, 51 


logarithm, 484, 513 


lower limit, 494 


non-elementary, 484, 


parametric, 512 


primitive function, 480 
probability integral, 819 
Riemann integral, 494 
comparison with Stieltjes integral, 506 


sine integral, 513, 
Stieltjes integral, 


756 
686 


comparison with Riemann integral, 506 


notion, 506 


surface integral, 532, 723 
triple integral, 527 


upper limit, 494 


volume integral, 527 
integral calculus, 480 
fundamental theorem, 495, 511 
mean value theorem, 497 
integral curves, 858 
integral equation, 621 
Abel, 648 
adjoint, 623, 650 
approximation 
successive, 625 
characteristic, 650 
collocation method, 634 
eigenfunction, 623 
eigenvalue, 623 
first kind, 621 
Fredholm, 621, 667 
degenerate kernel, 635 
first kind, 635 
second kind, 622 
general form, 621 
homogeneous, 621 
inhomogeneous, 621 
iteration method, 625, 642 
xernel, 621 
degenerate, 622 
iterated, 645 
product, 622 
ernel approximation, 632 
tensor product, 632 
inear, 621 
Nystróm method, 631 
orthogonal system, 637, 640 
erturbation function, 621 
quadratically integrable function, 637 
quadrature formula, 630 
second kind, 621 
singular, 648 
Cauchy kernel, 649 
transposed, 623 
Volterra, 621, 667 
convolution type, 645 
first kind, 643 
second kind, 643 
integral equation method, closed curve, 588 
integral exponential function, table, 1093 
integral formula 
Cauchy, 748 
application, 754 
Gauss, 725 
integral logarithm, 484, 513 
table, 1095 
integral norm, 698 
integral surface, 572 
integral test of Cauchy, 462 
integral theorem, 724 
Cauchy, 747 
Gauss, 724 
Green, 725 
Stokes, 725 
integral transformation, 767 
application, 769 
Carson, 768 
definition, 767 
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Fourier, 768, 784 sine function, table, 1083 
Fourier (table), 1114, 1125 table, 1065 
Fourier cosine (table), 1114 tangent function, table, 1091 
Fourier exponential (table), 1127 trigonometric functions, table, 1083 
Fourier sine (table), 1120 integrand, 481, 494 
Gabor, 803 integrating factor, 543 
Hankel, 768 integration 
image space, 767 approximate 
inverse, 767 ordinary differential equation, 969 
kernel, 767 partial differential equation, 976 
Laplace, 768, 770 complex plane, 745 
Laplace (table), 1109 constant, 481 
linearity, 767 unction, non-elementary, 513 
Mellin, 768 graphic, 485 
multiple, 769 graphical, 499 
one variable, 767 in complex, 754 
original space, 767 interval, 494 
several variables, 769 imit 
special, 767 depending on parameter, 512 
Stieltjes, 768 lower, 494 
Walsh, 804 upper, 494 
wavelet, 800, 801 ogarithmic, 483 
fast, 803 numerical, 963 
Z-, 794 multiple integrals, 968 
Z- (table), 1128 artial, 484 
integral, definite, 493 ower, 483 
differentiation, 496 rational functions, 485 
notion, 480 rules 
particular integral, 495 by series expansion, 484 
table, 1098 by substitution, 484 
algebraic functions, 1101 constant multiple rule, 482 
exponential function, 1099 definite integrals, 496 
logarithmic function, 1100 general rule, 482 
trigonometric functions, 1098 indefinite integrals, 483 
integral, elliptic, 490 interchange rule, 497 
first kind, 484, 490, 763 interval rule, 496 
second kind, 490 series expansion, 499, 513 
series expansion, 515 sum rule, 482 
table, 1103 under the integration sign, 512 
third kind, 490 variable, 494 
integral, improper, 493 variable, notion, 481 
Cauchy’s principal value, 507, 510 vector field, 719 
convergent, 507, 509 volume, 503 
divergent, 507, 509 integrator, 500 
infinite integration limits, 506 integrity, domain, 361 
notion, 506 intensity, source, 727 
principal value, 507 interaction, soliton, 604 
unbounded integrand, 506 intercept theorem, 135 
integral, indefinite, 480 interest, 22 
basic integral, 481 calculation, 21 
cosine function, table, 1085 compound, 22 
cotangent function, table, 1091 intermediate value theorem 
elementary functions, 481 one variable, 61 
elementary functions, table, 1065 several variables, 124 
exponential function, table, 1093 intermediate variable, 435 
hyperbolic functions, table, 1092 intermittence, 902, 905 
inverse hyperbolic functions, table, 1097 Internal Number Representation INR, 1003 
inverse trigonometric function, table, 1096 Internationale Standard Book Number ISBN, 383 
irrational functions, table, 1072 interpolation 
logarithmic functions, table, 1095 Aitken-Neville, 983 
notion, 481 condition, 982 
other transcendental functions, table, 1092 formula 


sine and cosine function, table, 1087 Lagrange, 983 
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Newton, 982 
fuzzy system, 430, 431 
knowledge-based, 430 
Lerp, 302 
node, 963 
equidistant, 969 
points, 982 
quadrature, 964 
quaternions, 302 
rotation matrices, 303 
Slerp, 302 
spline, 982, 996 
bicubic, 998 
cubic, 996 
Squad, 303 
trigonometric, 982, 992 
interpretation 
formula, predicate logic, 326 
variable, 324 
intersecting circles, 160 
intersecting plane, 160 
intersection, 329 
angle, 161 
by two oriented lines, 148 
fuzzy set, 418 
on the sphere, 172 
point 
lines, 197 
set, fuzzy set, 419 
sets, 328, 329 
without visibility, 148 
intersection line, 152 
intersection point 
four planes, 221 
plane and line, 223 
spherical curves, 175 
three planes, 220 
two lines, space, 223 
interval 
convergence, 469 
numbers, 2 
order (ο) interval, 660 
rule, 496 
statistics, 832 
invariance 
rotation invariance, cartesian tensor, 283 
transformation invariance, cartesian tensor, 283 
translation invariance, cartesian tensor, 283 
invariant, 765 
quadratic curve, 206 
scalar, 214 
scalar invariant, 185 
surface second order, 228 
inverse, 659 
inverse function, 52 
hyperbolic, 93 
trigonometric, 85 
inverse transformation, 767 
inversion 
Cartesian coordinate system, 287 
conformal mapping, 736 
space, 287 
involute, 255 


isometry, space, 669 
isomorphism, 339, 394 
Boolean algebra, 397 
graph, 402 
groups, 339 
surjective norm, 684 
vector spaces, 659 
iteration, 949 
inverse, 319 
method, 319, 949, 960 
ordinary, 949, 961 
sequential steps, 961 
simultaneous steps, 961 
vector, 321 


Jacobian 
determinant, 123, 285, 691 
function, 763 
identity, 356 
matrix, 691, 962 
method, 319, 960 
joint 
kinematics, robotics, 360 
mechanical, S E(3) application, 359 
Jordan 
matrix, 319 
normal form, 319 
jump, finite, 59 


KAM (Kolmogorov-Arnold-Moser) theorem, 875 
Kasterinian formula, 565 
ker, 394, 659 
kernel, 339 
approximation 
integral equation, 632 
spline approach, 633 
tensor product, 632 
congruence relation, 394 
homomorphism, 394 
integral equation, 621 
degenerate, 632 
iterated, 626, 645 
resolvent, 628 
solving, 626 
integral transformation, 767 
operator, 659 
ring, 362 
subspace, 367 
kink soliton, 607 
Kirchhoff formula, 590 
Klein's four group, 339 
Koch curve, 883 
Korteweg de Vries equation, 604 
Kronecker 
generalized delta, 283 
product, 276 
symbol, 271, 283 
Kuhn-Tucker conditions 
global, 925 
local, 925 
reference, 684 
Kuratowski theorem, 410 


Lagrange 
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function, 456 lattice, 395 
identity, vectors, 186 Banach, 672 
interpolation formula, 983 Bravais, 349 
method of multiplier, 456 crystallography, 348 
theorem, 338 distributive, 395 
Lagrangian, 456, 925 vector, 672 
Laguerre Laurent 
differential equation, linear, second order, 568 expansion, 752 
polynomials, 568 series, 752, 796 
Lanczos method, 319 law 
Landau, order symbol, 57 cosine, 166 
Laplace Gauss error propagation, 855 
differential equation, partial, 592, 729 large numbers, 825 
expansion, 278 Bernoulli, 825 
function, 729 imit theorem, Lindeberg-Levy, 825 
wave equation, 590 sides (cosine), 166 
Laplace operator sine, 166 
different coordinates, 716, 717 sine-cosine, 166 
polar coordinates, 454 polar, 166 
uaternions, 306 tangent, 169 
vector components, 718 aws 
Laplace transformation, 770 propositional calculus, 324 
addition law, 771 ayer, spherical, 158 
comparing to Fourier transformation, 790 cm (least common multiple), 374 
convergence, 770 east common multiple (lcm), integer numbers, 374 
convolution, 773 east squares method, 109, 313, 974, 978, 984 
complex, 774 calculus of observations, 848 
one-sided, 773 Gauss, 456 
definition, 770 regression analysis, 841 
differential equation, ordinary east squares problem, linear, 313, 959 
linear, constant coefficients, 781 Lebesgue 
linear, variable coefficients, 782 integral, 507, 696 
differential equation, partial, 783 comparison with Riemann integral, 506 
differential equations, 781 measure, 694, 876 
differentiation, image space, 772 eft-hand coordinate system, 209 
differentiation, original space, 772 eft-singular vector, 321 
discrete, 796 eg or side of an angle, 129 
division law, 773 Legendre 
frequency-shift theorem, 771 differential equation, 565 
image space, 770 polynomial 
integration, image space, 772 associated, 567 
integration, original space, 772 second kind, 567 
inverse, 770, 778 polynomials 
inverse integral, 780 first kind, 566 
linearity law, 771 symbol, 379 
original function, 770 Leibniz 
original space, 770 alternating series test, 463 
partial fraction decomposition, 778 formula, 438 
piecewise differentiable function, 775 lemma 
series expansion, 779 Jordan, 755 
similarity law, 771 Schur, 345 
step function, 774 lemniscate, 101, 252 
survey, 769 length 
table, 1109 arclenth (radian), 131 
transform, 770 interval, 693 
translation law, 771 line integral, first type, 517 
largest area method, 427 measure of arc 
lateral area space curve, 264 
cone, 157 spherical geometry, 161 
cylinder, 156 reduced, 175 
polyhedron, 153 vector, 190 
lateral face, 152 level 


latitude, geographical, 162, 262 curve, 119 
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line, 119, 703 
surface, 703 
level set (fuzzy set), 417 
library (numerical methods), 1009 
Aachen library, 1011 
IMSL library, 1010 
NAG library, 1009 
Lie algebra, 351, 356 
basis, 357 
Lie group - Lie algebra, connection, 356 
matrix-exponential function, 357 
real, 356 
rigid-body motion, 358 
robotics applications, 358 
special, 356 
vector product, 360 
Lie brackets, 356, 596 
Lie groups, 351 
continuous 
definition, 353 
matrix-Lie groups, 353 
applications, 355, 356 
special, 354 
rigid-body movement 
group SE(3), 355 
robotics, control, 355 


limit 
cycle, 866, 895 
stable, 866 
unstable, 866 
definite integral, 493 
function 
complex variable, 731 
one variable, 53 
several variables, 123 
theorems, 55 
function series, 467 
integration, depending on parameter, 512 
partial sum, 467 
sequence of numbers, 458 
sequence, in metric space, 664 
series, 459 
superior, 469 
Lindeberg-Levy, central limit theorem, 825 
line, 63 
curvature, surface, 266 
geodesic, 160 
imaginary, 207 
notion, 129 
vector equation, 189 
line element 
surface, 263 
vector components, 719 
line equation 
plane, 195 
Hessian normal form, 196 
intercept form, 196 
polar coordinates, 196 
slope, 195 
through a point, 195 
through two points, 196 
space, 221 
line integral, 515 


Cartesian coordinates, 721 
first type, 516 
applications, 517 
general type, 519 
second type, 517 
vector field, 719 
linear combination, vectors, 85, 183, 185 
linear form, 658, 659 
continuous, 681 
linear programming 
basic notions, 911 
constraints, 909 
extreme point, 911 
normal form, 911 
objective function, 909 
linearly 
dependent, 366 
independent, 366 
lines (plane) 
angle between two lines, 197 
intersection point, 197 
orthogonal, 129, 198 
arallel, 129, 198 
encil, 197 
erpendicular, 129, 198 
lines (space), 151 
equations, 218, 221 
intersecting points with planes, 223 
arallelism, 151 
skew, 151 
Liouville 
approximation theorem, 4 
ormula, 554, 862 
heorem, 861 
heorem about the constant, 733 
Lipschitz condition 
differential equations of higher-order, 551 
ordinary differential equation, 541 
locus, geometric, 195 
logarithm 
binary, 10 
Briggsian, 9 
decimal, 9 
definition, 9 
natural, 9, 758 
Neperian, 9 
n x n matrices, 356 
principal value, 758 
quaternions, 294 
table, 10 
taking of a number, 9 
logarithm function, 73 
complex, 738 
conformal mapping, 738 
many valued, 739 
quaternions, 294 


representation of transcendental functions, 759 


logarithmic decrement, 85 
logarithmic normal distribution, 819 
logic, 323 

fuzzy, 413 

predicate, 326 

propositional, 323 
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longitude, geographical, 162, 262 
loop, graph, 401 
Lorentz curve, 791 
Lorenz system, 858, 861, 901 
loxodrome, 178 
arclength, 179 
course angle, 179 
intersection point, 179 
intersection point of two loxodromes, 180 
L? space, 697 
LU factorization (lower and upper triangular matrix), 
956 
Lyapunov 
dimension, 885 
exponent, 880 
function, 863 
stability, 863, 864 
theorem, 863 


MacDonald function, 563 
Maclaurin series expansion, 472 
Macsyma (computer algebra system), 1023 
majorant series, 466 
manifold 
center manifold theorem 
local, 897 
mappings, 897 
of solutions, 955 
stable, 866, 872 
unstable, 866, 872 
mantissa, 10, 1005 
Maple 
input, output, 1019 
numerical mathematics, 1019 
differential equations, 1022 
equations, 1020 
expressions and functions, 1019 
integration, 1021 
short characteristic, 1023 
Maple (computer algebra system), 1023 
mapping, 331, 333 
affine, 230 
anti symmetric, 356 
between groups, 339 
bijective, 658 
bilinear, 356, 368 
complex number plane, 745 
conformal, 734 
circular transformation, 736 
exponential function, 739 
fixed-point, 735, 736 
inversion, 736 
isometric net, 735 
linear fractional, 736 
linear function, 735 
logarithm, 738 
quadratic function, 737 
square root, 737 
sum of linear and fractional linear functions, 738 
contracting, 666 
equivalent, 905 
function, 49, 331 
Hénon mapping, 872 


horseshoe, 887 
injective, 333, 658 
inverse mapping, 333 
ernel, 339 
ifted, 905 
inear, 658 
inear operator, 366 
inear transformation, 658 
modulo, 876 
one to one, 333 
Poincaré, 868, 887 
reduced, 897 
regular, 367 
rotation mapping, 877 
shift, 880 
surjective, 333, 658 
tent, 876 
topological conjugation, 873 
unit circle, 905 
mass 
double integral, 527 
line integral, first type, 517 
triple integral, 532 
matching, 409 
maximal, 409 
perfect, 409 
saturated, 409 
Mathcad (computer algebra system), 1023 
Mathematica (computer algebra system), 1023 
Mathematica 
3D graphics, 1051 
algebraic expressions 
manipulation, 1036 
multiplication, 1036 
apply, 1033 
attribute, 1035 
basic structure elements, 1024 
context, 1035 
curves 
parametric representation, 1050 
two-dimensional, 1049 
differential and integral calculus, 1042 
differential equations, 1044 
differential operator, 1042 
differential quotient, 1042 
differentiation, 1033 
functions, 1042 
eigenvalue problems, 1040 
eigenvalues, 1041 
eigenvectors, 1041 
elements, 1024 
equations 
logical expressions, 1038 
solution, 1038 
transcendental, 1039 
expressions, 1024 
FixedPoint, 1033 
FixedPointList, 1033 
floating-point number, conversion, 1026 
function, inverse, 1033 
functional operations, 1032 
functions, 1031 
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Mathematica (continued) 


graphics 
functions, 1047 
options, 1047 
primitives, 1045, 1046 
heads, 1024 
important applications, 1036 
input, output, 1016, 1023, 1025 
integrals 
definite, 1044 
indefinite, 1043 
multiple, 1044 
rational functions, 1043 
trigonometric functions, 1043 
lists, 1027 
nested, 1028 
manipulation 
formulas, 1023 
non-polynomial expressions, 1038 
manipulation with matrices, 1030 
manipulation with vectors, 1030 
Map, 1033 
matrices as lists, 1029 
messages, 1035 
Nest, 1033 
NestList, 1033 
numerical calculations 
introduction, 1024 
numerical mathematics, 1016 
curve fitting, 1016 
differential equations, 1018 
integration, 1018 
interpolation, 1017 
polynomial equations, 1017 
objects, three-dimensional, 1051 
operators, important, 1026 
partial fraction 
decomposition, 1037 
pattern, 1032 
polynomial equations, 1039 
polynomials 
factorization, 1037 
operations, 1037 
programming, 1034 
series, inverse, 1033 
short characteristic, 1023 
surfaces, 1051 
surfaces and space curves, 1051 
syntax, additional information, 1035 
system description, 1024 
system of equations 
general case, 1040 
special case, 1040 
systems of equations, 1040 
solution, 1038 
types of numbers, 1025 
vectors as lists, 1029 


mathematics, discrete, 323 
Matlab, 1011 


functions, survey, 1011 

numerical integration, 1014 

numerical interpolation, 1013 
curve fitting, 1013 


numerical linear algebra, 1012 

numerical solution 
differential equations, 1015 
equations, 1013 

toolbox, 1012 

matrices 

arithmetical operations, 272 

associative law, 272 

commutative law, 272, 273 

Dirac, 290 

distributive law, 272 

division, 272 

eigenvalue, 314 

eigenvalue problem, 314 

eigenvectors, 314 

equality, 272 

multiplication, 272 

Pauli, 290 

powers, 276 

products, 275 

quaternions, 291 

rules of calculation, 275 

matrix, 269 

adjacency, 403 

adjoint, 269, 278 

admittance, 408 

anti-Hermitian, 271 

antisymmetric, 270 

block tridiagonal, 977 

calculation of inverse matrix, 308 

complex, 269 

conjugate, 269 

deflation, 321 

degree, 408 

diagonal, 270 

differentiation, 276 

distance, 404 

extended coefficient, 957 

full rank, 313 

functional, 691 

fundamental, 862 

Hermitian, 271 

Hessian, 931 

identity, 271 

incidence, 403 

inverse, 274, 278 

Jacobian, 691, 962 

Jordan, 319 

lower triangular, 957 

main diagonal element, 270 

matrix-Lie group, 352 

monodromy, 863, 872 

nxn, 353 

normal, 270 

of coefficients, 308 
augmented, 309 

orthogonal, 275 

projection, 238 

quaternions representation, 291 
complex matrix, 292 
real matrix, 291 

rank, 274 

real, 269 
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matrix (continued) 

rectangular, 269 

regular, 274 

rotation, 231, 275 

space, 212, 215, 216 

rotation, object, 295 

rotations, 294 

scalar, 270 

scaling, 231 

self-adjoint, 27 

shearing, 231 

singular, 274 

size, 269 

skew-symmetric, 270 

spanning tree theorem, 408 

sparse, 977 

spur, 270 

square, 269, 270 

stochastic, 826 

symmetric, 270 

trace, 270 

transformation, 231 

translation, 231 

transposed, 269 

triangle decomposition, 955 

triangular, 271 

unitary, 275 

upper triangular, 957 

zero, 269 
matrix product 
isappearing, 275 
inequality, 273 
matrix-exponential function, 352, 862 
Lie algebra, 357 
special, 352 
matrix-Lie algebra, 3 
group SE(3), 35 
matrix-Lie group 
definition, 353 
imension, 353 
uaternions, 355 
matrix-Lie groups 
applications, 355 
special, 354 
max-min composition, 424 
maximum 
absolute, 51, 61, 125, 443 
global, 51, 443 

local, 51 

relative, 51, 443 
maximum-criterion method, 426 
Maxwell, diagonal method, 744 
mean 

arithmetic, 19, 813, 850 

geometric, 20 

golden, 908 

harmonic, 20 

quadratic, 20 

sample function, 831 

statistics, 833 

value, 19 

weighted, 813, 850, 854 
mean squares problem 


different versions, 456 

linear, 313 

non-linear, 987 

rank deficiency case, 314 
mean value formula, 964 


mean value function, integral calculus, 497 


mean value method, 109 
mean value theorem 
differential calculus, 442 
generalized, 443 
integral calculus, 497 
generalized, 498 
measure, 694 
concentrated on a set, 876 
counting, 694 
degrees, 131 
dimension, 884 
Dirac, 694, 876 
ergodic, 877 


function, convergence theorems, 697 


Hausdorff, 882 
invariant, 876 
Lebesgue, 694, 876 
natural, 876 
physical, 877 
probability, 697 
invariant, 876 
SBR (Sinai-Bowen-Ruelle), 877 
σ algebra, 694 
σ finite, 697 
support, 876 
measured value, 848 
measurement 
error, 848 
error density, 848 
error, characterization, 848 
error, distribution, 848 
rotocol, 848 
measuring function, time series, 889 
median, 143 
sample function, 832 
statistics, 834 
riangle, 133 
Mellin transformation, 768 
Melnikov method, 903 
membership 
degree, 413 
unction, 413, 414 
bell-shaped, 415 
trapezoidal, 414 
relation, 327 
meridian, 162, 262 
convergence, 171 
tangential, 178 
method 
Bairstow, 45, 954 
barrier, 941 
Bernoulli, 954 
bisection, 319 
Box-Miiller, 934 
Brodetsky-Smeal, 45 
broken line, Euler, 969 
center of area, 427 
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method (continued I) method (continued II) 
center of gravity, 426 Riemann, 584 
generalized, 427 Ritz, 618, 974 
parametrized, 427 Romberg, 966 
Cholesky, 314, 958 RSA, 392 
collocation Runge-Kutta, 969 
oundary value problem, 974 separation of variables, 542, 579 
integral equation, 634 shooting, 975 
partial integral equation, 978 single-target, 975 
comparing coefficients, 17 SOR (successive over-relaxation), 961 
cutting plane, 942 statistical experiment, 847 
descent, 931 steepest descent, 931 
difference Steffensen, 951 
ordinary differential equations, 973 successive approximation 
partial differential equations, 976 Banach space, 679 
extrapolation, 970 Picard, 549 
Fibonacci, 930 Sugeno, 428 
finite difference, 588 transformation, 319 
finite element, 588, 619, 978 undetermined coefficients, 17 
Galerkin, 974 variation of constants, 554 
Gauss elimination, 955 Wolfe, 928 
Gauss-Newton, 962 method unknown coefficients 
derivative free, 963 n-th order, 560 
Gauss-Seidel, 960 metric, 662 
Gaussian least squares, 456 Euclidean, 662 
gradient, 619 surface, 265 
Graeffe, 954 metric prefixes, 1054 
Green, 586, 587 Meusnier, theorem, 265 
Hildreth-d'Esopo, 929 midpoint line segment 
Householder, 314, 985 plane, 193 
inner digits of squares, 843 space, 217 
integrating factor, 543 midpoint rule, 971 
interpolation minimal surface, 267 
polynomials, 982 minimum 
quaternions, 302 absolute, 51, 61, 125, 443 
iteration, 319, 949, 955, 960 global, 51, 443 
ordinary, 949, 961 local, 5 
Jacobian, 319, 960 point, 924 
Kelley's, optimization, 942 global, 924 
Lagrange multiplier, 925 local, 924 
Lanczos, 319 relative, 51, 443 
largest area, 427 Minkowski 
least squares, 109, 955, 974, 978, 984, 986 inequality, 32 
Mamdani, 427 integrals, 33 
maximum-criterion, 426 series, 32 
mean-of-maximum, 426 Mises, power method, 319 
Melnikov, 903 mixed product, 188 
Mises, 319 modal value, statistics, 834 
Monte Carlo, 843 mode, statistics, 834 
multi-step, 970 model 
multiple-target, 975 Hopf-Landau, turbulence, 904 
Newton, 950, 962 urn model, 814 
modified, 950 module of an element, 661 
non-linear operators, 690 modulo 
operator, 769 congruence, 334, 377 
orthogonalization, 314, 958, 959 mapping, 876 
parametrized center of area, 427 modulus, analytic function, 732 
pivoting, 307 de Moivre formula 
polygonal, Euler, 969 complex number, 38 
predictor-corrector, 971 generalized, 293 
regula falsi, 951 hyperbolic functions, 92 
regularization, 321 trigonometric functions, 81 


relaxation, 961 Mollweide equations, 143 
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moment of inertia, 282, 505 

double integral, 527 
ine integral, first type, 517 
triple integral, 532 
moment, order n, 813 
monodromy matrix, 863, 864, 872 
monotony, 441, 694 
unction, 50 
sequence, numbers, 457 
Monte Carlo method, 843 
application in numerical mathematics, 845 
usual, 845 
Monte Carlo simulation, 843 

example, 845 
Morse-Smale system, 875 
multi-index, 671 
multi-scale analysis, 803 
multi-step method, 970 
multiple integral, 523 

transformation, 769 
multiple-target method, 975 
multiplication 

complex numbers, 37 

computer calculation, 1005 

matrices, 272, 275 

polynomials, 11 

quaternions, 292, 293 

rational numbers, 1 

rule 

quaternions, 291 
units, generalized imaginary i, j, k, 290 

multiplicity, of divisors, 370 
multiplier, 543, 863, 864, 872 

Lagrange method, 456 
mutation, 933 


nabla operator 
definition, 715 
divergence, 715 
gradient, 715 
quaternions, 305 
rotation, 715 
twice applied, 716 
vector gradient, 715, 716 
NAND function, 325 
nautical, radio bearing, 172 
navigation, 174 
satellite, 304 
negation, 323 
Boolean function, 397 
double, 325 
neighborhood, 663 
point, 663 
Neper 
analogies, 169 
equations, 169 
logarithm, 9 
rules, 170 
Neumann 
functions, spherical, 565 
problem, 729 
series, 626, 645 
Newton 


field, point-like mass, 705 
interpolation formula, 982 
Newton method, 950, 962 
adaptive, 102 
modified, 950 
non-linear operators, 690 
approximation sequence, 690 
modified, 690 
non-linear optimization, 931 
damping method, 932, 962 
nodal plane, 598 
node, 869 
approximation interval, 996 
differential equation, ordinary, 547 
saddle node, 864, 869 
stable, 864 
triple, 896 
nominal rate, 22 
nomography, 125 
alignment charts, 125, 126 
examples with three variables, 126 
alignment nomogram, 126 
key equation, 125 
net charts, 125 
more than three variables, 128 
three variables, 125 
nomogram, 125 
non-negative integer, 1 
NOR function, 325 
norm 
axioms, linear algebra, 276 
Euclidean, 367 
integral, 698 
isomorphism, 684 
matrix, 276 
column sum norm, 277 
row sum norm, 277 
spectral norm, 277 
subordinate, 277 
uniform norm, 277 
operator norm, 677, 862 
pseudonorm, 682 
residual vector, 313 
semi-norm, 682 
s norm, 418 
space, 669 
t norm, 418 
vector, 276 
Euclidean norm, 276, 277 
matrix norm, 277 
sum norm, 277 
normal 
plane curve, 244 
principle (surface), 265 
section (surface), 265 
normal distribution, 818 
logarithmic normal distribution, 819 
observational error, 73 
standard normal distribution, 819 
standard, table, 1133 
two-dimensional, 840 
normal equation, 959, 984, 985 
system, 842, 984 
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normal form, 399 
differential equation, Riccati, 544 
equation of a surface, 224 
Jordan, 319 
principal conjunctive, 399 
principal disjunctive, 399 
normal plane, space curve, 257, 259 
normal vector 
plane, 218 
plane sheet, 722 
surface, 262 
normalization condition, 598 
normalizing factor, 196 
northern direction 
geodesical, 170 
geographical, 170 
notation 
Polish notation, 408 
postfix notation, 408 
prefix notation, 408 
reversed polish, 408 
n tuple, 118 
number, 1 
approximation, 4 
cardinal, 328, 335 
Carmichael, 382 
complex, 34 
absolute value, 35 
addition, 36 
algebraic form, 34 
argument, 35 
division, 37 
exponential form, 36 
field, 364 
main value, 35 
multiplication, 37 
power, 38 
subtraction, 36 
aking the root, 38 
rigonometric form, 35 
composite, 370 
conjugate complex, 36 
coprime, 373 
hyper-complex, 289 
imaginary, 34 
integer, 
irrational, 2-4, 334 


natural, 
non-negative, 1 
pseudoprime, 382 
random, 830 
rational, 1, 3 
real, 2 
taking of root, 8 
sequence, 457 
convergent, 664 
metric space, 664 
transcendental, 2 
number line, extended, 694 
number plane, complex, Gauss, 34 


number representation, computer internally, 1001, 1003 


number system, 1001 
inary, 1002 
decimal, 1002 
hexadecimal, 1002 
octal, 1002 
number theory, elementary, 370 
numbers 
Bernoulli, 465 
Clifford, 290 
complex 
generalized, 290 
Euler, 466 
Fermat, prime, 371 
Fibonacci, 375, 908 
interval, 2 
Mersenne, prime, 371 
rime, 370 
seudo-random, 844 
seudorandom, 844 

random, 843 

normal distributed, 934 

numerical 

analysis, 949 

axis, 1 

calculations, computer, 1004 

library (numerical methods), 949 
nutation, 215 
nutation angle, 215 
Nystróm method, 631 


obelisk, 155 
objective function, 909 
selection, 933 
observable, 594 
observational value, 848 
occupancy number, statistics, 833 
octahedron, 156 
octal number, 1002 
octal system, 1002 
octant, 210 
Octave, 1011 
offspring (descendant), 933 
populations, 936 
w-limit set, 859, 865, 872 
operation, 335 
algebraic, 282 
arithmetical, 1 
associative, 335 
binary, 335 
commutative, 335 
exterior, 336 
n ary operation, 335 
operational method, 588 


operational notation, differential equation, 


operator 
adjoint, 684 
AND, 421 
bounded, 677 
closed, 678 
coercivity, 693 
compact, 686 
compensatory, 421 
completely continuous, 686 
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operator (continued) 
continuous, 668 
inverse, 679 
contracting, 666 
demi-continuous, 693 
differentiable, 690 
finite dimensional, 686 
gamma, 42 
Hammerstein, 689 
idempotent, 686 
inverse, 659 
isotone, 692 
lambda, 42 
linear, 366, 658 
bounded, 677 
continuous, 677 
linear transformation, 366 
linear, permutability, 367 
linear, product, 366 
linear, sum, 366 
monotone, 693 
Nemytskij, 689 
non-linear, 689 
norm, 862 
notion, 49 
OR, 421 
positive, 660 
positive definite, 686 
positive non-linear, 692 
self-adjoint, 685 
singular, 650 
strongly monotone, 693 
Urysohn, 690 
operator method, 769 
opposite side, 131 
optimality condition, 924, 926 
sufficient, 925 
optimality principle, Bellman, 945 
optimization, 909 
optimization method 
Bellman functional equation, 946 
conjugate gradient, 932 
cutting plane method, 942 
damped, 932 
DFP (Davidon-Fletcher-Powell), 932 
evolution strategies, 933 
feasible direction, 937 
Fibonacci method, 930 
golden section, 930 
Hildreth-d'Esopo, 929 
Kelley, 942 
penalty method, 940 
projected gradient, 938 
unconstrained problem, 931 
Wolfe, 928 
optimization problem 
convex, 683, 926 
dynamic, 943 
non-linear, 991 
optimization, non-linear, 924 
barrier method, 941 
convex, 926 
convexity, 927 


descent (offspring) method, 931 
direction search program, 937 
duality, 926 
evolution strategies, 933 
gradient method 
inequality constraints, 936 
mutation, 933 
Newton method, 931 
numerical search procedure, 930 
offspring (descent) method, 931 
quadratic, 926 
recombination, 933 
saddle point, 925 
selection, 933 
steepest descent (offspring) method, 931 
orbit, 857 
double, periodic, 898 
heteroclinic, 868 
homoclinic, 868, 903 
periodic, 857 
hyperbolic, 865 
saddle type, 865 
order 
Clifford algebra, 289 
curve, second order, 206 
curve, n-th order, 195 
differential equation, 540 
interval, 660 
of magnitude, function, 57 
(0) interval, 660 
second order surface, 224 
wavelet, 801 
order relation, 333 
linear, 334 
order symbol, Landau, 57 
ordering, 334 
lexicographical, 334 
linear, 334 
partial, 334, 659 
ordinate 
plane coordinates, 190 
space coordinates, 210 
Ore theorem, 406 
orientation, 288 
coordinate system, 209 
numerical axis, 1 
origin 
numerical axis, 1 
plane coordinates, 190 
space coordinates, 210 
original function, 767 
original space, 767 
set, 49 
orthocenter, triangle, 133 
orthodrome, 174 
arclength, 176 
course angle, 174 
intersection point 
meridian, 176 
parallel circle, 176 
two orthodromes, 180 
point, closest to the north pole, 174 
orthogonal 
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function system, 985 

polynomials, 985 

space, 674 

spherical, 171 
orthogonality 

eigenvalues, differential equation, 570 

Hilbert space, 674 

lines, 129 

real vector space, 367 

trigonometric functions, 368 

vectors, 185 

weight, 570 
orthogonality conditions 
line-plane, 224 
lines in space, 223 
planes, 221 
orthogonalization method, 958, 959 
Givens, 960 
Gram-Schmidt, 675 
Householder, 314, 960 
Schmidt, 637 
orthogonalization process 
Gram-Schmidt, 316 
orthonormal function system, 985 
orthonormalization, vectors, 281 
oscillation 
duration, 84 
harmonic, 84 
oscillator, linear harmonic, 601 
osculating plane, space curve, 257, 259 
osculation point, curve, 250 
over-field, 362 
oversliding, 285 


pair, ordered, 331 
parabola, 204 
arclength, 206 
area, 206 
axis, 204 
binomial, 71 
cubic, 64 
diameter, 205 
directrix, 204 
equation, 205 
ocus, 204 
intersection figure, 226 
n-th degree, 66 
parameter, 205 
quadratic polynomial, 64 
radius of curvature, 206 
semicubic, 95 
semifocal chord, 204 
tangent, 205 
transformation, 207 
vertex, 204 
paraboloid, 226 
elliptic, 226 
hyperbolic, 226, 227 
central surface, 228 
invariant signs, 229 
of revolution, 226 
parallel circle, 262 
parallelepiped, 153 


rectangular, 153 
parallelism conditions 
line-plane, 224 
lines, 129 
lines in space, 223 
planes, 221 
parallelogram, 135 
parallelogram identity, unitary space, 673 
parameter, 11, 50 
parabola, 205 
statistical, 833 
parameter space, stochastic, 826 
parametric integral, 512 
parametric representation, circle, 199 
parametrized center of area method, 427 
parity, 601 
Parseval 
equation, 475, 570, 676 
formula, 789 
partial fraction decomposition, 15, 778 
special cases, 1071 
partial ordering, 334 
partial sum, 459 
partition, 334 
Pascal limacon, 98 
Pascal triangle, 13 
path, 404 
closed, 404 
integration, 516 
patterns, periodic, 604 
Pauli matrices, 290 
PCNF (principal conjunctive normal form), 399 
PDNF (principal disjunctive normal form), 399 
pencil of lines, 197 
pendulum 
equation, 904 
Foucault, 1019 
mathematical, 762 
period, 763 
pentagon, regular, 139 
pentagram, 4 
regular, 139 
percentage, 21 
calculation, 21 
performance score, 425 
perimeter 
circle, 139 
ellipse, 201 
polygon, 138 
period 
function, 52 
secant, 78 
sine, 77, 84 
tangent, 77 
period doubling, 898 
cascade, 899, 905 
period parallelogram, 763 
permutability, linear operators, 367 
permutation, 805 
cyclic, vectors, 209 
group, 338 
matrix, 957 
with repetition, 805 
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without repetition, 805 
perturbation, 589, 1006 
Pesin formula, 881, 888 
phase 
initial, 84 
portrait, dynamical systems, 860 
shift, 77, 84 
sine, 77, 84 
space, dynamical systems, 857 
spectrum, 787 
icar 
teration method 
integral equation, 625 
successive approximation method, 549 
icard-Lindelöf theorem, 668 
pivot, 956 
column, 307, 915 
element, 307, 915 
row, 307, 915 
scheme, 307 
pivoting, 307, 310 
column pivoting, 957 
step, 307 
plain trigonometry, 142 
plane (space) 
intersecting line, 152 
normal plane, 257, 262 
osculating plane, 257 
parallel line, 152 
projecting plane, 221 
rectifying plane, 257, 259 
tangent plane, 257, 262, 264 
vector equations, 189 
plane geometry, 129 
planes (space), 151 
equations, 218 
intersecting points with lines, 223 
orthogonality, conditions, 221 
parallelism, 151 
distance, 221 
parallelism, conditions, 221 
planimeter, 500 
Poincaré mapping, 868, 870 
Poincaré section, 903 
point 
accumulation, 664 
asymptotic, curve, 250 
boundary, 664 
circular, 267 
coordinates, 190 
corner, 250 
cuspidal, curve, 250 
discontinuity, 58 
double, curve, 250 
fixed, conformal mapping, 736 
fixed, stable, 899 


focal, ordinary differential equation, 548 


image, 237 
improper, 193 
infinite, 193 
interior, 663 
isolated, 664 
isolated, curve, 250 


limit, 255 
limiting, 664 
multiple, curve, 251 
n-dimensional space, 118 
nearest approach, 255 
neighborhood, 663 
non-wandering, 875 
notion, 129 
plane curve, 249 
principal vanishing, 241 
rational, 1 
regular, surface, 262 
saddle, ordinary differential equation, 548 
singular, 243, 250 
isolated, 547 
ordinary differential equation, 547 
surface, 262, 263 
spectrum, 680 
spiral, ordinary differential equation, 548 
surface point 
circular, 267 
elliptic, 266 
hyperbolic, 267 
parabolic, 267 
spherical, 266 
umbilical, 267 
terminal, curve, 250 
transversal homoclinic, 872 
umbilical, 267 
vanishing, 237, 241 
Poisson 
differential equation, partial, 592, 726, 729 
distribution, 817 
formula, 591 
integral, 587 
polar, 164 
angle, 191 
axis, 191 
coordinates, plane, 191 
coordinates, spherical, 211 
distance, 262 
equation, 599 
curve second order, 208 
subnormal, 245 
subtangent, 245 


pole 
complex function, 733 
field, 703 
function, 60 
left, 164 
multiplicity m, complex function, 753 
on the sphere, 164 
order m, complex function, 753 
origin, 191 
right, 164 
Polish notation, 408 
polyeder, 153 
polygon 
area 
2n gon, 139 
n gon, 138 
base angle, 138 
central angle, 138 
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circumcircle radius, 138 complex terms, 750 
circumscribing, 138 expansion, analytic function, 749 
exterior angle, 138 inverse, 471 
inscribed circle radius, 138 power set, 335 
interior angle, 138 power spectrum, 878 
perimeter, 138 precession, 215 
plane, 138 predicate, 326 
regular, 340 logic, 326 
regular, convex, 138, 139 n ary, 326 
side length, 138 predictor, 971 
similar, 134 predictor-corrector method, 971 
polyhedral angle, 152 pre-Hilbert space, 673 
polyhedron, 152, 153 present value, 25 
convex, 155 pressure, 504 
regular, 155 gravitational, 504 
polynomial, 63 lateral, 504 
characteristic, 315 prevalence, 891 
control, 386 prime 
equation, numerical solution, 952 coprime, 5, 14 
first degree, 63 decomposition, canonical, 372 
generating, 386 element, 370 
indecomposable, 363 actorization, 371 
integral rational function, 62 canonical, 371 
interpolation, 982 Fermat, 371 
irreducible, 363 Mersenne, 371 
n-th degree, 65 notation (measurement protocol), 848 
primitive, 364 air, 371 
product representation, 43 quadruplet, 371 
quadratic, 64 relatively, 14 
ring, 363 riplet, 371 
second degree, 64 prime formula, predicate logic, 326 
third degree, 64 prime number, 370 
trigonometric, 992 seudo, 382 
polynomials, 11 est 
Bernstein, 1000 AKS, 383 
Chebyshev, 89, 985, 989 Fermat, 382 
Hermite, 568, 602, 675, 985 Lucas-Lehmer, 371 
Laguerre, 568, 675, 985 Rabin-Miller, 382 
Legendre, 566, 600, 674, 985 sieve of Eratosthenes, 370 
orthogonality, 985 principal (a sum of money), 21 
population, 830 principal axis 
two-stage, 814 direction, 283 
population strategies, 935 transformation, 283 
Posa theorem, 406 principal ideal, 362 
position coordinate, reflection, 287 principal normal 
positive definite, 958 section, surface, 266 
postfix notation, 408 space curve, 257, 259 
potential principal quantities, 11 
complex, 741 principal value 
equation, 592 integral, improper, 507 
field, 721 inverse hyperbolic function, 760 
conservative, 721 inverse trigonometric function, 86, 760 
rotation, 715 logarithm, 758, 760 
retarded, 591 principle 
power Cauchy, 666 
complex number, 38 Cavalieri, 503 
notion, 7 contracting mapping, 666 
real number, 7 extensionality, 328 
reciprocal, 70 extrapolation principle, 967 
power function, 71 Neumann, 345 
quaternions, 294 two-values, 323 
power series, 469 prism, 153 


asymptotic, 472 lines, 153 
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regular, 153 
probability 
acceptance, 853 
area interpretation, 812 
conditional, 810 
definition, 808 
position of a particle, 596 
total, 810 
probability amplitude, 596 
probability integral, 819 
probability measure, 876 
ergodic, 880 
invariant, 876 
probability paper, 834 
probability theory, 805, 807 
probability vector, 827 
problem 
Cauchy, 572 
Dirichlet, 582, 729 
discrete, 985 
eigenvalue, 315 
inhomogeneous, 591 
linear fitting, system of equations, 958 
multidimensional 
computation of adjustments, 986 
n-dimensional, 590 
Neumann, 729 
regularized, 314 
shortest way, 404 
Sturm-Liouville, 569 
two-body, 574 
problems, basic 
plane trigonometry, 144 
spherical trigonometry, 169 
process 
birth process, 829 
death process, 829 
orthogonalization process, 316 
Poisson process, 828 
stochastic, 825, 826 
product, 7 
algebraic, 419, 420 
Cartesian 
fuzzy sets, 422 
n fold, 423 
sets, 331 
derivative, 434 
direct 
group, 338 
Q algebra, 395 
drastic, 419, 420 
dyadic, 273 
inner 
continous groups, 353 
Hilbert space, 673 
Lie algebra, 358 
vectors, 184 
Kronecker, 276 
matrices, 272 
complex, 273 
vanishing, 275 
n times direct, 397 
product sign, 7 


quaternions, 290, 293 
rules of calculation, 7 
scalar, 184, 273 

Hilbert space, 673 
tensors, dyadic, 282 
vectors 

cross, 184 

dot, 184 

dyadic, 273 
properties, 184 
quaternions, 291 
scalar, 273 
ensor, 273 
vector, 184 


programming, 909 


continuous dynamic, 943 
discrete dynamic, 943 
Bellman functional equation, 944 
Bellman functional equation method, 946 
Bellman optimality principle, 945 
constraint dynamic, 943 
constraint static, 943 
knapsack problem, 944 
problem, 943 
purchasing problem, 944 

state vector, 943 
linear, 909 
Mathematica, 1034 


programming, discrete 


continuous dynamical, 943 
cost function, 944 
decision, 943 
space, 943 
dynamic, 943 
functional equation, 945 
Bellman, 944 
method, 946 
knapsack problem, 944, 947 
minimum interchangeability, 945 
minimum separability, 944 
n-stage decision process, 943 
optimal policy, 946 
optimal purchasing policy, 946 
purchasing problem, 944 
state costs, 943 
state vector, 943 


programming, linear 


application, 920 

basic solution, 913 

basis of the extreme point, 912 

basis, inverse, 913 

constraints, 909 

duality, 919 

duality theorems, 919 

extreme point, 912 
degenerate, 912 

feasible set, 910 

forms, 909 

general form, 909 

Hungarian method, 923 

integer programming, 910 

maximum point, 910 

normal form, 913 
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programming (continued) 
northwest corner rule, 921 
objective function, linear, 909 
problem 
assignment, 923 
distribution, 923 
dual, 919 
minimum, 910 
primal, 919 
round-tour, 923 
scheduling, 924 
transportation, 920 
transportation, simplex method, 921 
properties, 911 
simplex method, 913, 914 
revised, 917 
solution point, 910 
variable 
artificial, 916 
basic, 913 
non-basic, 913 
non-negative, 910 
slack, 910 
surplus, 913 
projection 
axonometric, 238 
cabinet, 238, 240 
Cavalier, 238, 240 
central, 237 
isometric, 239 
matrix, 238, 240 
oblique, 238 
orthogonal parallel, 237 
orthographic, 237 
parallel, 237 
oblique, 238, 240 
perspective, 237, 241 
mapping prescription, 241 
vanishing point, 241 
planar, 237 
principal, 238 
stereographic, 303 
projection sides, 143 
projection theorem, orthogonal space, 674 
projector, 686 
proof 
by contradiction, 5 
constructive, 6 
direct, 5 
indirect, 325 
implication, 5 
mathematical induction, 5 
step from n ton +1, 5 
proportionality 
direct, 63 
inverse, 66 
proportions, 17 
proposition 
dual, 396 
true or false, 323 
propositional 
calculus, 323 
logic, 323 


expression, 323 
operation, 323 

extensional, 323 
variable, 323 


propositional logic, theorems, 324 
protocol, 832 

pseudo-random numbers, 844 
pseudonorm, 682 

pseudoscalar, 288, 289 
pseudotensor, 287, 288 
pseudovector, 287, 288 

Ptolemy’s theorem, 137 

pulse, rectangular, 774 


bipolar, 791 
Jordan lemma, 757 
unipolar, 786 


pyramid, 154 


frustum, 154 
n faced, 154 
regular, 154 
right, 154 
truncated, 154 


Pythagoras 


general triangle, 143 
right-angled triangle, 142 


QR algorithm, 319, 959 
QR decomposition, 959 
quadrangle, 135, 136 


circumscribing, 136, 137 
concave, 136 

convex, 136 

inscribed, 137 


quadrant relations, trigonometric functions, 79 


qua 


rant, Cartesian coordinates, 190 


quadratic 


curve, 206 
surface, 228 


quadratic form 


first fundamental, of a surface, 263 
index of inertia, 318 
second fundamental, surface, 266 


quadrature formula, 963 


Gauss type, 965 

Hermite, 964 

integral equation, 630 
interpolation quadrature, 964 
Lobatto type, 966 


quadruple (ordered 4 tuple), 331 
quantification, restricted, 327 
quantifier, 326 

quantile, 812 

quantity, infinitesimal, 494, 500 
quantum number, 597 


energy, 597 

magnetic, 601 

orbital angular momentum, 599 
vibration quantum number, 602 


quartic, 97 
quasiperiodic, 866 
quaternion 


biquaternion, 290 
conjugation, 292 
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definition, 290 
inverse element, 292 
pure quaternion, 291 
unit quaternion, 291 
quaternions, 289 
addition, 292 
algorithm efficiency, 301 
applications, 302 
Cardan angles, 298 
complex matrix, 292 
computer graphic, 302 
conjugation, 298 
division, 293 
Euler angles, 296 
hyperbolic functions, 294 
interpolation, 302 
Lerp, 302 
rotation matrices, 303 
Slerp, 302 
Squad, 303 
Laplace operator, 306 
logarithmic function, 294 
multiplication, 292, 293 
power function, 294 
representation 
complex numbers, 290 
four vector, 290 
matrix, 291 
rigid-body motion, 306 
biquaternions, 306 
rotation matrix, 294 
rotations, 294, 297 
skew field, 290 
subtraction, 292 
trigonometric form, 291 
trigonometric functions, 293 
vector analysis, 305 
queuing, 829 
theory, 829 
quintuple (ordered 5 tuple), 331 
quotient, 1, 12 
derivative, 435 
differential, 432 
set, 334 


R? (3 dimensional Euclidean vector space), 289 


R^ (4 dimensional Euclidean vector space), 289 
radial equation, 599 
radian definition, 131 
radian measure, 131 
radian unit, 1055 
radicals, ordinary differential equation, 546 
radicand, 8 
radius 
circle, 140, 198 
circumcircle, 143 
convergence, 469 
curvature, 248 
curve, 247 
space curve, 258 
curvature, extremal, 616 
polar coordinates, 191 


principal curvature 
surface, 265 
short, 133 
torsion, space curve, 260 
vector, 182 
raising to a power 
complex numbers, 38 
real numbers, 7, 9 
random numbers, 830, 843, 934 
application, 844 
congruence method, 844 
construction, 844 
different distributions, 844 
normal distributed, generation, 934 
pseudo-random, 844 
table, 1139 
uniformly distributed, 843 
random variable, 811 
continuous, 811, 812 
discrete, 811 
independent, 814, 839 
mixed, 811 
multidimensional, 814 
two-dimensional, 839 
random vector 
mathematical statistics, 830 
multidimensional random variable, 814 
random walk process, 846 
range, 48 
operator, 658 
sample function, 832 
statistics, 834 


rank 


matrix, 274 
tensor, 281 
vector space, 367 
rate 
effective, 22 
nominal, 22 
of interest, 22 
ray point, 548 
ray, notion, 129 
Rayleigh-Ritz algorithm, 319 
reaction, chemical, concentration, 117 
real part (complex number), 34 
rebate, 21 
recombination, 933 
intermediary, 933 
reconstruction 
dynamic from time series, 889 
embedding, 890 
immersion, 890 
mapping, 890 
pair sets, 890 
prevalence, 891 
theorem 
Kupka, Smale, 890 
Sauer, Yorke, Casdagli, 891 
Takens, 890 
reconstruction space 
dynamic, 890 
time series, 889 
rectangle, 136 
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rectangular formula, simple, 964 
rectangular sum, 964 
rectification, 108 
Reduce (computer algebra system), 1023 
reduction, 21 
angle, 175 
reduction formula, trigonometric functions, 79 
reflection principle, Schwarz, 741 
reflection, position coordinate, 287 
region, 119 
multiply-connected, 119 
non-connected, 119 
simply-connected, 119 
two-dimensional, 119 
regression 
analysis, 839 
coefficient, 841 
ine, 841 
linear, 841 
multidimensional, 842 
regula falsi, 951 
regularity condition, 925, 941 
regularization method, 321 
regularization parameter, 314 
relation, 331 
binary, 331 
congruence relation, 394 
equivalence relation, 333 
uzzy-valued, 422 
inverse, 332 
less or equal than (< relation), 326 
matrix, 331 
n ary, 331 
n place, 331 
order relation, 333 
product, 332 
uncertainty, 596 
relaxation method, 961 
relaxation parameter, 961 
reliability testing, 829 
relief, analytic function, 732 
remainder 
estimation, 466 
series, 459 
term, 459 
Remes algorithm, 990 
rent, 25 
ordinary, constant, 25 
representation function 
explicit form, 49 
implicit form, 49 
parametric form, 50 
representation of groups, 340 
adjoint, 342 
direct product, 344 
direct sum, 344 
equivalent, 342 
faithful, 342 
identity, 342 
irreducible, 344 
non-equivalent, 342 
particular, 342 
properties, 342 


reducible, 344 
reducible, complete, 344 
representation matrix, 340 
representation space, 340 
subspace, 343 
true, 342 
unitary, 342 
representation theorem (fuzzy logic), 417 
representative, 362 
resection 
Cassini, 150 
Snellius, 149 
residual spectrum, 681 
residual sum of squares, 313, 959 
residual vector, 313 
residue, 313, 958 
quadratic modulo m, 379 
complex function, 753 
theorem, 754 
residue class, 377 
addition, 377 
multiplication, 377 
primitive, 378 
relatively prime, 378 
ring, 361, 377 
modulo m, 378 
residue theorem, 752 
application, 755 
resolvent, 626, 628, 645, 680 
set, 680 
resonance torus, 904 
reversing the order of the bits, 995 
rhombus, 136 
Riemann 
formula, 584 
function, 584 
integral, 494 
comparison with Lebesgue integral, 506 
comparison with Stieltjes integral, 506 
method, 584 
sum, 494 
surface, many-sheeted, 745 
theorem, 463 
right-hand coordinate system, 209 
right-hand rule, 184, 722 
right-screw rule, 722 
right-singular vector, 321 
rigid-body motion, 358 
biquaternions, 306 
Chasles theorem, 359 
uaternions, 306 
screwing motion, 359 
ring, 361 
division ring of quaternions, 290 
factor ring, 363 
homomorphism, 362 
theorem, 362 
isomorphism, 362 
subring, 362 
risk theory, 21 
Ritz method, 618, 974 
robotics, 358 
Romberg method, 966 
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root 
complex function, 733 
complex number, 38 
equation, 43 
non-linear, 949 
notion, 8 
N-th of unity, 994 
real number, 8 
square root, complex, 737 
theorem of Vieta, 44 
root locus theory, 954 
root test of Cauchy, 461 
rotation 
angle, 215 
arbitrary zero point axis, 296 
around an arbitrary space axis, 232 
Cardan angles, 295, 298 
coordinate axis, 191 
coordinate system, 212 
definition, 713 
different coordinates, 714 
direction cosines, 213 
Euler angles, 296 
mapping, 877 
object, 213, 229 
around the coordinate axis, 295 
potential field, 715 
remark, 709 
vector components, 718 
vector field, 713 
rotation determinant, 213 
transformation properties, 214 
rotation field 
pure, 727 
zero-divergence field, 727 
rotation invariance, 283, 284 
rotation invariance, cartesian tensor, 283 
rotation matrix, 191, 212, 275 
arbitrary zero point axis, 296 
Cardan angles, 295, 298 
coordinate transformation, 231 
Euler angles, 296 
orthogonal, 28 
quaternions, 294, 297 
space, 212, 215, 216 
rotation number, 906 
rotations 
coordinate system, 212 
geometric 3D transformation, 234 
geometric, 2D transformation, 229 
object, 213 
quaternions, 297 
rotator, rigid, 598 
round-off, 1005 
error, 1007 
measurement, 848 
error method, 1007 


row echelon form, system of linear equations, 312 


row sum criterion, 960 

Ruelle-Takens-Newhouse scenario, 904 

rule, 971 
Adams and Bashforth, 971 
Bernoulli-l’ Hospital, 56 


Cartesian rule of signs, 953 
Cramer, 311 
De Morgan, 324 
Descartes, 45 
Guldin, first rule, 506 
Guldi 
linguistic, 428 
Milne, 971 
Sarrus, 279 
ruled surface, 268 
rules 
calculation determinants, 278 
calculation matrices, 275 
calculation quaternions, 292 
composition, 424 
differentiation 
one variable, 433 
several variables, 450 
divisibility, elementary, 370 
integration (general), 482 
Neper, 170 
rank, 274 
Runge-Kutta method, 969 


saddle, 864, 871 
saddle form, 267 
saddle point 
differential equation, 548 
Lagrange function, 925 
Sage (computer algebra system), 1023 
sample, 814, 830 
function, 831 
random, 830 
size, 814 
summarizing, statistical, 832 
variable, 830, 831 
Sarrus rule, 279 
scalar 
invariant, 281 
notion, 181 
scalar field, 702 
axial field, 703 
central field, 702 
coordinate definition, 703 
directional derivative, 708 
gradient, 710 
lane, 702 
scalar matrix, 270 
scalar product, 184, 367 
ilinear form, 368 
Hilbert space, 673 
quaternions, 291, 293 
representation in coordinates, 187, 189 
rotation invariance property, 288 
wo functions, 985 
vectors, 273 
scale 
cartography, 145 
equation, 115 
actor, 115, 147 
ogarithmic, 115 
notion, 115 
semilogarithmic, 116 
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scenario, Ruelle- Takens-Newhouse, 904 
Schauder fixed-point theorem, 691 
scheme 
Falk, 273 
Young, 345 
Schmidt, orthogonalization method, 637 
Schoenflies symbolism, 346 
Schroedinger equation 
central field, 598 
angular dependence, 601 
azimutal equation, 600 
polar equation, 600 
radial equation, 599 
solution, 599 
linear, 592 
linear harmonic oscillator, 601 
non-linear, partial, 604, 606 
spherical von Neumann functions, 565 
time-dependent, 593 
time-independent, 594 
Schur's lemma, 345 
Schwarz 
exchange theorem, 448 
reflection principle, 741 
Schwarz-Buniakowski inequality, 673 
Schwarz-Christoffel formula, 739 
Scilab, 1011 
screw 
left, 260 
Lie algebra, 360 
right, 260 
screw motion 
Chasles theorem, 359 
rigid-body motion, 359 
search procedure, numerical, 930 
secant, 139 
hyperbolic, 89 
theorem, 139 
trigonometric, 78 
geometric definition, 131 
secant-tangent theorem, 139 
section 
normal (surface), 265 
priciple normal (surface), 265 
section, golden, 2, 4, 194, 908 
sector formula, 725 
sector, spherical, 158 
segment 
normal, 245 
notion, 129 
polar normal, 245 
polar tangent, 245 
tangent, 245 
selection, 933 
self-similar, 883 
semantically equivalent, 324 
expressions, 398 
semi-linear, 369 
semi-monotone, 671 
semi-norm, 682 
semifocal chord 
ellipse, 199 
hyperbola, 201 


parabola, 71, 204 
semigroup, 336 

free, 336 
semilogarithmic paper, 116 
semiorbit, 857 
sense class, 134 

of a figure, 133 


sensitive, with respect to the initial values, 889 


sentence, 323 
separable sets, 683 
separation 
constant, 597 
theorems (convex sets), 683 
variables, 542, 579, 597 
separatrix 
loop, 868, 903 
surface, 866, 872 
sequence, 457 
bounded, 656 
Cauchy sequence, 665 
convergente, 664 
finite, 655 
in metric space, 664 
infinite, 457 
numbers, 457 
bounded, 457, 656 
bounded above, 458 
bounded below, 458 
convergence, 458 
converging to zero, 656 
divergence, 458 
finite, 655 
aw of formation, 457 
imit, 458 
monotone, 457 
erm, 457 
series, 18, 457 
alternating, 463 
arithmetic, 18 
Banach space, 670 
Clebsch-Gordan, 345 
comparison criterion, 460 
constant term, 459 
convergence, 459, 462 
absolute, 469 
non-uniform, 468 
theorems, 459 
uniform, 468, 469 
definite, 18 
divergence, 459, 462 
expansion, 471 
area functions, 1061 
binomial, 1057 
exponential functions, 1057 
Fourier, 474 
hyperbolic functions, 1061 
inverse trigonometric functions, 1060 
Laplace transformation, 779 
Laurent, 752 
logarithmic functions, 1059 
Maclaurin, 472 
power series, 469 
Taylor, 442, 471 
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series (continued) fractal, 888 
trigonometric functions, 1058 stable, 859 
finite, 18 irrational numbers, 2 
Fourier, 474 linear, 655 
complex representation, 475 manyness, 335 
function, 467 measurable, 694 
domain of convergence, 467 natural numbers, 1 
general term, 459 non-denumerable infinite, 335 
geometric, 19 notion of set, 327 
infinite, 19, 459 open, 663 
harmonic, 459 operation 
hypergeometric, 567 Cartesian product, 331 
infinite, 457, 459 complement, 328 
Laurent, 752 difference, 330 
Maclaurin, 472 intersection, 328 
Neumann, 626, 678 union, 328 
partial sum, 459 operations, 328 
power, 469 order-bounded, 660 
expansion, 471 original space, 49 
inverse, 471 power, 335 
remainder, 459, 467 power set, 328 
sum, 459 quotient set, 334 
Taylor, 442, 471 rational numbers, 1 
test real numbers, 2 
Cauchy, integral test, 462 relative compact, 686 
Cauchy, root test, 461 subset, 328 
D'Alembert's ratio test, 461 theory, 327 
Leibniz, alternating series test, 463 universal, 329 
uniformly convergent void, 328 
continuity, 468 set algebra, 693 
differentiation, 469 sets, 327 
integration, 469 coordinates z, y, 331 
properties, 468 difference, symmetric, 330 
Weierstrass criterion, 468 operation 
sesquilinear form, 369 symmetric difference, 330 
hermitian, 369 sexagesimal degree, 131 
set, 327 shearing, transformation, 230 
absorbing, 859 shift mapping, 880 
algebra, fundamental laws, 329 shift register, linear, 364 
axioms of closed sets, 664 shooting method, 975 
axioms of open sets, 664 simple, 975 
Borel, 694 shore-to-ship bearing, 172 
bounded in metric space, 664 side condition, 614 
cardinality, 335 side or leg of an angle, 129 
closed, 664 Sierpinski 
closure, 665 carpet, 884 
compact, 686, 860 gasket, 884 
complex numbers, 34 sieve of Eratosthenes, 370 
convex, 657 c additivity, 694 
dense, 665 c algebra, 694 
denumerable infinite, 335 Borelian, 694 
disjoint, 329 sign of a function, 50 
element, 327 signal, 800 
empty, 328 analysis, 800 
equality, 328 synthesis, 800 
equinumerous, 335 signature, universal algebra, 394 
fundamental, 329, 674 significance, 839 
fuzzy, 413 level, 836 
image, 49 level of type 1, 813 
infinite, 335 similarity transformation, 316, 317 
integers, 1 simplex 
invariant, 859 method, 913, 914 


chaotic, 888 revised, 917 
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pivot element, 915 
secondary program, 917 
step 
revised, 918 
tableau, 914 
degenerate case, 916 
non-generate case, 915 
revised, 917 
variable 
artificial, 916 
Simpson's formula, 965 
simulation 
digital, 843 
Monte Carlo, 843 
sine 
hyperbolic, 89 
geometric definition, 132 
trigonometric, 76 
geometric definition, 131 
sine law, 143 
sine integral, 513, 756 
sine law, 166 
sine-cosine law, 166 
polar, 166 
sine-Gordon equation, 607 
sine-Gordon equation (SG), 604 
single-target method, 975 
singleton, 416 
singular value, 321, 880 
decomposition, 321 
singularity 
analytic function, 733 
essential, 733, 753 
isolated, 752 
pole, 753 
removable, 733 
sink, 864, 871 
vector field, 712 
vertex, 411 
sinusoidal amount, 84 
skew field 
quaternions, 289 
rings, 361 
Slater condition, 926 
slide rule, 10 
logarithmic scale, 115 
slope 
plane, 195 
tangent, 245 
small circle, 160, 176 
arclength, 177 
course angle, 177 
intersection point, 177 
radius, plane, 176 
radius, spherical, 176 
smoothing 
continuous problem, 984 
error propagation, 856 
parameter, 998 
spline, 997 
cubic, 997 
Sobolev space, 671 
solid angle, 152 


soliton 
antikink, 608 
antisoliton, 606 
Boussinesq, 609 
Burgers, 609 


dissipative, 604 
Hirota, 609 
interaction, 604 
Kadomzev-Pedviashwili, 609 
kink, 607 

lattice, 608 
ink-antikink, 608 

collision, 608 

doublet, 608 
kink-kink 

collision, 608 
Korteweg de Vries, 605 
non-linear, Schroedinger, 606 


SOR method (successive over-relaxation), 961 


source, 864, 871 
distribution 
continuous, 728 
discrete, 728 
field 
irrotational, 726 
pure, 726 
vector field, 712 
vertex, 411 
space 
abstract, 49 
Banach, 670 
complete, metric, 665, 666 
directions, 212 
finite-dimensional, 686 
undamental, 699 
higher-dimensional, 118 
Hilbert, 673 
infinite-dimensional, 657 
isometric, 669 
Kantorovich, 661, 686 
inear, 654 
L? space, 697 
metric, 662 
completion, 668 
convergence of sequence, 664 
normable, 670 
separable, 665 
non-reflexive, 684 
normed 
axiom, 669 
properties, 670 
ordered normed, 671 
orthogonal, 674 
reflexive, 684 
Riesz, 660 
second adjoint, 684 
separable, 665 
Sobolev, 671 
unitary, 673 
vector, 654 
space curve, 256 
binormal, 257, 259 


differential equations, partial, non-linear, 603 
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coordinate equation, 259 
curvature, 258 
direction, 256 
equation, 218, 256 
moving trihedral, 257 
normal plane, 257, 259 
osculating plane, 257, 259 
principal normal, 257, 259 
radius of curvature, 258 
radius of torsion, 260 
tangent, 257, 259 
torsion, 260 
vector equation, 256, 259 
space inversion, 287 
mixed product, 289 
scalar product, 288 
spectral radius, 678 
spectral theory, 680 
spectrum, 787 
amplitude, 787 
continuous, 681 
frequency, 787 
linear operator, 680 
phase, 787 
sphere, 158 
ellipsoid, 224 
equation, three forms, 261 
spherical 
biangle, 163 
coordinates, 211 
diagon, 163 
distance, 160 
field, 702 
helix, 178 
lune, 163 
spherical functions, 564 
complex, 564 
spinor, 290 
spiral, 105 
Archimedean, 105 
hyperbolic, 105 
logarithmic, 106, 251 
spline, 996 
basis spline, 998 
bicubic, 998 
approximation, 1000 
interpolation, 998 
net points, 998 
cubic, 996 
interpolation, 996 
smoothing, 997 
interpolation, 982, 996 
natural, 996 
normalized B-spline, 998 
periodic, 996 
smoothing, 997 
splitting field, polynomials, 363 
spur 
matrix, 270 
tensor, 284 
square, 136 
stability 
absolutely stable, 972 


first approximation, 864 


Lyapunov stability, 863 
orbital, 863 
eriodic orbit, 864 


structural, 873, 874 
stability theory, 863 
classification, 864 
standard 

eviation, 813, 852, 854 
error, 851 

normal distribution, 819 
state 
degenerate, 598 

particle, 592 

space, stochastic, 825, 826 
stationary, 594 


statistical analysis, 832 
statistical summarization, 832 
statistics, 805 
escriptive, 832 
estimate, 831 
mathematical, 805, 830 
sample function, 831 
steady state, 857 
Steffensen method, 951 
step 
from nton+1,5 
function, 757, 774, 794 
functions, 804 
interval parameter, 962 
size, 969 
change, 970 
steradian, 152, 1055 
stereometry, 151 
Stieltjes integral, 506, 686 


erturbation of initial values, 972 
round-off error, numerical calculation, 1007 


steady, dynamical system, 857 


integration of differential equation, 972 


comparison with Riemann integral, 506 


notion, 506 
Stieltjes transformation, 768 
Stirling formula, 515 
stochastic 
basic notions, 826 
chains, 826 
process, 825 
processes, 826 
ochastics, 805 
tokes, integral theorem, 725 
trangling, 282 
ream function, 741 
rip, characteristic, 573 
rophoide, 97 
structure 
algebraic, 323 
classical algebraic, 335 
Sturm 
chain, 44 
function, 44 
sequence, 954 
theorem, 45 
Sturm-Liouville problem, 569 
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subdeterminant, 278 
subdomain, 998 
subgraph, 402 
induced, 402 
subgroup, 337 
criterion, 337 
cyclic, 337 
invariant, 338 
normal, 338 
trivial, 337 
subinterval, 494 
subnormal, 245 
subring, 362 


subset, 328, 655 
affine, 655 
linear, 655 
open, 663 
subspace, 367 
affine, 655 
criterion, 367 
invariant, representation of groups, 343 
subtangent, 245 
subtraction 
complex numbers, 36 
computer calculation, 1005 
polynomials, 11 
quaternions, 292 
rational numbers, 1 
subtractive cancellation, 1005 
sum, 6 
algebraic, 419 
bounded, 420 
drastic, 419, 420 
of the digits 
alternating, first order, 372 
alternating, second order, 372 
alternating, third order, 372 
first order, 372 
second order, 372 
third order, 372 
residual squares, 986 
Riemann, 494 
rules of calculation, 6 
summation sign, 6 
transverse, 372 
vectors, 182 
summarization, statistical, 832 
summation convention, Einstein’s, 280 
superposition 
fields, law, 728 
linear, 595 
non-linear, 606 
oscillations, 84 
principle, 595, 744 
differential equation, linear, 560 
differential equations, higher order, 554 
supplementary angle formulas, 80 
support 
compact, 802 
measure, 876 
membership function, 413 
supporting functional, 683 


ΕΞ 


supremum, 660 

surface, 261 
area, double integral, 527 
B-B representation, 1000 
barrel, 159 
block, 153 
cone, 157 
conical, 157, 218 
constant curvature, 267 
cube, 154 
curvature of a curve, 265, 267 
cylinder, 156 
cylindrical, 156, 218 
developable, 268 
element, 265 
element, vector components, 719 
equation, 228 

in normal form, 224 
space, 217 

equation, space, 261 


first quadratic fundamental form, 263 


Gauss curvature, 267 
geodesic line, 268 
harmonic, 601 
integral, 532, 722, 723 
first type, 532 
general form, 537 
second type, 535, 536 
line element, 263 
line of curvature, 266 
metric, 265 
minimal, 267 
normal, 262-264 
normal vector, 262 
oriented, 535 
patch, area, 265 
polyhedron, 153 
principal normal section, 266 
pyramid, 154 
quadratic, 228 
radius of principal curvature, 265 
rectangular parallelepiped, 153 
rectilinear generator, 226 
representation with splines, 996 
rotation symmetric, 218 
ruled, 268 
second order, 224, 228 
central surfaces, 228 
invariant signs, 229 
types, 228 


second quadratic fundamental form, 266 


sphere, 158 
angent plane, 262 
orus, 159 
ransversal, 866 
switch 
algebra, 395, 399 

unction, 399 

value, 399 

Sylvester, theorem of inertia, 318 
symbol 


&ronecker, 283 


internal representation (computer), 1001 
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Landau, 57 
Legendre, 379 


symmetry 


axial, 134 
central, 133 
element, 346 
Fourier expansion, 476 
group, 346 
applications in physics, 351 
crystallography, 348 
molecules, 347 
quantum mechanics, 350 
operation, 346 
crystal lattice structure, 349 
improper orthogonal mapping, 346 
reflection, 346 
rotation, 346 
without fixed point, 346 
with respect to a line, 134 


system 


Canonical, 573 
chaotic according to Devaney, 889 
cognitive, 428 
complete, 676 
dynamical, 857 
chaotic, 889 
conservative, 858 
continuous, 857 
C’-smooth, 857 
discrete, 858 
dissipative, 858 
ergodic, 877 
invertible, 857 
mixing, 877 
motion, 857 
reconstruction space, 890 
reconstruction theorem, 890 
time continuous, 857 
time discrete, 857 
time dynamical, 858 
volume decreasing, 858 
volume preserving, 858 
volume shrinking, 858 
four points, 217 
generators, 337 
knowledge based interpolation, 430 
mixing, 889 
Morse-Smale, 875 
normal equations, 313 
orthogonal, 674 
orthonormal, 674 
term-substitutions, 394 
trigonometric, 674 


system of algebraic equations, 39 


extraneous roots, 39 
irrational equations, 39 
vanishing denominator, 39 


system of linear equations 


compatible, 309 

consistent, 309 

existence of the solution, 312 
fundamental system, 309 
homogeneous, 308 


sys 


sys 
sys 


sys 


sys 


tab 


inconsistency, 310 
inconsistent, 309 
inhomogeneous, 308 
linear, 307 
numerical solution, 955 
Cholesky method, 958 
direct method, 955 
Gauss algorithm, 956 
iteration, 955 
iteration methods, 960 
least squares method, 955 
orthogonalization method, 958 
over-determined, 955, 958 
row echelon form, 955 
triangular decomposition, 957 
under-determined, 955 
over-determined, 313 
pivoting, 307, 310 
row echelon form, 312 
solution, 308 
solvability, 309, 310 
trivial solution, 309 
em of non-linear equations 
numerical solution, 955, 961 
Gauss-Newton method, 962 
iteration method, 961 
Newton’s method, 962 
em of polynomial equations, 39 
ems of differential equations, first-order linear 
homogeneous systems, 559 
inhomogeneous systems, 559 
ems of differential equations, linear 
constant coefficients, 558 
ems of partial differential equations 
canonical systems, 573 
first-order, 573 
normal system, 574 


e with double entry, 120 


tacnode, curve, 250 
tangent 


circle, 199 

formula, 143 

hyperbolic, 89 
geometric definition, 132 

law, 143 

plane, 160, 264, 448 
surface, 262 

plane curve, 244 

polygon, 138 

space curve, 257, 259 

trigonometric, 77 
geometric definition, 131 


tangential 


element, 357 
vector, 357 


tautology 


Boolean function, 397 
predicate logic, 327 
propositional logic, 325 


Taylor 


expansion, 57, 442, 471 
one variable, 471 
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Taylor (continued) 
several variables, 450 
vector function, 702 
formula, 442 
several variables, 450 
two variables, 449 
series, 442, 471 
analytic function, 751 
one variable, 471 
several variables, 450 
theorem, 442 
several variables, 449 
telegraphic equation, 585 
telephone-call distribution, 829 
tensor, 280 
addition, subtraction, 287 
alternating, 284 
antisymmetric, 283 
components, 281 
contraction, 282, 287 
definition, 281 
dyadic product, 282 
eigenvalue, 283 
generalized Kronecker delta, 283 
inertia, 282 
invariant, 284 


linear transformation, coordinates, 280 


multiplication, 287 
oversliding, 287 
product, 282 
vectors, 273 
rank 0, 1,2, n, 281 
rules of calculation, 282, 283 
skew-symmetric, 283, 287 
spur, 284 
symmetric, 283, 287 
tension, 282 
trace, 284 
transformation invariance, 283 
tensor product approach, 999 
tent mapping, 876 
term algebra, 395 
test 
X? test, 835 
goodness of fit, 834 
hypothesis, 839 
independence, two variables, 840 
prime number 
AKS, primality, 383 
Fermat, 382 
Lucas-Lehmer, 371 
Rabin-Miller, 382 
sieve of Eratosthenes, 370 
statistical, 834 
test problem, linear, 972 
tetragon, 136 
tetrahedron, 155, 217 
Thales theorem, 140, 142 
theorem 
Abel (power series), 469 
Afraimovich-Shilnikov, 904 
alternating point, 988 
Andronov-Pontryagin, 874 


theorem (continued I) 


Andronov-Witt, 864, 865 
Apollonius, 200 
Arzela-Ascoli, 686 
Baire category, 666 
Banach 
continuity, inverse operator, 679 
fixed-point, 689 
Banach-Steinhaus, 678 
Bayes, 810, 811 
Berge, 409 
inomial, 12 
Birkhoff, 395, 877 
Block-Guckenheimer-Misiuriewicz, 889 
Bolzano 
one variable, 61 
several variables, 124 
oundedness of a function 
one variable, 61 
several variables, 124 
Cauchy integral theorem, 747 
Cayley, 339, 408 
center manifold 
differential equations, 892 
mappings, 897 
Chasles, 359 
Chebyshev, 489 
Chinese remainder, 379 
Clebsch-Gordan, 345 
closed graph, 678 
constant analytic function, 733 
convergence, measurable function, 697 
decomposition, 334 
Denjoy, 906 


Dirac, 406 
Douady-Oesterlé, 886 
Euclidean algorithm, 374 
Euler (polyeder), 155 
Euler (theory of numbers), 381 
Euler-Hierholzer, 405 
Fatou, 697 
Fermat, 381, 441 
Fermat-Euler, 381 
fixed-point 

Banach, 666, 689 

Brouwer, 691 

Schauder, 691 
Floquet, 863 
undamental integral calculus, 511 


undamental theorem of algebra, 364 
Girard, 165 
Grobman-Hartman, 871, 873 
Hadamard-Perron, 867, 872 
Hahn (extension theorem), 683 
Hellinger- Toeplitz, 678 
Hilbert-Schmidt, 688 

Holladay, 997 

Hurwitz, 556 

intermediate value 

one variable, 61 

several variables, 124 


differentiability, respect to initial conditions, 857 


undamental of elementary number theory, 371 
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theorem (continued II) theorem (continued III) 
KAM (Kolmogorov-Arnold-Moser), 875 Wilson, 381 
Krein-Losanovskij, 679 Wintner-Conti, 860 
Kupka, Smale, 890 Young, 884, 885 
Kuratowski, 410 theorems 
Lagrange, 338 Euclidean, 142 
Lebesgue, 697 propositional logic, 324 
Ledrappier, 886 theory 
Leibniz, 463 distribution, 774 
Lerey-Schauder, 692 elementary number, 370 
Levi, B., 697 ergodic dynamical systems, 877 
limits field, 729 
functions, 458 function, 731 
sequences of numbers, 458 graph, algorithms, 401 
Lindeberg-Levy, 825 probability, 805 
Liouville, 733, 861 risk, 21 
Lyapunov, 863 set, 327 
maximum value, analytic function, 733 spectral, 680 
Meusnier, 265 vector fields, 701 
monotone convergence, 697 theta function, 764 
nested balls, 666 time frequency analysis, 803 
Ore, 406 tolerance, 417 
Oseledec, 880 topological 
Palis-Smale, 875 conjugation, 870 
Picard-Lindelóf, 668, 857 equivalence, 870 
Poincaré-Bendixson, 865 torsion, 236 
Posa, 406 torsion, space curve, 260 
Ptolemy’s, 137 torus, 159, 888, 889 
Pythagoras differential equation, linear, autonomous, 863 
general triangle, 143 dissolving, 904 
orthogonal space, 674 formation, 900, 905 
right-angled triangle, 142 invariant set, 866 
Radon-Nikodym, 697 losing smoothness, 904 
Riemann, 463 resonance torus, 904 
Riesz, 682 trace 
Riesz-Fischer, 676 matrix, 270 
Rolle, 441 tensor, 284 
Sauer, Yorke, Casdagli, 891 tractrix, 108 
Schauder, 687 trail, 404 
Schwarz, exchange, 448 Euler trail, 405 
Sharkovsky, 889 open, 405 
Shilnikov, 903 trajectory, 857 
Shinai, 889 transform, 767 
Shoshitaishvili, 892 transformation 
Smale, 902 algebraic expression, identical, 11 
stability in the first approximation, 864 cyclic permutations, 142 
Sturm, 45 invariance, cartesian tensor, 283 
superposition law, 728 iterative methods, 319 
Sylvester, of inertia, 318 linear, mappings, operators, homomorphisms, 658 
Takens, 890 orthogonal coordinates, 212 
Taylor, 442 orthogonal-similarity, eigenvalue problem, 317 
one variable, 471 principal axes 
several variables, 449 eigenvalue problem, 317 
Thales, 140, 142 quadratic form, 317 
total probability, 810 ensor, 283 
Tutte, 409 similarity, eigenvalue problem, 316, 317 
variation of constants, 862 transformation (coordinates, plane) 
Vieta, 44 parallel translation, 191 
Wedderburn, 361 rotation, coordinate system, 191 
Weierstrass, 468, 665 transformation (coordinates, space) 
one variable, 61 basic transformations, 231 
several variables, 125 parallel translation, 212 
Whitney, 890 rotation, coordinate system, 212 
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translation, rotation, etc., 231 
transformation (geometric), 234, 295 

affine, 230 

basic, 231 

bending, 237 

composition, 232 

contraction, 236 

coordinates, 229, 295 

deformations, 236 

mirror reflection, 230, 233 

object, 229 

properties, 230 

rotation (turning), 229, 234 

scaling, 230, 234 

shearing, 230, 234 

torsion, 236 

translation, 229, 234 
ransformation (geometric, 2D), 229 
ransformation (geometric, 3D), 233 

rotation of the object, 213 
ransformation (groups) 

affine, 2-dimensional space, 355 

covering, 336 
ransformation (plane) 

Cartesian ++ polar coordinates, 192, 453 
ransformation (space) 

Cartesian «» cylindrical coordinates, 212 

Cartesian + spherical coordinates, 212 
ransition 

matrix, probability, 827 

matrix, stochastic process, 826 

probability, stochastic, 826 
ransitivity, 29 
ranslation 

invariance, 283, 284 

invariance, cartesian tensor, 283 

primitive, 348 
ransport, network, 411 
ransposition law, 325 
rapezoid, 136 

Hermite's trapezoidal formula, 965 
rapezoidal 

formula, 964 

sum, 964 
raversing, 148 
ree, 407 

hight, 407 

ordered binary, 407 

regular binary, 407 

rooted, 407 

spanning, 407, 408 
minimum, 408 


triangle 
altitude, 133 
area, 194 


bisector, 133 

center of gravity, 133 
circumcircle, 133 
congruent, 134 
coordinates, 982 
equilateral, 133 
Euler, 164 

incircle, 133 


inequality, 182 
axioms of norm, 276 
complex numbers, 30 
metric space, 662 
norm, 669 
real numbers, 30 
unitary space, 673 
isosceles, 133 
median, 133 
orthocenter, 133 
Pascal, 13 
plane, 132 
area, 142, 144 
basic problems, 144 
Euclidean theorems, 142 
general, 142 
incircle, 144 
radius of circumcircle, 143 
right-angled, 133, 142 
tangent formula, 143 
polar, 164 
similar, 134 
spherical, 163 
basic problems, 169 
calculation, 169 
Euler, 164 
oblique, 171 
right-angled, 169 
riangular 
decomposition, 956 
matrix, 271 
lower, 27 
upper, 271 


riangulation, geodesy 
first fundamental problem, 148 
second fundamental problem, 149 
rigonometry 
plain, 142 
spherical, 160 
rihedral 
angle, 152, 164 
moving, 256, 257 
riple (ordered 3 tuple), 331 
riple integral, 527 
application, 532 
rochoid, 102 
runcation, measurement error, 848 
ruth 
compositions of propositions, 323 
function, 323, 324 
conjunction, 323 
disjunction, 323 
equivalence, 323 
implication, 323 
NAND function, 325 
negation, 323 
NOR function, 325 
table, 323 
true or false, 323 
value, 323 
turbulence, 858, 904 
Tutte theorem, 409 


riangularization, FEM (finite element method), 979 
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two lines, transformation, 207 
two-body problem, 574 
type, universal algebra, 394 


umbilical point, 267 
uncertainty 
absolute, 852 
fuzzy, 413 
quantum mechanical, 596 
relation, 596 
relative, 852 
uncertainty relation, 596 
ungula, cylinder, 156 


union 
fuzzy sets, 418, 419 
sets, 328 


units, imaginary 
generalized i, j, k, 290 
units, physical 
currently accepted for use with SI, 1056 
international system SI, 1055 
additional units, 1055 
basic units, 1055 
universal quantifier, 326 
universal substitution, 491 
urn model, 814 


vagueness, 413 
valence 
in-valence, 401 
out-valence, 40 
value 
expected, measurement, 813 
system (function of several variables), 118 
true, measurement, 850 
van der Pol differential equation, 895 
variable 
Boolean, 397 
bound variable, predicate logic, 326 
dependent, 48, 307 
free, predicate logic, 326 
independent, 48, 118, 307 
linguistic, 414 
propositional, 323 
random, 811, 825 
slack, 910 
variance, 813 
distribution, 813 
sample function, 831 
statistics, 834 
two-dimensional distribution, 840 
variation 
function, 61 
of constants, method, 554 
variation of constants 
differential equation, linear, 560 
theorem, 862 
variational calculus, 610 
auxiliary curve, 612 
comparable curve, 612 
Euler differential equation, 612 
side-condition, 610 
variational equation, 864, 881, 979 


equilibrium point points, 871 
variational problem, 610, 974 
brachistochrone problem, 611 
Dirichlet, 617 
extremal curves, 611 
first order, 610 
first variation, 619 
functional, 611 
higher order, 610 
higher order derivatives, 614 
isoperimetric, general, 611 
more general, 618 
numerical solution, 618 
direct method, 618 
finite element method, 619 
gradient method, 619 
Ritz method, 618 
parametric representation, 610, 615 
second variation, 619 
several unknown functions, 615 
side conditions, 614 
simple 
one variable, 611 
several variable, 617 
variety, 395 
vector 
absolute value, 181 
affine coordinates, 183 
axial, 181 
reflection behavior, 287 
base, 187 
base vector 
reciprocal, 186 
bound, 181 
Cartesian coordinates, 183 
column, 271 
components, 707 
conjugate, 932 
coordinates, 183 
Darboux vector, 261 
decomposition, 183 
diagram, oscillations, 85 
differentiation rules, 701 
direction in space, 181 
direction, vector triple, 181 
directional coefficient, 184 
expansion coefficient, 184 
field, 70 
free, 181 
left-singular, 321 
length, 190 
line, 708 
magnitude, 181 
matrix, 271 
metric coefficients, 186 
notion, 181 
null vector, 182 
polar, 18 
reflection behavior, 287 
pole, origin, 182 
position vector, 182 
complex-number plane, 34 
probability, 827 
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vector (continued) 


vec 


vec 


vec 


vec 
vec 


radius vector, 182 
complex-number plane, 34 

random, 830 

reciprocal, 187 

reciprocal basis vectors, 187 

residual, 313 

right-singular, 321 

row, 271 

scalar invariant, 716 

sliding, 181 

space, 655 

stochastic, 826 

unit, 181 

zero vector, 182 

or algebra, 181 

geometric application, 190 

notions and principles, 181 

or analysis, 701 

quaternions, 305 

or equation 

line, 189 

plane, 189 

space curve, 256, 259 

or equations, 188 

or field, 704, 857 

Cartesian coordinates, 706 

central, 705 

circular field, 705 

components, 707 

contour integral, 721 

coordinate definition, 706 

cylindrical, 705 

cylindrical coordinates, 706 

irectional derivative, 708 

ivergence, 712 

point-like source, 727 

rotation, 713 

sink, 712 

source, 712 

spherical, 705 

spherical coordinates, 706 


vector function, 701, 704 


vec 
vec 
vec 


vec 
vec 


vec 


derivative, 701 
ifferentiation, 701 
hodograph, 701 

linear, 286 

scalar variable, 701 
Taylor expansion, 702 
or gradient, 711 

or iteration, 319, 321 

or lattice, 660 
homomorphism, 661 

or potential, 727 

or product, 184 

hints, 273 

Lie algebra, 360 
quaternions, 291, 293 
representation in coordinates, 188 
or space, 365, 654 

all null sequences, 656 
bounded sequences, 656 


B(T), 656 


C([a, b]), 656 
C? (a, δ), 656 
complex, 655 


convergent sequences, 656 
Euclidean, 367 


R? (3 dimensional), 289 


R^ (4 dimensional), 289 
R" (n dimensional), 272 
finite sequence of numbers, 655 
F”, 655 
F(T), 656 
functions, 656 
infinite-dimensional, 366, 657 
LP. 697 
1’, 656 
n dimensional, 272, 366 
ordered by a cone, 659 
partial ordering, 659 
real, 365, 655 
5 of all sequences, 655 
sequences, 655 
vector subspace 
stable, 867, 872 
unstable, 867, 872 
vectors, 181, 271 
angle between, 190 
collinear, 182 
collinearity, 185 
commutative law, 273 
coplanar, 182 
cyclic permutation, 209 
double vector product, 185 
dyadic product, 273 
equality, 181 
Lagrange identity, 186 
inear combination, 183, 185 
mixed product, 185, 188 
Cartesian coordinates, 186 
orthogonality, 185 
roducts 
affine coordinates, 186 
Cartesian coordinates, 186 
roducts, properties, 184 
scalar product, 273 
Cartesian coordinates, 186 


sum, 182 

ensor product, 273 
riple product, 185 
vector product 


representation in coordinates, 188 


Venn diagram, 328 

verification, proof, 5 

vertex 
angle, 129 
degree, 401 
ellipse, 199 
graph, 401 
hyperbola, 201 
initial, 401 
isolated, 401 
level, 407 


representation in coordinates, 187 
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parabola, 204 
plane curve, 250 
sink, 411 
source, 411 
terminal, 401 
vertices, distance, 404 
vibration, differential equation 
bar, 580 
round membrane, 581 
string, 579 
Vieta, root theorem, 44 
Volterra integral equation, 621, 667 
first kind, 643 
second kind, 645 
volume 
barrel, 159 
block, 153 
cone, 157 
cube, 154 
cylinder, 156 
double integral, 527 
element, vector components, 719 
hollow cylinder, 157 
obelisk, 155 
parallelepipedon with vectors, 190 
polyhedron, 153 
prism, 153 
pyramid, 154 
rectangular parallelepiped, 153 
sphere, 158 
subset, 693 
tetrahedron, 217 
torus, 159 
triple integral, 532 
wedge, 155 
volume derivative, 710 
volume integral, 527 
volume scale, 116 


Walsh 
functions, 804 
systems, 804 
wave 
non-linear, 604 
parameter, 84 
plane, 801 
expansion in terms of spherical functions, 565 
wave equation 
n-dimensional, 590 
one-dimensional, 792 
Schroedinger equation, 593 
wave function 
classical problem, 590 
expasion of eigenfunctions, 595 
heat-conduction equation, 592 
Schroedinger equation, 592 
statistical interpretation, 594 
wave-particle duality, 595 
wavelet, 801 
Daubechies, 802 
Haar, 801 
Mexican hat, 801 
orthogonal, 802 


transformation, 800, 801 
discrete, 803 
discrete, Haar, 803 
dyadic, 802 
fast, 803 
Weber 
differential equation, 601 
function, 562 
wedge, 155 
Weibull distribution, 821 
Weierstrass 
approximation theorem, 665 
criterion, uniform convergence, 468 
orm of Euler Differential equation, 616 
unction, 765 
heorem, 468 
one variable, 61 
several variables, 125 
weight 
measurement, 854 
of orthogonality, 570 
statistical, 813 
weighting factor, statistical, 850 
Whitney, theorem, 890 
witch of Agnesi, 95 
word, 662 
work (mechanics) 
general, 522 
special, 504 
Wronskian determinant, 553, 862 


Young scheme, 345 


zenith, 144 
distance, 144 

zero divisor, 361 

zero matrix, 269 

zero point, 1 

zero-point translational energy, 598 

zero-point vibration energy, 603 

Z-transformation, 794 
applications, 798 
convolution, 796 
damping, 796 
definition, 794 
difference, 796 
differentiation, 796 
integration, 796 
inverse, 797 
original sequence, 794 
rules of calculation, 795 
summation, 795 
transform, 794 
translation, 795 

Z-transformable, 794 


MATHEMATICAL SYMBOLS 


Relational Symbols 
equal to 7 approximately equal to < less than or equal to 
= identically equal to < less than > greater than or equal to 
— equal to by definition > greater than # unequal to, different from 
< much less than >> much greater than E corresponding to 
< partial order relation > partial order relation 
Greek Alphabet 
Aa Alpha BB Beta Γη Gamma A Delta Es Epsilon Ζς Zeta 
Hn Eta O09 Theta Lt Tota Kk Kappa AA Lambda Mpu Mu 
Nv Nu BE Xi Oo Omicron In Pi Pp Rho Xo Sigma 
Tr Tau Tv Upsilon p Phi Xx Chi V Psi Nw Omega 
Constants 
const constant amount (constant) C = 0.57722... Euler constant 
T = 3.14159... ratio of the perimeter of the circle to e = 2.71828... base of the natural logarithms 
the diameter 
Algebra 

A, B propositions 2A, Α negation of the proposition A 
AAB, N conjunction, logical AND AVB,U disjunction, logical OR 
A=>B implication, IF A THEN B ASB equivalence, A IF AND ONLY IF B 
A, B,C,... sets IN set of natural numbers 
A closure of the set A or complement of Z set of the integers 

A with respect to a universal set Q set of the rational numbers 
ACB A is a proper subset of B R set of the real numbers 
ACB A is a subset of B R4 set of the positive real numbers 
A\B difference of two sets R” n-dimensional Euclidean vector space 
AAB symmetric difference ο set of the complex numbers 
AxB Cartesian product RoS relation product 
TEA x is an element of A r¢A r is not an element of A 
card A cardinal number of the set A 0 empty set, zero set 
ANB intersection of two sets Niz Αι intersection of n sets A; 
AUB union of two sets Uia Αι union of n sets A; 
Vr for all elements x da there exists an element x 
{x € X :p(x)} subset of all z from X {x:p(x)}, set of all x with the 

of the property p(x) {x|p(x)} property p(x) 
T:X 3 Y mapping T from the space X = isomorphy of groups 

into the space Y ^R equivalence relation 
e residue class addition © residue class multiplication 


H = Hı § Hə orthogonal decomposition of space H A & B Kronecker product 


supp support iff if and only if 
supM supremum: least upper bound of the non-empty set M (M C R) bounded above 


inf M infimum: greatest lower bound of the non-empty set M (M C R) bounded below 
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B Mathematical Symbols 


a, b] closed interval, i.e., {ce R:a<ax<b} 
(a,b), ]a,b[ open interval, i.e., {ce R:a<ax<b} 
(a,b],]a,b] interval open from left, i.e, {ce R:a<x<b} 


a,b),{a,b[ interval open from right, ie, {2 € R:a< x< b} 


signa sign of the number a, e.g., sign (+3) = +1, sign0 = 0 
a | absolute value of the number a 
[a] integer, greater or equal to a (compare [x], p. C) 


a to the power m, a to the m-th 

να square root of a 

Va n-th root of a 

ορια. logarithm of the number a to the base b, e.g., log; 32 = 5 
oga decimal logarithm (base 10) of the number a, e.g., 1g100 = 2 


na natural logarithm (base €) of the number a, e.g., ne = 1 
a|b a is a divisor of b, a devides b, the ratio of a to b 
a fb a is not a divisor of b 
a = b mod m, a = b(m) a is congruent to b modulo m, i.e., b — a is divisible by m 
g.C.d.(a1, a2, ..., απ) greatest common divisor of a1, d2,...,@n 
lc.m.(a1,a2, ..., αμ) least common multiple of a1,a2,...,a; 
n 
( 4 binomial coefficient, n choose k 
a 
(2) Legendre symbol 
m!=1-2-3-...-n factorial, e.g., 61 = 1-2-3-4-5-6=720; specially: 0! = 1! = 1 
(2n)!! = 2-4-6..... (2n) 22". n5; in particular: 0! = 1!!=1 
(2n 4- 1) 21: 3-5... (2n 1) 
A = (aij) matrix A with elements a;; AT transposed matrix 
AC inverse matrix AH adjoint matrix 
E-—(0j) unit matrix 0 zero matrix 
rank rank of a matrix trace trace of a matrix 
det A, D determinant of the square matrix A 
δι] Kronecker symbol: δὲ; = 0 for i 4 j and δι; = 1 for i = j 
a column vector in R” 
a? unit vector in the direction of (parallel to) a 


mm 
s τι!» 
οι 


norm of a 


vectors in IR? 


i,j, k 6, Ε.Ε. basis vectors (orthonormed) of the Cartesian coordinate system 
Gy, Qy, Az coordinates (components) of the vector à 

ja absolute value, length of the vector ἃ 

aa multiplication of a vector by a scalar 

a- b, ab, (ab) scalar product, dot product 

axb, [ab] vector product, cross product 

abé = ᾱ- (b xc) parallelepipedal product, mixed product (triple scalar product) 
0. 0 zero vector 

T tensor 

G = (V, E) graph with the set of vertices V and the set of edges E 


Mathematical Symbols C 


Geometry 

d orthogonal (perpendicular) | parallel 

equal and parallel ~ similar, e.g., AABC ~ ADEF; pro- 

portional 
A triangle x angle, e.g., X ABC 
s arc segment, e.g., AB the arc between A and B rad radian 
2 degree 
f minute > as measure of angle and circular arc, e.g., 32° 14' 11.5" 
4 second 
gon measure in grades (360° = 400 gon, see p. 131 and table3.5, p. 146) 
AB the line segment between A and B 
— 
AB the directed line segment from A to B, the vector from A to B 
Complex Numbers 
i (sometimes j) imaginary unit (i? = —1) 1 imaginary unit in computer algebra 
Re (z) real part of the number 2 Im (z) imaginary part of the number z 
E absolute value of z argz argument of the number z 
Z or z* complex conjugate of z, e.g., z = 2+ 3i, Lnz logarithm (natural) of a complex num- 
z=2-3i ber z 
Trigonometric Functions, Hyperbolic Functions 
sin sine cos cosine 
tan tangent cot cotangent 
sec secant cosec cosecant 
arcsin principal value of arc sine (sine inverse) arccos principal value of arc cosine (cosine inverse) 
arctan principal value of arc tangent (tangent inverse) arccot principal value of arc cotangent (cotangent in- 
verse) 
arcsec principal value of arc secant (secant inverse) arccosec principal value of arc cosecant (cosecant in- 
verse) 
sinh hyperbolic sine cosh hyperbolic cosine 
tanh hyperbolic tangent coth hyperbolic cotangent 
sech hyperbolic secant cosech hyperbolic cosecant 
Arsinh — area-hyperbolic sine Arcosh area-hyperbolic cosine 
Artanh —area-hyperbolic tangent Arcoth area-hyperbolic cotangent 
Arsech — area-hyperbolic secant Arcosech area-hyperbolic cosecant 
Analysis 


t=1 

II. Wis 

fO ¢0) 

A 

d 

d d d" 


A is the limit of the sequence (xn). We also write £n > A as n — oo; 
e.g., lim (1+ i =e 
"noo n . 
B is the limit of the function f(x) as x tends to a 
greatest integer less or equal to x (compare p. B) 


Landau symbol “small o" means: f(x)/g(x) + 0 as z > a 
Landau symbol “big O” means: f(x)/g(x) + C (C = const, C #0) asx > a 


sum of n terms for i equals 1 to n 


product of n factors for i equals 1 to n 


notation for a function, e.g., y = f(x), 
difference or increment, e.g., Ax (delta 
differential, e.g., dx (differential of x) 


determination of the first, second, ..., n-th derivative with respect to x 


D Mathematical Symbols 


f), f" G), f" (2). 
γω)... f (v) 


jd. y 0 

ð 0 93 

Ox’ Oy’? 0x?" 7 
92 

OrOy 


D 
grad 
div 
rot 
ὃν ὃν ὃν 
ο ὅτι ὅτ, δε 
Y δι. dy! 2 
93 93 o? 
mE Oy? 02 
op 
δα 
b 


first, second,...,n-th derivative of the function f(a) or of the function y 


determination of the first, second, ..., n-th partial derivative 


determination of the second partial derivative first with respect to 2, 
then with respect to y 


first, second, ... partial derivative of function f(x, y) 


differential operator, e.g., Dy = y', D?y = y" 
gradient of a scalar field (grad y = Vy) 
divergence of a vector field (div v = V- V) 


rotation or curl of a vector field (rot ¥ = V x v) 


nabla operator, here in Cartesian coordinates (also called the 
Hamiltonian differential operator, not to be confused with the Hamil- 
ton operator in quantum mechanics) 


Laplace operator 


directional derivative, i.e., derivative of a 

op . 
z5 —&- grado 
δᾶ 


scalar field y into the direction a: 


definite integral of the function f between the limits a and b 


line integral of the first kind with respect to the space curve C with 
arclength s 


©) 
f(x,y, z) ds integral along a closed curve (circulatory integral) 
(C) 
Ez y)dS — / f(a, y) dx dy double integral over a planar region § 
(S) (S) 
J f(z,y,z)dS = i f(a,y,z)dS Mr integral of the first kind over a spatial surface S (see (8.151b), 
(S) Pi 


(8) 
/ f(a.y,z)dV = i f(x,y, 2)dadydz triple integral or volume integral over the volume V 
(V) (V) 


(8) (S) 
V(z)- d$ = dj V£) -dS 

ὦ E surface integrals over a closed surface in vector analysis 
VG) x d8 = ἥνω κ d$ 

(S) (S) 

A — max! expression Α is to be maximized, similarly min!, extreme! 


A = max expression A is maximal, similarly min, extreme. 


